
Table 4: Some examples of correctly decoded jargon terms via their contextual synonyms generated by RoBERTa.
Jargon Term Synonyms Meaning per ‘Jargon Buster’ document
pav pavilion, pavilion bar Formally, the Pavilion Bar. The University’s student bar,
managed by the Sport Union, located at the eastern end
of College Park.
tangent climate entrepreneurship,
social data science
The ideas workspace, providing courses and events cen-
tred on business and innovation.
forum cafe, restaurant College-operated eatery located in the Business School.
Here you’ll find hot and cold lunch offerings, barista cof-
fee, and - often - pop-ups run by local businesses.
change of less than 0.001 (where p > 0.05). This re-
sult is disappointing, but not entirely unexpected, as
the ENTRP-SRCH is small, with limited query diversity
and a scarcity of jargon rich Q-D pairs. In spite of this,
the qualitative analysis of the generated synonyms re-
vealed promising results for recall as a foundation for
semantic search.
Future work plans will employ additional inde-
pendent datasets to address the limitations of our
ENTRP-SRCH dataset, which centres on just twenty of
the most frequently submitted queries. The use of
click-through data in place of human judgements for
Q-D pair annotation would facilitate larger and more
diverse ES datasets that are better able to capture the
nuances of enterprise specific terminology. Finally, it
would be interesting to perform a longitudinal study
to gauge the JARGES impact on semantic and ex-
ploratory search, based on query expansion and recall
on a real-world ES system.
REFERENCES
Belfathi, A., Gallina, Y., Hernandez, N., Dufour, R., and
Monceaux, L. (2024). Language Model Adaptation
to Specialized Domains through Selective Masking
based on Genre and Topical Characteristics. arXiv,
2402.12036.
Bentley, J. (2011). Mind the Enterprise Search Gap: Smart-
logic Sponsor MindMetre Research Report.
Burges, C., Shaked, T., Renshaw, E., Lazier, A., Deeds, M.,
Hamilton, N., and Hullender, G. (2005). Learning to
rank using gradient descent. In ICML 2005 - Proceed-
ings of the 22nd International Conference on Machine
Learning, pages 89–96.
Cleverley, P. H. and Burnett, S. (2019). Enterprise search
and discovery capability: The factors and generative
mechanisms for user satisfaction:. Journal of Infor-
mation Science, 45(1):29–52.
Craswell, N., Cambridge, M., and Soboroff, I. (2005).
Overview of the TREC-2005 Enterprise Track. In
TREC 2005 conference notebook, pages 199–205.
Daly, C. (2023). Learning to Rank: Performance and
Practical Barriers to Deployment in Enterprise Search.
In 3rd Asia Conference on Information Engineering
(ACIE), pages 21–26. IEEE.
Daly, C. and Hederman, L. (2023). Enterprise Search:
Learning to Rank with Click-Through Data as a Sur-
rogate for Human Relevance Judgements. In 15th In-
ternational Conference on Knowledge Discovery and
Information Retrieval (KDIR), pages 240–247.
Devlin, J., Chang, M.-W., Lee, K., Google, K. T., and
Language, A. I. (2019). BERT: Pre-training of Deep
Bidirectional Transformers for Language Understand-
ing. Proceedings of the 2019 Conference of the North,
pages 4171–4186.
Ganguly, D., Roy, D., Mitra, M., and Jones, G. J. (2015).
A word embedding based generalized language model
for information retrieval. SIGIR 2015 - Proceedings
of the 38th International ACM SIGIR Conference on
Research and Development in Information Retrieval,
pages 795–798.
Google (2025). Search and SEO Blog — Google Search
Central — Google Search Central Blog — Google for
Developers.
Hawking, D. (2004). Challenges in Enterprise Search. In
Proceedings of the 15th Australasian Database Con-
ference - Volume 27, ADC ’04, page 15–24, AUS.
Australian Computer Society, Inc.
Jones, K. S. (1972). A statistical interpretation of term
specificity and its application in retrieval. Journal of
Documentation, 28(1):11–21.
Li, H. (2011). A Short Introduction to Learning to Rank.
IEICE Transactions, 94-D:1854–1862.
Li, L., Deng, H., Dong, A., Chang, Y., Baeza-Yates, R.,
and Zha, H. (2017). Exploring query auto-completion
and click logs for contextual-aware web search and
query suggestion. 26th International World Wide Web
Conference, WWW 2017, pages 539–548.
Liu, T.-Y. (2010). Learning to Rank for Information Re-
trieval, volume 3. Springer Berlin Heidelberg, 2nd
edition.
Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D.,
Levy, O., Lewis, M., Zettlemoyer, L., and Stoyanov,
V. (2019). RoBERTa: A Robustly Optimized BERT
Pretraining Approach. arXiv, 1(abs/1907.11692).
Pedregosa, F., Michel, V., Grisel, O., Blondel, M., Pretten-
hofer, P., Weiss, R., Vanderplas, J., Cournapeau, D.,
KDIR 2025 - 17th International Conference on Knowledge Discovery and Information Retrieval
362