AcroDef: A Quality Measure for Discriminating Expansions of Ambiguous Acronyms

Roche, Mathieu; Prince, Violaine

doi:10.1007/978-3-540-74255-5_31

Mathieu Roche¹ &
Violaine Prince¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4635))

Included in the following conference series:

International and Interdisciplinary Conference on Modeling and Using Context

1404 Accesses
6 Citations

Abstract

This paper presents a set of quality measures to determine the choice of the best expansion for an acronym not defined in the Web page. The method uses statistics computed on Web pages to determine the appropriate expansion. Measures are context-based and rely on the assumption that the most frequent words in the page are related semantically or lexically to the acronym expansion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Azé, J., Kodratoff, Y.: A study of the effect of noisy data in rule extraction systems. In: Proceedings of EMCSR 2002, vol. 2, pp. 781–786 (2002)
Google Scholar
Bourigault, D., Jacquemin, C.: Term extraction + term clustering: An integrated platform for computer-aided terminology. In: Proceedings of the European Chapter of the Association for Computational Linguistics, pp. 15–22 (1999)
Google Scholar
Brill, E.: Some advances in transformation-based part of speech tagging. In: AAAI, vol. 1, pp. 722–727 (1994)
Google Scholar
Chang, J., Schtze, H., Altman, R.: Creating an online dictionary of abbreviations from medline. Journal of the American Medical Informatics Association 9, 612–620 (2002)
Article Google Scholar
Chauché, J.: Détermination sémantique en analyse structurelle: une expérience basée sur une définition de distance. In: TA Information, pp. 17–24 (1990)
Google Scholar
Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. Computational Linguistics 16, 22–29 (1990)
Google Scholar
Daille, B.: Approche mixte pour l’extraction automatique de terminologie: statistiques lexicales et filtres linguistiques. PhD thesis, Université Paris 7 (1994)
Google Scholar
Daille, B.: Study and Implementation of Combined Techniques for Automatic Extraction of Terminology. In: The Balancing Act: Combining Symbolic and Statistical Approaches to Language, pp. 49–66. MIT Press, Cambridge (1996)
Google Scholar
Jacquemin, C.: Variation terminologique: Reconnaissance et acquisition automatiques de termes et de leurs variantes en corpus. In: Mémoire d’Habilitation à Diriger des Recherches en informatique fondamentale, Université de Nantes (1997)
Google Scholar
Lallich, S., Teytaud, O.: Evaluation et validation des règles d’association. Numéro spécial Mesures de qualité pour la fouille des données, Revue des Nouvelles Technologies de l’Information (RNTI), RNTI-E-1 pp. 193–218 (2004)
Google Scholar
Larkey, L.S., Ogilvie, P., Price, M.A., Tamilio, B.: Acrophile: An automated acronym extractor and server. In: Proceedings of the Fifth ACM International Conference on Digital Libraries, pp. 205–214. ACM Press, New York (2000)
Chapter Google Scholar
Petrovic, S., Snajder, J., Dalbelo-Basic, B., Kolar, M.: Comparison of collocation extraction measures for document indexing. In: Proc of Information Technology Interfaces (ITI), pp. 451–456 (2006)
Google Scholar
Roche, M., Azé, J., Kodratoff, Y., Sebag, M.: Learning interestingness measures in terminology extraction. a roc-based approach. In: Proceedings of ROC Analysis in AI Workshop (ECAI 2004), pp. 81–88 (2004)
Google Scholar
Roche, M., Kodratoff, Y.: Pruning Terminology Extracted from a Specialized Corpus for CV Ontology Acquisition. In: Meersman, R., Tari, Z. (eds.) On the Move to Meaningful Internet Systems 2006: CoopIS, DOA, GADA, and ODBASE. LNCS, vol. 4276, pp. 1107–1116. Springer, Heidelberg (2006)
Chapter Google Scholar
Smadja, F., McKeown, K.R., Hatzivassiloglou, V.: Translating collocations for bilingual lexicons: A statistical approach. Computational Linguistics 22(1), 1–38 (1996)
Google Scholar
Thanopoulos, A., Fakotakis, N., Kokkianakis, G.: Comparative Evaluation of Collocation Extraction Metrics. In: Proceedings of LREC 2002, pp. 620–625 (2002)
Google Scholar
Turney, P.D.: Mining the Web for synonyms: PMI–IR versus LSA on TOEFL. In: Flach, P.A., De Raedt, L. (eds.) ECML 2001. LNCS (LNAI), vol. 2167, pp. 491–502. Springer, Heidelberg (2001)
Google Scholar
Vivaldi, J., Màrquez, L., Rodríguez, H.: Improving term extraction by system combination using boosting. In: Proceedings of the 12th European Conference on Machine Learning (ECML), pp. 515–526 (2001)
Google Scholar
Yeates, S.: Automatic extraction of acronyms from text. In: New Zealand Computer Science Research Students’ Conference, pp. 117–124 (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

LIRMM - UMR 5506, CNRS, Univ. Montpellier 2, 34392 Montpellier Cedex 5, France
Mathieu Roche & Violaine Prince

Authors

Mathieu Roche
View author publications
You can also search for this author in PubMed Google Scholar
Violaine Prince
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Boicho Kokinov Daniel C. Richardson Thomas R. Roth-Berghofer Laure Vieu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Roche, M., Prince, V. (2007). AcroDef: A Quality Measure for Discriminating Expansions of Ambiguous Acronyms. In: Kokinov, B., Richardson, D.C., Roth-Berghofer, T.R., Vieu, L. (eds) Modeling and Using Context. CONTEXT 2007. Lecture Notes in Computer Science(), vol 4635. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74255-5_31

Download citation

DOI: https://doi.org/10.1007/978-3-540-74255-5_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74254-8
Online ISBN: 978-3-540-74255-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics