Abstract
This paper presents a set of quality measures for choosing the best expansion of an acronym that is not defined in the Web page where it occurs. The measures use statistics computed on Web pages to select the appropriate expansion. They are context-based and rely on the assumption that the most frequent words in the page are semantically or lexically related to the acronym's expansion.
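The idea of ranking candidate expansions against the page's most frequent words can be illustrated with a minimal sketch. It is not the paper's actual measure: the Dice coefficient is used here only as one plausible association statistic, the hit counts are invented toy values standing in for Web statistics, and all names (`top_context_words`, `dice`, `rank_expansions`, `hits`) are hypothetical.

```python
import re
from collections import Counter

# Small illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "on", "by"}

def top_context_words(page_text, k=2):
    """Return the k most frequent non-stopword tokens of the page."""
    tokens = re.findall(r"[a-z]+", page_text.lower())
    counts = Counter(t for t in tokens if t not in STOPWORDS)
    return [word for word, _ in counts.most_common(k)]

def dice(hits_x, hits_y, hits_xy):
    """Dice coefficient from three hit counts; 0.0 when there is no evidence."""
    total = hits_x + hits_y
    return 2.0 * hits_xy / total if total else 0.0

def rank_expansions(candidates, context, hits):
    """Rank candidates by their mean Dice association with the context words."""
    scores = {}
    for exp in candidates:
        vals = [
            dice(hits.get(exp, 0), hits.get(w, 0), hits.get((exp, w), 0))
            for w in context
        ]
        scores[exp] = sum(vals) / len(vals) if vals else 0.0
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Toy page in which the acronym "ATM" is never defined.
page = ("The ATM network switches each cell quickly. A cell crosses the "
        "network in microseconds, and the network scales.")

# Invented hit counts: single terms, and (expansion, context word) pairs.
hits = {
    "asynchronous transfer mode": 1000,
    "automated teller machine": 2000,
    "network": 50000,
    "cell": 30000,
    ("asynchronous transfer mode", "network"): 600,
    ("asynchronous transfer mode", "cell"): 400,
    ("automated teller machine", "network"): 100,
    ("automated teller machine", "cell"): 10,
}

context = top_context_words(page, k=2)
ranked = rank_expansions(
    ["automated teller machine", "asynchronous transfer mode"], context, hits
)
print(context)
print(ranked[0][0])
```

With these toy counts the networking expansion ranks first, because it co-occurs far more often with the page's dominant words. In a real setting the `hits` table would be replaced by counts gathered from Web pages, and the choice of association statistic would matter.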
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Roche, M., Prince, V. (2007). AcroDef: A Quality Measure for Discriminating Expansions of Ambiguous Acronyms. In: Kokinov, B., Richardson, D.C., Roth-Berghofer, T.R., Vieu, L. (eds) Modeling and Using Context. CONTEXT 2007. Lecture Notes in Computer Science, vol 4635. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74255-5_31
DOI: https://doi.org/10.1007/978-3-540-74255-5_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74254-8
Online ISBN: 978-3-540-74255-5
eBook Packages: Computer Science, Computer Science (R0)