Combining Background Knowledge and Learned Topics
Topics in Cognitive Science 3 (1):18-47 (2011)
Abstract
Statistical topic models provide a general data - driven framework for automated discovery of high-level knowledge from large collections of text documents. Although topic models can potentially discover a broad range of themes in a data set, the interpretability of the learned topics is not always ideal. Human-defined concepts, however, tend to be semantically richer due to careful selection of words that define the concepts, but they may not span the themes in a data set exhaustively. In this study, we review a new probabilistic framework for combining a hierarchy of human-defined semantic concepts with a statistical topic model to seek the best of both worlds. Results indicate that this combination leads to systematic improvements in generalization performance as well as enabling new techniques for inferring and visualizing the content of a documentDOI
10.1111/j.1756-8765.2010.01097.x
My notes
Similar books and articles
Wofür sprechen die daten?Thomas Bartelborth - 2004 - Journal for General Philosophy of Science / Zeitschrift für Allgemeine Wissenschaftstheorie 35 (1):13-40.
Combining argumentation and bayesian nets for breast cancer prognosis.Matt Williams & Jon Williamson - 2006 - Journal of Logic, Language and Information 15 (1-2):155-178.
Computational Methods to Extract Meaning From Text and Advance Theories of Human Cognition.Danielle S. McNamara - 2011 - Topics in Cognitive Science 3 (1):3-17.
Automated Search for Causal Relations - Theory and Practice.Peter Spirtes, Clark Glymour & Richard Scheines - unknown
From implicit skills to explicit knowledge: a bottom‐up model of skill learning.Edward Merrillb & Todd Petersonb - 2001 - Cognitive Science 25 (2):203-244.
Analytics
Added to PP
2010-08-11
Downloads
111 (#113,859)
6 months
1 (#450,425)
2010-08-11
Downloads
111 (#113,859)
6 months
1 (#450,425)
Historical graph of downloads
Citations of this work
Computational Methods to Extract Meaning From Text and Advance Theories of Human Cognition.Danielle S. McNamara - 2011 - Topics in Cognitive Science 3 (1):3-17.
References found in this work
A solution to Plato's problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge.Thomas K. Landauer & Susan T. Dumais - 1997 - Psychological Review 104 (2):211-240.
Topics in semantic representation.Thomas L. Griffiths, Mark Steyvers & Joshua B. Tenenbaum - 2007 - Psychological Review 114 (2):211-244.
Representing word meaning and order information in a composite holographic lexicon.Michael N. Jones & Douglas J. K. Mewhort - 2007 - Psychological Review 114 (1):1-37.
Probabilistic models of language processing and acquisition.Nick Chater & Christopher D. Manning - 2006 - Trends in Cognitive Sciences 10 (7):335–344.
The role of embodied intention in early lexical acquisition.Chen Yu, Dana H. Ballard & Richard N. Aslin - 2005 - Cognitive Science 29 (6):961-1005.