Results for 'Christopher D. Manning'

1000+ found
  1. An Introduction to Information Retrieval. Christopher D. Manning - unknown
    Contents: 1 Boolean retrieval; 2 The term vocabulary and postings lists; 3 Dictionaries and tolerant retrieval; 4 Index construction; 5 Index compression; 6 Scoring, term weighting and the vector space model; 7 Computing scores in a complete search system; 8 Evaluation in information retrieval; 9 Relevance feedback and query expansion; 10 XML retrieval; 11 Probabilistic information retrieval; 12 Language models for information retrieval; 13 Text classification and Naive Bayes (...)
    18 citations
  2. Natural Logic for Textual Inference. Christopher D. Manning - unknown
    This paper presents the first use of a computational model of natural logic—a system of logical inference which operates over natural language—for textual inference. Most current approaches to the PASCAL RTE textual inference task achieve robustness by sacrificing semantic precision; while broadly effective, they are easily confounded by ubiquitous inferences involving monotonicity. At the other extreme, systems which rely on first-order logic and theorem proving are precise, but excessively brittle. This work aims at a middle way. Our system finds (...)
    6 citations
  3. Ergativity: Argument Structure and Grammatical Relations. Christopher D. Manning - unknown
    I wish to present a codification of syntactic approaches to dealing with ergative languages and argue for the correctness of one particular approach, which I will call the Inverse Grammatical Relations hypothesis. I presume familiarity with the term 'ergativity', but, briefly, many languages have ergative case marking, such as Burushaski in (1), in contrast to the accusative case marking of Latin in (2). More generally, if we follow Dixon (1979) and use A to mark the agent-like argument of (...)
    5 citations
  4. An extended model of natural logic. Christopher D. Manning & Bill MacCartney - unknown
    We propose a model of natural language inference which identifies valid inferences by their lexical and syntactic features, without full semantic interpretation. We extend past work in natural logic, which has focused on semantic containment and monotonicity, by incorporating both semantic exclusion and implicativity. Our model decomposes an inference problem into a sequence of atomic edits linking premise to hypothesis; predicts a lexical semantic relation for each edit; propagates these relations upward through a semantic composition tree according to properties of (...)
    5 citations
  5. Learning to distinguish valid textual entailments. Christopher D. Manning & Daniel Cer - unknown
    This paper proposes a new architecture for textual inference in which finding a good alignment is separated from evaluating entailment. Current approaches to semantic inference in question answering and textual entailment have approximated the entailment problem as that of computing the best alignment of the hypothesis to the text, using a locally decomposable matching score. While this formulation is adequate for representing local (word-level) phenomena such as synonymy, it is incapable of representing global interactions, such as that between verb negation (...)
    4 citations
  6. Modeling Semantic Containment and Exclusion in Natural Language Inference. Christopher D. Manning - unknown
    We propose an approach to natural language inference based on a model of natural logic, which identifies valid inferences by their lexical and syntactic features, without full semantic interpretation. We greatly extend past work in natural logic, which has focused solely on semantic containment and monotonicity, to incorporate both semantic exclusion and implicativity. Our system decomposes an inference problem into a sequence of atomic edits linking premise to hypothesis; predicts a lexical entailment relation for each edit using a statistical classifier; (...)
    3 citations
  7. Robust Textual Inference via Graph Matching. Christopher D. Manning - unknown
    We present a system for deciding whether a given sentence can be inferred from text. Each sentence is represented as a directed graph (extracted from a dependency parser) in which the nodes represent words or phrases, and the links represent syntactic and semantic relationships. We develop a learned graph matching model to approximate entailment by the amount of the sentence’s semantic content which is contained in the text. We present results on the Recognizing Textual Entailment dataset (Dagan et al., 2005), (...)
    3 citations
  8. Part-of-Speech Tagging from 97% to 100%: Is It Time for Some Linguistics? Christopher D. Manning - unknown
    I examine what would be necessary to move part-of-speech tagging performance from its current level of about 97.3% token accuracy (56% sentence accuracy) to close to 100% accuracy. I suggest that it must still be possible to greatly increase tagging performance and examine some useful improvements that have recently been made to the Stanford Part-of-Speech Tagger. However, an error analysis of some of the remaining errors suggests that there is limited further mileage to be had either from better machine learning (...)
    2 citations
  9. Automatic Acquisition of a Large Subcategorization Dictionary From Corpora. Christopher D. Manning - unknown
    This paper presents a new method for producing a dictionary of subcategorization frames from unlabelled text corpora. It is shown that statistical filtering of the results of a finite state parser running on the output of a stochastic tagger produces high quality results, despite the error rates of the tagger and the parser. Further, it is argued that this method can be used to learn all subcategorization frames, whereas previous methods are not extensible to a general solution to the problem.
    2 citations
  10. Efficient, Feature-based, Conditional Random Field Parsing. Christopher D. Manning - unknown
    Discriminative feature-based methods are widely used in natural language processing, but sentence parsing is still dominated by generative methods. While prior feature-based dynamic programming parsers have restricted training and evaluation to artificially short sentences, we present the first general, feature-rich discriminative parser, based on a conditional random field model, which has been successfully scaled to the full WSJ parsing data. Our efficiency is primarily due to the use of stochastic optimization techniques, as well as parallelization and chart prefiltering. On WSJ15, (...)
    2 citations
  11. Soft Constraints Mirror Hard Constraints: Voice and Person in English and Lummi. Christopher D. Manning - unknown
    The same categorical phenomena which are attributed to hard grammatical constraints in some languages continue to show up as statistical preferences in other languages, motivating a grammatical model that can account for soft constraints. The effects of a hierarchy of person (1st, 2nd, 3rd) on grammar are categorical in some languages, most famously in languages with inverse systems, but also in languages with person restrictions on passivization. In Lummi, for example, the person (...)
    2 citations
  12. The Lexical Integrity of Japanese Causatives. Christopher D. Manning & Ivan A. Sag - unknown
    Grammatical theory has long wrestled with the fact that causative constructions exhibit properties of both single words and complex phrases. However, as Paul Kiparsky has observed, the distribution of such properties of causatives is not arbitrary: ‘construal’ phenomena such as honorification, anaphor and pronominal binding, and quantifier ‘floating’ typically behave as they would if causatives were syntactically complex, embedding constructions; whereas case marking, agreement and word order phenomena all point to the analysis of causatives as single lexical items. Although an (...)
    2 citations
  13. Argument structure as a locus for binding theory. Christopher D. Manning - unknown
    The correct locus (or loci) of binding theory has been a matter of much discussion. Theories can be seen as varying along at least two dimensions. The first is whether binding theory is configurationally determined (that is, the theory exploits the geometry of a phrase marker, appealing to such purely structural notions as c-command and government) or whether the theory depends rather on examining the relations between items selected by a predicate (where by selection I am intending to cover (...)
  14. Disambiguating “DE” for Chinese-English Machine Translation. Christopher D. Manning - unknown
    Linking constructions involving 的 (DE) are ubiquitous in Chinese, and can be translated into English in many different ways. This is a major source of machine translation error, even when syntax-sensitive translation models are used. This paper explores how getting more information about the syntactic, semantic, and discourse context of uses of 的 (DE) can facilitate producing an appropriate English translation strategy. We describe a finer-grained classification of 的 (DE) constructions in Chinese NPs, construct a corpus of annotated examples, and (...)
  15. A Phrase-Based Alignment Model for Natural Language Inference. Christopher D. Manning - unknown
    The alignment problem—establishing links between corresponding phrases in two related sentences—is as important in natural language inference (NLI) as it is in machine translation (MT). But the tools and techniques of MT alignment do not readily transfer to NLI, where one cannot assume semantic equivalence, and for which large volumes of bitext are lacking. We present a new NLI aligner, the MANLI system, designed to address these challenges. It uses a phrase-based alignment representation, exploits external lexical resources, and capitalizes on (...)
    1 citation
  16. Romance Complex Predicates: In defence of the right-branching structure. Christopher D. Manning - unknown
    Abeillé and Godard (1994) seek to show that the rightward branching analysis of French tense auxiliaries shown in (1b), that I argued for in Manning (1992) and which is widely adopted in general, is wrong, and that rather we should adopt a flat analysis for this construction as shown in (1c), and they show how such an analysis can be realized within HPSG (Pollard and Sag 1994).
  17. Stanford University. Christopher D. Manning - unknown
    Technology for local textual inference is central to producing a next generation of intelligent yet robust human language processing systems. One can think of it as Information Retrieval++. It is needed for a search on "male fertility may be affected by use of cell phones" to match a document saying "Startling new research into mobile phones suggests they can reduce a man’s sperm count up to 30%", despite the fact that the only word overlap is "phones". But textual inference is (...)
  18. An Effective Two-Stage Model for Exploiting Non-Local Dependencies in Named Entity Recognition. Christopher D. Manning - unknown
    This paper shows that a simple two-stage approach to handle non-local dependencies in Named Entity Recognition (NER) can outperform existing approaches that handle non-local dependencies, while being much more computationally efficient. NER systems typically use sequence models for tractable inference, but this makes them unable to capture the long distance structure present in text. We use a Conbel.
  19. A Simple and Effective Hierarchical Phrase Reordering Model. Christopher D. Manning - unknown
    adjacent phrases, but they typically lack the ability to perform the kind of long-distance reorderings possible with syntax-based systems. In this paper, we present a novel hierarchical phrase reordering model aimed at improving non-local reorderings, which seamlessly integrates with a standard phrase-based system with little loss of computational efficiency. We show that this model can successfully handle the key examples often used to motivate syntax-based systems, such as the rotation of a prepositional phrase around a noun phrase. We contrast our (...)
    1 citation
  20. Computations. Christopher D. Manning - unknown
    We present a novel algorithm for the fast computation of PageRank, a hyperlink-based estimate of the “importance” of Web pages. The original PageRank algorithm uses the Power Method to compute successive iterates that converge to the principal eigenvector of the Markov matrix representing the Web link graph. The algorithm presented here, called Quadratic Extrapolation, accelerates the convergence of the Power Method by periodically subtracting off estimates of the nonprincipal eigenvectors from the current iterate of the Power Method. In Quadratic Extrapolation, (...)
  21. Clustering the Tagged Web. Christopher D. Manning - unknown
    Automatically clustering web pages into semantic groups promises improved search and browsing on the web. In this paper, we demonstrate how user-generated tags from large-scale social bookmarking websites such as del.icio.us can be used as a complementary data source to page text and anchor text for improving automatic clustering of web pages. This paper explores the use of tags in 1) K-means clustering in an extended vector space model that includes tags as well as page text and 2) a novel (...)
    1 citation
  22. Enforcing Transitivity in Coreference Resolution. Christopher D. Manning - unknown
    A desirable quality of a coreference resolution system is the ability to handle transitivity constraints, such that even if it places high likelihood on a particular mention being coreferent with each of two other mentions, it will also consider the likelihood of those two mentions being coreferent when making a final assignment. This is exactly the kind of constraint that integer linear programming (ILP) is ideal for, but, surprisingly, previous work applying ILP to coreference resolution has not encoded this type (...)
  23. Feature Selection for a Rich HPSG Grammar Using Decision Trees. Christopher D. Manning & Kristina Toutanova - unknown
    This paper examines feature selection for log linear models over rich constraint-based grammar (HPSG) representations by building decision trees over features in corresponding probabilistic context free grammars (PCFGs). We show that single decision trees do not make optimal use of the available information; constructed ensembles of decision trees based on different feature subspaces show significant performance gains (14% parse selection error reduction). We compare the performance of the learned PCFG grammars and log linear models over the same features.
    1 citation
  24. Learning Random Walk Models for Inducing Word Dependency Distributions. Christopher D. Manning & Kristina Toutanova - unknown
    Many NLP tasks rely on accurately estimating word dependency probabilities P(w1|w2), where the words w1 and w2 have a particular relationship (such as verb-object). Because of the sparseness of counts of such dependencies, smoothing and the ability to use multiple sources of knowledge are important challenges. For example, if the probability P(N|V) of noun N being the subject of verb V is high, and V takes similar objects to V′, and V′ is synonymous to V″, then (...)
  25. Nested Named Entity Recognition. Christopher D. Manning - unknown
    Many named entities contain other named entities inside them. Despite this fact, the field of named entity recognition has almost entirely ignored nested named entity recognition, but due to technological, rather than ideological reasons. In this paper, we present a new technique for recognizing nested named entities, by using a discriminative constituency parser. To train the model, we transform each sentence into a tree, with constituents for each named entity (and no other syntactic structure). We present results on both newspaper (...)
  26. Parse Selection on the Redwoods Corpus: 3rd Growth Results. Christopher D. Manning & Kristina Toutanova - unknown
    This report details experimental results of using stochastic disambiguation models for parsing sentences from the Redwoods treebank (Oepen et al., 2002). The goals of this paper are two-fold: (i) to report accuracy results on the more highly ambiguous latest version of the treebank, as compared to already published results achieved by the same stochastic models on a previous version of the corpus, and (ii) to present some newly developed models using features from the HPSG signs, as well as the MRS (...)
  27. The Stanford typed dependencies representation. Christopher D. Manning - unknown
    This paper examines the Stanford typed dependencies representation, which was designed to provide a straightforward description of grammatical relations for any user who could benefit from automatic text understanding. For such purposes, we argue that dependency schemes must follow a simple design and provide semantically contentful information, as well as offer an automatic procedure to extract the relations. We consider the underlying design principles of the Stanford scheme from this perspective, and compare it to the GR and PARC representations. Finally, (...)
    1 citation
  28. Which words are hard to recognize? Prosodic, lexical, and disfluency factors that increase ASR error rates. Christopher D. Manning - unknown
    Many factors are thought to increase the chances of misrecognizing a word in ASR, including low frequency, nearby disfluencies, short duration, and being at the start of a turn. However, few of these factors have been formally examined. This paper analyzes a variety of lexical, prosodic, and disfluency factors to determine which are likely to increase ASR error rates. Findings include the following. (1) For disfluencies, effects depend on the type of disfluency: errors increase by up to 15% (absolute) for (...)
  29. Probabilistic models of language processing and acquisition. Nick Chater & Christopher D. Manning - 2006 - Trends in Cognitive Sciences 10 (7):335–344.
    Probabilistic methods are providing new explanatory approaches to fundamental cognitive science questions of how humans structure, process and acquire language. This review examines probabilistic models defined over traditional symbolic structures. Language comprehension and production involve probabilistic inference in such models; and acquisition involves choosing the best model, given innate constraints and linguistic and other input. Probabilistic models can account for the learning and processing of language, while maintaining the sophistication of symbolic models. A recent burgeoning of theoretical developments and online (...)
    44 citations
  30. Accurate Unlexicalized Parsing. Dan Klein & Christopher D. Manning - unknown
    We demonstrate that an unlexicalized PCFG can parse much more accurately than previously shown, by making use of simple, linguistically motivated state splits, which break down false independence assumptions latent in a vanilla treebank grammar. Indeed, its performance of 86.36% (LP/LR F1) is better than that of early lexicalized PCFG models, and surprisingly close to the current state-of-the-art. This result has potential uses beyond establishing a strong lower bound on the maximum possible accuracy of unlexicalized models: an unlexicalized PCFG is (...)
    25 citations
  31. Fast Exact Inference with a Factored Model for Natural Language Parsing. Dan Klein & Christopher D. Manning - unknown
    We present a novel generative model for natural language tree structures in which semantic (lexical dependency) and syntactic (PCFG) structures are scored with separate models. This factorization provides conceptual simplicity, straightforward opportunities for separately improving the component models, and a level of performance comparable to similar, non-factored models. Most importantly, unlike other modern parsing models, the factored model admits an extremely effective A* parsing algorithm, which enables efficient, exact inference.
    5 citations
  32. Natural Language Grammar Induction using a Constituent-Context Model. Dan Klein & Christopher D. Manning - unknown
    This paper presents a novel approach to the unsupervised learning of syntactic analyses of natural language text. Most previous work has focused on maximizing likelihood according to generative PCFG models. In contrast, we employ a simpler probabilistic model over trees based directly on constituent identity and linear context, and use an EM-like iterative procedure to induce structure. This method produces much higher quality analyses, giving the best published results on the ATIS dataset.
    5 citations
  33. Studying the History of Ideas Using Topic Models. David Hall & Christopher D. Manning - unknown
    How can the development of ideas in a scientific field be studied over time? We apply unsupervised topic modeling to the ACL Anthology to analyze historical trends in the field of Computational Linguistics from 1978 to 2006. We induce topic clusters using Latent Dirichlet Allocation, and examine the strength of each topic over time. Our methods find trends in the field including the rise of probabilistic methods starting in 1988, a steady increase in applications, and a sharp decline of research (...)
    7 citations
  34. A Generative Constituent-Context Model for Improved Grammar Induction. Dan Klein & Christopher D. Manning - unknown
    We present a generative distributional model for the unsupervised induction of natural language syntax which explicitly models constituent yields and contexts. Parameter search with EM produces higher quality analyses than previously exhibited by unsupervised systems, giving the best published unsupervised parsing results on the ATIS corpus. Experiments on Penn treebank sentences of comparable length show an even higher F1 of 71% on nontrivial brackets. We compare distributionally induced and actual part-of-speech tags as input data, and examine extensions to the basic (...)
    4 citations
  35. Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora. David Hall & Christopher D. Manning - unknown
    A significant portion of the world’s text is tagged by readers on social bookmarking websites. Credit attribution is an inherent problem in these corpora because most pages have multiple tags, but the tags do not always apply with equal specificity across the whole document. Solving the credit attribution problem requires associating each word in a document with the most appropriate tags and vice versa. This paper introduces Labeled LDA, a topic model that constrains Latent Dirichlet Allocation by defining a one-to-one (...)
    4 citations
  36. Parsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank. Dan Klein & Christopher D. Manning - unknown
    This paper presents empirical studies and closely corresponding theoretical models of the performance of a chart parser exhaustively parsing the Penn Treebank with the Treebank’s own CFG grammar. We show how performance is dramatically affected by rule representation and tree transformations, but little by top-down vs. bottom-up strategies. We discuss grammatical saturation, including analysis of the strongly connected components of the phrasal nonterminals in the Treebank, and model how, as sentence length increases, the effective grammar rule size increases as regions (...)
    3 citations
  37. Conditional Structure versus Conditional Estimation in NLP Models. Dan Klein & Christopher D. Manning - unknown
    This paper separates conditional parameter estimation, which consistently raises test set accuracy on statistical NLP tasks, from conditional model structures, such as the conditional Markov model used for maximum-entropy tagging, which tend to lower accuracy. Error analysis on part-of-speech tagging shows that the actual tagging errors made by the conditionally structured model derive not only from label bias, but also from other ways in which the independence assumptions of the conditional model structure are unsuited to linguistic sequences. The (...)
    2 citations
  38. An O(n³) Agenda-Based Chart Parser for Arbitrary Probabilistic Context-Free Grammars. Dan Klein & Christopher D. Manning - unknown
    While O(n³) methods for parsing probabilistic context-free grammars (PCFGs) are well known, a tabular parsing framework for arbitrary PCFGs which allows for bottom-up, top-down, and other parsing strategies, has not yet been provided. This paper presents such an algorithm, and shows its correctness and advantages over prior work. The paper finishes by bringing out the connections between the algorithm and work on hypergraphs, which permits us to extend the presented Viterbi (best parse) algorithm to an inside (total probability) (...)
    1 citation
  39. A* parsing: Fast exact Viterbi parse selection. Dan Klein & Christopher D. Manning - unknown
    A* PCFG parsing can dramatically reduce the time required to find the exact Viterbi parse by conservatively estimating outside Viterbi probabilities. We discuss various estimates and give efficient algorithms for computing them. On Penn treebank sentences, our most detailed estimate reduces the total number of edges processed to less than 3% of that required by exhaustive parsing, and even a simpler estimate which can be pre-computed in under a minute still reduces the work by a factor of 5. The algorithm (...)
    1 citation
  40. Combining Heterogeneous Classifiers for Word-Sense Disambiguation. Dan Klein, Christopher D. Manning & Kristina Toutanova - unknown
    This paper discusses ensembles of simple but heterogeneous classifiers for word-sense disambiguation, examining the Stanford-CS224N system entered in the SENSEVAL-2 English lexical sample task. First-order classifiers are combined by a second-order classifier, which variously uses majority voting, weighted voting, or a maximum entropy model. While individual first-order classifiers perform comparably to middle-scoring teams’ systems, the combination achieves high performance. We discuss trade-offs and empirical performance. Finally, we present an analysis of the combination, examining how ensemble performance depends on error independence (...)
  41. Distributional Phrase Structure Induction. Dan Klein & Christopher D. Manning - unknown
    Unsupervised grammar induction systems commonly judge potential constituents on the basis of their effects on the likelihood of the data. Linguistic justifications of constituency, on the other hand, rely on notions such as substitutability and varying external contexts. We describe two systems for distributional grammar induction which operate on such principles, using part-of-speech tags as the contextual features. The advantages and disadvantages of these systems are examined, including precision/recall trade-offs, error analysis, and extensibility.
     
    1 citation
  42. From instance-level constraints to space-level constraints: Making the most of prior knowledge in data clustering. Dan Klein & Christopher D. Manning - unknown
    We present an improved method for clustering in the presence of very limited supervisory information, given as pairwise instance constraints. By allowing instance-level constraints to have space-level inductive implications, we are able to successfully incorporate constraints for a wide range of data set types. Our method greatly improves on the previously studied constrained k-means algorithm, generally requiring less than half as many constraints to achieve a given accuracy on a range of real-world data, while also being more robust when over-constrained. (...)
    1 citation
  43. Interpreting and extending classical agglomerative clustering algorithms using a model-based approach. Dan Klein & Christopher D. Manning - unknown
    erative clustering. First, we show formally that the common heuristic agglomerative clustering algorithms – Ward’s method, single-link, complete-link, and a variant of group-average – are each equivalent to a hierarchical model-based method. This interpretation gives a theoretical explanation of the empirical behavior of these algorithms, as well as a principled approach to resolving practical issues, such as number of clusters or the choice of method. Second, we show how a model-based viewpoint can suggest variations on these basic agglomerative algorithms. We (...)
  44.  20
    Parsing and Hypergraphs.Dan Klein & Christopher D. Manning - unknown
    While symbolic parsers can be viewed as deduction systems, this view is less natural for probabilistic parsers. We present a view of parsing as directed hypergraph analysis which naturally covers both symbolic and probabilistic parsing. We illustrate the approach by showing how a dynamic extension of Dijkstra’s algorithm can be used to construct a probabilistic chart parser with an O(n^3) time bound for arbitrary PCFGs, while preserving as much of the flexibility of symbolic chart parsers as allowed by the inherent (...)
    Bookmark   1 citation  
  45.  30
    NP Subject Detection in Verb-Initial Arabic Clauses.Spence Green & Christopher D. Manning - unknown
    Phrase re-ordering is a well-known obstacle to robust machine translation for language pairs with significantly different word orderings. For Arabic-English, two languages that usually differ in the ordering of subject and verb, the subject and its modifiers must be accurately moved to produce a grammatical translation. This operation requires more than base phrase chunking and often defies current phrase-based statistical decoders. We present a conditional random field sequence classifier that detects the full scope of Arabic noun phrase subjects in (...)
  46.  13
    Optimizing Local Probability Models for Statistical Parsing.Mark Mitchell, Christopher D. Manning & Kristina Toutanova - unknown
    This paper studies the properties and performance of models for estimating local probability distributions which are used as components of larger probabilistic systems — history-based generative parsing models. We report experimental results showing that memory-based learning outperforms many commonly used methods for this task (Witten-Bell, Jelinek-Mercer with fixed weights, decision trees, and log-linear models). However, we can connect these results with the commonly used general class of deleted interpolation models by showing that certain types of memory-based learning, including the kind (...)
  47.  60
    Learning Alignments and Leveraging Natural Logic.Nathanael Chambers, Daniel Cer, Trond Grenager, David Hall, Chloe Kiddon, Bill MacCartney, Marie-Catherine de Marneffe, Daniel Ramage, Eric Yeh & Christopher D. Manning - unknown
    We describe an approach to textual inference that improves alignments at both the typed dependency level and at a deeper semantic level. We present a machine learning approach to alignment scoring, a stochastic search procedure, and a new tool that finds deeper semantic alignments, allowing rapid development of semantic features over the aligned graphs. Further, we describe a complementary semantic component based on natural logic, which shows an added gain of 3.13% accuracy on the RTE3 test set.
    Bookmark   3 citations  
  48.  20
    A Generative Model for Semantic Role Labeling.Cynthia A. Thompson, Roger Levy & Christopher D. Manning - unknown
    Determining the semantic role of sentence constituents is a key task in determining sentence meanings lying behind a veneer of variant syntactic expression. We present a model of natural language generation from semantics using the FrameNet semantic role and frame ontology. We train the model using the FrameNet corpus and apply it to the task of automatic semantic role and frame identification, producing results competitive with previous work (about 70% role labeling accuracy). Unlike previous models used for this task, our (...)
  49.  28
    Aligning Semantic Graphs for Textual Inference and Machine Reading.Marie-Catherine de Marneffe, Trond Grenager, Bill MacCartney, Daniel Cer, Daniel Ramage, Chloe Kiddon & Christopher D. Manning - unknown
    This paper presents our work on textual inference and situates it within the context of the larger goals of machine reading. The textual inference task is to determine if the meaning of one text can be inferred from the meaning of another and from background knowledge. Our system generates semantic graphs as a representation of the meaning of a text. This paper presents new results for aligning pairs of semantic graphs, and proposes the application of natural logic to derive inference (...)
    No categories
    Bookmark   1 citation  
  50.  92
    Classics in the History of Psychology.Christopher D. Green - unknown
    Psychology as the behaviorist views it is a purely objective experimental branch of natural science. Its theoretical goal is the prediction and control of behavior. Introspection forms no essential part of its methods, nor is the scientific value of its data dependent upon the readiness with which they lend themselves to interpretation in terms of consciousness. The behaviorist, in his efforts to get a unitary scheme of animal response, recognizes no dividing line between man and brute. The behavior of man, (...)