Results for 'Christopher Manning'

(not author) ( search as author name )
988 found
Order:
  1.  12
    Template Sampling for Leveraging Domain Knowledge in Information Extraction.Christopher Cox, Christopher Manning & Pat Langley - unknown
    We initially describe a feature-rich discriminative Conditional Random Field (CRF) model for Information Extraction in the workshop announcements domain, which offers good baseline performance in the PASCAL shared task. We then propose a method for leveraging domain knowledge in Information Extraction tasks, scoring candidate document labellings as one-value-per-field templates according to domain feasibility after generating sample labellings from a trained sequence classifier. Our relational models evaluate these templates according to our intuitions about agreement in the domain: workshop acronyms should resemble (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  2.  29
    Exploiting the Block structure of theweb for computing pagerank.Christopher Manning with Sepandar D. Kamvar, Taher H. Haveliwala & and Gene H. Golub - manuscript
    Direct download  
     
    Export citation  
     
    Bookmark  
  3. An Introduction to Information Retrieval.Christopher D. Manning - unknown
    1 Boolean retrieval 1 2 The term vocabulary and postings lists 19 3 Dictionaries and tolerant retrieval 49 4 Index construction 67 5 Index compression 85 6 Scoring, term weighting and the vector space model 109 7 Computing scores in a complete search system 135 8 Evaluation in information retrieval 151 9 Relevance feedback and query expansion 177 10 XML retrieval 195 11 Probabilistic information retrieval 219 12 Language models for information retrieval 237 13 Text classification and Naive Bayes 253 (...)
    Direct download  
     
    Export citation  
     
    Bookmark   18 citations  
  4. Accurate unlexicalized parsing.Christopher Manning - manuscript
    assumptions latent in a vanilla treebank grammar. Indeed, its performance of 86.36% (LP/LR F1) is..
    Direct download  
     
    Export citation  
     
    Bookmark   11 citations  
  5.  56
    Generating Typed Dependency Parses from Phrase Structure Parses.Christopher Manning - unknown
    This paper describes a system for extracting typed dependency parses of English sentences from phrase structure parses. In order to capture inherent relations occurring in corpus texts that can be critical in real-world applications, many NP relations are included in the set of grammatical relations used. We provide a comparison of our system with Minipar and the Link parser. The typed dependency extraction facility described here is integrated in the Stanford Parser, available for download.
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   8 citations  
  6.  65
    Probabilistic syntax.Christopher Manning - manuscript
    “Everyone knows that language is variable.” This is the bald sentence with which Sapir (1921:147) begins his chapter on language as an historical product. He goes on to emphasize how two speakers’ usage is bound to differ “in choice of words, in sentence structure, in the relative frequency with which particular forms or combinations of words are used”. I should add that much sociolinguistic and historical linguistic research has shown that the same speaker’s usage is also variable (Labov 1966, Kroch (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   8 citations  
  7.  36
    Learning to recognize features of valid textual entailments.Christopher Manning - unknown
    separated from evaluating entailment. Current approaches to semantic inference in question answer-.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   7 citations  
  8.  42
    Rightward shift in spatial awareness with declining alertness.Tom Manly, Veronika B. Dobler, Christopher M. Dodds & Melanie A. George - 2005 - Neuropsychologia 43 (12):1721-1728.
  9.  61
    Feature-rich part-of-speech tagging with a cyclic dependency network.Christopher Manning - manuscript
    first-order HMM, the current tag t0 is predicted based on the previous tag t−1 (and the current word).1 The back- We present a new part-of-speech tagger that ward interaction between t0 and the next tag t+1 shows demonstrates the following ideas: (i) explicit up implicitly later, when t+1 is generated in turn. While unidirectional models are therefore able to capture both use of both preceding and following tag con-.
    Direct download  
     
    Export citation  
     
    Bookmark   7 citations  
  10.  44
    Natural Logic for Textual Inference.Christopher D. Manning - unknown
    This paper presents the first use of a computational model of natural logic—a system of logical inference which operates over natural language—for textual inference. Most current approaches to the PAS- CAL RTE textual inference task achieve robustness by sacrificing semantic precision; while broadly effective, they are easily confounded by ubiquitous inferences involving monotonicity. At the other extreme, systems which rely on first-order logic and theorem proving are precise, but excessively brittle. This work aims at a middle way. Our system finds (...)
    Direct download  
     
    Export citation  
     
    Bookmark   6 citations  
  11.  34
    Ergativity: Argument Structure and Grammatical Relations.Christopher D. Manning - unknown
    I wish to present a codi cation of syntactic approaches to dealing with ergative languages and argue for the correctness of one particular approach, which I will call the Inverse Grammatical Relations hypothesis.1 I presume familiarity with the term `ergativity', but, brie y, many languages have ergative case marking, such as Burushaski in (1), in contrast to the accusative case marking of Latin in (2). More generally, if we follow Dixon (1979) and use A to mark the agent-like argument of (...)
    Direct download  
     
    Export citation  
     
    Bookmark   5 citations  
  12.  30
    Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling.Christopher Manning - unknown
    Most current statistical natural language processing models use only local features so as to permit dynamic programming in inference, but this makes them unable to fully account for the long distance structure that is prevalent in language use. We show how to solve this dilemma with Gibbs sam- pling, a simple Monte Carlo method used to perform approximate inference in factored probabilistic models. By using simulated annealing in place of Viterbi decoding in sequence models such as HMMs, CMMs, and CRFs, (...)
    Direct download  
     
    Export citation  
     
    Bookmark   5 citations  
  13.  38
    An extended model of natural logic.Christopher D. Manning & Bill MacCartney - unknown
    We propose a model of natural language inference which identifies valid inferences by their lexical and syntactic features, without full semantic interpretation. We extend past work in natural logic, which has focused on semantic containment and monotonicity, by incorporating both semantic exclusion and implicativity. Our model decomposes an inference problem into a sequence of atomic edits linking premise to hypothesis; predicts a lexical semantic relation for each edit; propagates these relations upward through a semantic composition tree according to properties of (...)
    Direct download  
     
    Export citation  
     
    Bookmark   5 citations  
  14.  51
    Enriching the knowledge sources used in a maximum entropy part-of-speech tagger.Christopher Manning - manuscript
    Kristina Toutanova Christopher D. Manning Dept of Computer Science Depts of Computer Science and Linguistics Gates Bldg 4A, 353 Serra Mall Gates Bldg 4A, 353 Serra Mall Stanford, CA 94305–9040, USA Stanford, CA 94305–9040, USA [email protected] manning@cs.stanford.edu..
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   4 citations  
  15.  34
    Learning to distinguish valid textual entailments.Christopher D. Manning & Daniel Cer - unknown
    This paper proposes a new architecture for textual inference in which finding a good alignment is separated from evaluating entailment. Current approaches to semantic inference in question answering and textual entailment have approximated the entailment problem as that of computing the best alignment of the hypothesis to the text, using a locally decomposable matching score. While this formulation is adequate for representing local (word-level) phenomena such as synonymy, it is incapable of representing global interactions, such as that between verb negation (...)
    Direct download  
     
    Export citation  
     
    Bookmark   4 citations  
  16.  80
    Finding contradictions in text.Christopher Manning - manuscript
    Marie-Catherine de Marneffe, Anna N. Rafferty and Christopher D. Manning Linguistics Department Computer Science Department Stanford University Stanford University Stanford, CA 94305 Stanford, CA 94305 {rafferty,manning}@stanford.edu [email protected]..
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   3 citations  
  17.  40
    Learning alignments and leveraging natural logic.Christopher Manning - manuscript
    Nathanael Chambers, Daniel Cer, Trond Grenager, David Hall, Chloe Kiddon Bill MacCartney, Marie-Catherine de Marneffe, Daniel Ramage Eric Yeh, Christopher D. Manning Computer Science Department Stanford University Stanford, CA 94305.
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   3 citations  
  18.  43
    Modeling Semantic Containment and Exclusion in Natural Language Inference.Christopher D. Manning - unknown
    We propose an approach to natural language inference based on a model of natural logic, which identifies valid inferences by their lexical and syntactic features, without full semantic interpretation. We greatly extend past work in natural logic, which has focused solely on semantic containment and monotonicity, to incorporate both semantic exclusion and implicativity. Our system decomposes an inference problem into a sequence of atomic edits linking premise to hypothesis; predicts a lexical entailment relation for each edit using a statistical classifier; (...)
    Direct download  
     
    Export citation  
     
    Bookmark   3 citations  
  19.  12
    Robust Textual Inference via Graph Matching.Christopher D. Manning - unknown
    We present a system for deciding whether a given sentence can be inferred from text. Each sentence is represented as a directed graph (extracted from a dependency parser) in which the nodes represent words or phrases, and the links represent syntactic and semantic relationships. We develop a learned graph matching model to approximate entailment by the amount of the sentence’s semantic content which is contained in the text. We present results on the Recognizing Textual Entailment dataset (Dagan et al., 2005), (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   3 citations  
  20.  54
    Optimizing chinese word segmentation for machine translation performance.Christopher Manning - unknown
    Pi-Chuan Chang, Michel Galley, and Christopher D. Manning Computer Science Department, Stanford University Stanford, CA 94305 pichuan,galley,manning@cs.stanford.edu..
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   2 citations  
  21.  61
    Part-of-Speech Tagging from 97% to 100%: Is It Time for Some Linguistics?Christopher D. Manning - unknown
    I examine what would be necessary to move part-of-speech tagging performance from its current level of about 97.3% token accuracy (56% sentence accuracy) to close to 100% accuracy. I suggest that it must still be possible to greatly increase tagging performance and examine some useful improvements that have recently been made to the Stanford Part-of-Speech Tagger. However, an error analysis of some of the remaining errors suggests that there is limited further mileage to be had either from better machine learning (...)
    Direct download  
     
    Export citation  
     
    Bookmark   2 citations  
  22.  16
    Automatic Acquisition of a Large Subcategorization Dictionary From Corpora.Christopher D. Manning - unknown
    This paper presents a new method for producing a dictionary of subcategorization frames from unlabelled text corpora. It is shown that statistical filtering of the results of a finite state parser running on the output of a stochastic tagger produces high quality results, despite the error rates of the tagger and the parser. Further, it is argued that this method can be used to learn all subcategorization frames, whereas previous methods are not extensible to a general solution to the problem.
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   2 citations  
  23.  85
    Efficient, Feature-based, Conditional Random Field Parsing.Christopher D. Manning - unknown
    Discriminative feature-based methods are widely used in natural language processing, but sentence parsing is still dominated by generative methods. While prior feature-based dynamic programming parsers have restricted training and evaluation to artificially short sentences, we present the first general, featurerich discriminative parser, based on a conditional random field model, which has been successfully scaled to the full WSJ parsing data. Our efficiency is primarily due to the use of stochastic optimization techniques, as well as parallelization and chart prefiltering. On WSJ15, (...)
    Direct download  
     
    Export citation  
     
    Bookmark   2 citations  
  24.  15
    Soft Constraints Mirror Hard Constraints: Voice and Person in English and Lummi.Christopher D. Manning - unknown
    The same categorical phenomena which are attributed to hard grammatical constraints in some languages continue to show up as statistical preferences in other languages, motivating a grammatical model that can account for soft constraints. The effects of a hierarchy of person (1st, 2nd 3rd) on grammar are categorical in some languages, most famously in languages withError: Illegal entry in bfrange block in ToUnicode CMap inverse systems, but also in languages with person restrictions on passivization. In Lummi, for example, the person (...)
    Direct download  
     
    Export citation  
     
    Bookmark   2 citations  
  25.  26
    The Lexical Integrity of Japanese Causatives.Christopher D. Manning & Ivan A. Sag - unknown
    Grammatical theory has long wrestled with the fact that causative constructions exhibit properties of both single words and complex phrases. However, as Paul Kiparsky has observed, the distribution of such properties of causatives is not arbitrary: ‘construal’ phenomena such as honorification, anaphor and pronominal binding, and quantifier ‘floating’ typically behave as they would if causatives were syntactically complex, embedding constructions; whereas case marking, agreement and word order phenomena all point to the analysis of causatives as single lexical items.1 Although an (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  26.  66
    Fast exact inference with a factored model for natural language parsing.Christopher Manning - manuscript
    We present a novel generative model for natural language tree structures in which semantic (lexical dependency) and syntactic (PCFG) structures are scored with separate models. This factorization provides conceptual simplicity, straightforward opportunities for separately improving the component models, and a level of performance comparable to similar, non-factored models. Most importantly, unlike other modern parsing models, the factored model admits an extremely effective A* parsing algorithm, which enables efficient, exact inference.
    Direct download  
     
    Export citation  
     
    Bookmark   2 citations  
  27.  50
    NIST open machine translation 2008 evaluation: Stanford university's system description.Christopher Manning - unknown
    Michel Galley, Pi-Chuan Chang, Daniel Cer, Jenny R. Finkel, and Christopher D. Manning Computer Science and Linguistics Departments Stanford University..
    Direct download  
     
    Export citation  
     
    Bookmark  
  28. A Phrase-Based Alignment Model for Natural Language Inference.Christopher D. Manning - unknown
    The alignment problem—establishing links between corresponding phrases in two related sentences—is as important in natural language inference (NLI) as it is in machine translation (MT). But the tools and techniques of MT alignment do not readily transfer to NLI, where one cannot assume semantic equivalence, and for which large volumes of bitext are lacking. We present a new NLI aligner, the MANLI system, designed to address these challenges. It uses a phrase-based alignment representation, exploits external lexical resources, and capitalizes on (...)
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  29.  33
    Argument structure as a locus for binding theory.Christopher D. Manning - unknown
    The correct locus (or loci) of binding theory has been a matter of much discussion. Theories can be seen as varying along at least two dimensions. The rst is whether binding theory is con gurationally determined (that is, the theory exploits the geometry of a phrase marker, appealing to such purely structural notions as c-command and government) or whether the theory depends rather on examining the relations between items selected by a predicate (where by selection I am intending to cover (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  30.  7
    Romance Complex Predicates: In defence of the right-branching structure.Christopher D. Manning - unknown
    Abeill´e and Godard (1994) seek to show that the rightward branching analysis of French tense auxiliaries shown in (1b), that I argued for in Manning (1992) and which is widely adopted in general, is wrong, and that rather we should adopt a flat analysis for this construction as shown in (1c), and they show how such an analysis can be realized within HPSG (Pollard and Sag 1994).
    Direct download  
     
    Export citation  
     
    Bookmark  
  31.  19
    Stanford University.Christopher D. Manning - unknown
    Technology for local textual inference is central to producing a next generation of intelligent yet robust human language processing systems. One can think of it as Information Retrieval++. It is needed for a search on male fertility may be affected by use of cell phones to match a document saying Startling new research into mobile phones suggests they can reduce a man’s sperm count up to 30%, despite the fact that the only word overlap is phones. But textual inference is (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  32.  27
    An ¢¡¤£¦¥¨§ agenda-based chart parser for arbitrary probabilistic context-free grammars.Christopher Manning - manuscript
    fundamental rule” in an order-independent manner, such that the same basic algorithm supports top-down and Most PCFG parsing work has used the bottom-up bottom-up parsing, and the parser deals correctly with CKY algorithm (Kasami, 1965; Younger, 1967) with the difficult cases of left-recursive rules, empty elements, Chomsky Normal Form Grammars (Baker, 1979; Jeand unary rules, in a natural way.
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  33.  68
    A Conditional Random Field Word Segmenter.Christopher Manning - unknown
    We present a Chinese word segmentation system submitted to the closed track of Sighan bakeoff 2005. Our segmenter was built using a conditional random field sequence model that provides a framework to use a large number of linguistic features such as character identity, morphological and character reduplication features. Because our morphological features were extracted from the training corpora automatically, our system was not biased toward any particular variety of Mandarin. Thus, our system does not overfit the variety of Mandarin most (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  34.  34
    An Effective Two-Stage Model for Exploiting Non-Local Dependencies in Named Entity Recognition.Christopher D. Manning - unknown
    This paper shows that a simple two-stage approach to handle non-local dependencies in Named Entity Recognition (NER) can outperform existing approaches that handle non-local dependencies, while being much more computationally efficient. NER systems typically use sequence models for tractable inference, but this makes them unable to capture the long distance structure present in text. We use a Conbel.
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  35.  36
    A generative model for semantic role labeling.Christopher Manning - manuscript
    Determining the semantic role of sentence constituents is a key task in determining sentence meanings lying behind a veneer of variant syntactic expression. We present a model of natural language generation from semantics using the FrameNet semantic role and frame ontology. We train the model using the FrameNet corpus and apply it to the task of automatic semantic role and frame identification, producing results competitive with previous work (about 70% role labeling accuracy). Unlike previous models used for this task, our (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  36.  48
    A Simple and Effective Hierarchical Phrase Reordering Model.Christopher D. Manning - unknown
    adjacent phrases, but they typically lack the ability to perform the kind of long-distance reorderings possible with syntax-based systems. In this paper, we present a novel hierarchical phrase reordering model aimed at improving non-local reorderings, which seamlessly integrates with a standard phrase-based system with little loss of computational efficiency. We show that this model can successfully handle the key examples often used to motivate syntax-based systems, such as the rotation of a prepositional phrase around a noun phrase. We contrast our (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  37.  16
    A System For Identifying Named Entities in Biomedical Text: How Results From Two Evaluations Reflect on Both the System and the Evaluations.Christopher Manning - unknown
    We present a maximum-entropy based system for identifying Named Entities (NEs) in biomedical abstracts and present its performance in the only two biomedical Named Entity Recognition (NER) comparative evaluations that have been held to date, namely BioCreative and Coling BioNLP. Our system obtained an exact match f-score of 83.2% in the BioCreative evaluation and 70.1% in the BioNLP evaluation. We discuss our system in detail including its rich use of local features, attention to correct boundary identification, innovative use of external (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  38.  3
    Computations.Christopher D. Manning - unknown
    We present a novel algorithm for the fast computation of PageRank, a hyperlink-based estimate of the “importance” of Web pages. The original PageRank algorithm uses the Power Method to compute successive iterates that converge to the principal eigenvector of the Markov matrix representing the Web link graph. The algorithm presented here, called Quadratic Extrapolation, accelerates the convergence of the Power Method by periodically subtracting off estimates of the nonprincipal eigenvectors from the current iterate of the Power Method. In Quadratic Extrapolation, (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  39.  26
    Computing pagerank using power extrapolation.Christopher Manning - manuscript
    We present a novel technique for speeding up the computation of PageRank, a hyperlink-based estimate of the “importance” of Web pages, based on the ideas presented in [7]. The original PageRank algorithm uses the Power Method to compute successive iterates that converge to the principal eigenvector of the Markov matrix representing the Web link graph. The algorithm presented here, called Power Extrapolation, accelerates the convergence of the Power Method by subtracting off the error along several nonprincipal eigenvectors from the current (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  40.  58
    Clustering the Tagged Web.Christopher D. Manning - unknown
    Automatically clustering web pages into semantic groups promises improved search and browsing on the web. In this paper, we demonstrate how user-generated tags from largescale social bookmarking websites such as del.icio.us can be used as a complementary data source to page text and anchor text for improving automatic clustering of web pages. This paper explores the use of tags in 1) K-means clustering in an extended vector space model that includes tags as well as page text and 2) a novel (...)
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  41.  44
    Dissociations between argument structure and grammatical relations.Christopher Manning - manuscript
    In Pollard and Sag (1987) and Pollard and Sag (1994:Ch. 1–8), the subcategorized arguments of a head are stored on a single ordered list, the subcat list. However, Borsley (1989) argues that there are various defi- ciencies in this approach, and suggests that the unified list should be split into separate lists for subjects, complements, and specifiers. This proposal has been widely adopted in what is colloquially known as HPSG3 (Pollard and Sag (1994:Ch. 9) and other recent work in HPSG). (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  42.  51
    Disambiguating “DE” for Chinese-English Machine Translation.Christopher D. Manning - unknown
    Linking constructions involving dሇ (DE) are ubiquitous in Chinese, and can be translated into English in many different ways. This is a major source of machine translation error, even when syntaxsensitive translation models are used. This paper explores how getting more information about the syntactic, semantic, and discourse context of uses of dሇ (DE) can facilitate producing an appropriate English translation strategy. We describe a finergrained classification of dሇ (DE) constructions in Chinese NPs, construct a corpus of annotated examples, and (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  43.  5
    Exploiting Context for Biomedical Entity Recognition: From Syntax to the Web.Christopher Manning - unknown
    We describe a machine learning system for the recognition of names in biomedical texts. The system makes extensive use of local and syntactic features within the text, as well as external resources including the web and gazetteers. It achieves an F- score of 70% on the Coling 2004 NLPBA/BioNLP shared task of identifying five biomedical named entities in the GENIA corpus.
    Direct download  
     
    Export citation  
     
    Bookmark  
  44.  31
    Extensions to HMM-based statistical word alignment models.Christopher Manning - manuscript
    translation. We present a method for using part of speech tag information to improve alignment accu-.
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  45.  38
    Enforcing Transitivity in Coreference Resolution.Christopher D. Manning - unknown
    A desirable quality of a coreference resolution system is the ability to handle transitivity constraints, such that even if it places high likelihood on a particular mention being coreferent with each of two other mentions, it will also consider the likelihood of those two mentions being coreferent when making a final assignment. This is exactly the kind of constraint that integer linear programming (ILP) is ideal for, but, surprisingly, previous work applying ILP to coreference resolution has not encoded this type (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  46.  9
    Feature Selection for a Rich HPSG Grammar Using Decision Trees.Christopher D. Manning & Kristina Toutanova - unknown
    This paper examines feature selection for log linear models over rich constraint-based grammar (HPSG) representations by building decision trees over features in corresponding probabilistic context free grammars (PCFGs). We show that single decision trees do not make optimal use of the available information; constructed ensembles of decision trees based on different feature subspaces show signifi- cant performance gains (14% parse selection error reduction). We compare the performance of the learned PCFG grammars and log linear models over the same features.
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  47.  34
    Lexical conceptual structure and marathi.Christopher Manning - manuscript
    Jackendoff (1987, 1990) has brought up various problems with the current use of thematic roles (Kiparsky, 1987; Bresnan & Kanerva, 1989 and references cited therein) and suggested a different way of thinking of thematic roles as structural configurations in his semantic Lexical Conceptual Structures (LCSs). Conversely, Joshi (1989) has claimed that Jackendoff’s LCSs alone are insufficient, and that an analysis of certain facts in Marathi additionally requires the existence of a level of predicate-argument structure (PAS). Below we will mention a (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  48.  37
    Log-linear models for label ranking.Christopher Manning, Ofer Dekel & Yoram Singer - manuscript
    In Sebastian Thrun, Lawrence K. Saul, and Bernhard Schölkopf (eds), Advances in Neural Information Processing Systems 16 (NIPS 2003). Cambridge, MA: MIT Press, pp. 497-504.
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  49.  19
    Learning Random Walk Models for Inducing Word Dependency Distributions.Christopher D. Manning & Kristina Toutanova - unknown
    Many NLP tasks rely on accurately estimating word dependency probabilities P(w1|w2), where the words w1 and w2 have a particular relationship (such as verb-object). Because of the sparseness of counts of such dependencies, smoothing and the ability to use multiple sources of knowledge are important challenges. For example, if the probability P(N |V ) of noun N being the subject of verb V is high, and V takes similar objects to V , and V is synonymous to V , then (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  50.  43
    Language varieties.Christopher Manning - unknown
    Part-of-speech tagging, like any supervised statistical NLP task, is more difficult when test sets are very different from training sets, for example when tagging across genres or language varieties. We examined the problem of POS tagging of different varieties of Mandarin Chinese. An analytic study first showed that unknown words were a major source of difficulty in cross-variety tagging. Unknown words in English tend to be proper nouns. By contrast, we found that Mandarin unknown words were mostly common nouns and (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
1 — 50 / 988