|Abstract||lecting statistics about missing bindings and macros, and other errors. This guides debugging and development eﬀorts, leading to iterative improvements in both the tools and the quality of the converted corpus. The build system thus serves as both a production conversion engine and software test harness. We have now processed the complete arχiv collection through 2006 consisting of more than 400,000 documents (a complete run is a processor-yearsize undertaking), continuously improving our success rate. We are now able to convert more than 90% of these documents to XHTML+MathML. We consider over 60% to be successes, converted with no or minor warnings. While the remaining 30% can also be converted, their quality is doubtful, due to unsupported macros or conversion errors|
|Keywords||No keywords specified (fix it)|
No categories specified
(categorize this paper)
|Through your library||Only published papers are available at libraries|
Similar books and articles
Go Eguchi & Laurence L. Leff (2002). Rule-Based XML. Artificial Intelligence and Law 10 (4).
Fady Farah & François Rousselot (2007). DARES: Documents Annotation and Recombining System—Application to the European Law. [REVIEW] Artificial Intelligence and Law 15 (2):83-102.
Howard Turtle (1995). Text Retrieval in the Legal World. Artificial Intelligence and Law 3 (1-2):5-54.
D. Ginev, C. David & M. Kohlhase, An Architecture for Linguistic and Semantic Analysis on the ARXMLIV Corpus.
Michael Kohlhase, Towards Mkm in the Large: Modular Representation and Scalable Software Architecture.
Ronald N. Kostoff, Dustin Johnson, J. Antonio Ridelo, Louis A. Bloomfield, Michael F. Shlesinger, Guido Malpohl & Hector D. Cortes (2006). Duplicate Publication and 'Paper Inflation' in the Fractals Literature. Science and Engineering Ethics 12 (3).
Sorry, there are not enough data points to plot this chart.
Added to index2010-12-22
Recent downloads (6 months)0
How can I increase my downloads?