A Unique Indexing Technique for Discourse Structures

Journal of Intelligent Systems 23 (3):231-243 (2014)
  Copy   BIBTEX

Abstract

Sutra is a form of text representation that has been used in both Tamil and Sanskrit literature to convey information in a short and crisp manner. Nanool, an ancient Tamil grammar masterpiece has used sutras for defining grammar rules. Similarly, in Sanskrit literature, many of the Shāstrās have used sutras for a concise representation of their content. Sutras are defined as short aphorisms, formulae-like structures that convey the complete essence of the text. They act as indices to the elaborate content they refer to. Inspired by their characteristics, this article proposes an indexing mechanism based on sutras for discourse structures built using rhetorical structure theory and also using Sangati, a concept proposed in Sanskrit literature. The indices identified by the indexer are ideal for question answering, summary generation, and information retrieval systems. The indexer has been tested on IR system using 1000 Tamil language text documents. A performance comparison has also been made with one of the existing RST-based indexing technique.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 93,612

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Tirumūlar and the Tamil Yoga Connection.Kanniks Kannikeswaran - 2021 - Journal of Dharma Studies 4 (2):241-260.
On the Concept "text" of Tiantai Zhiyi.Chaoshun Guo - 2003 - Philosophy and Culture 30 (3):61-76.

Analytics

Added to PP
2017-01-11

Downloads
6 (#711,559)

6 months
6 (#1,472,471)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references