Skip to main content
Log in

A Principled Approach to Grammars for Controlled Natural Languages and Predictive Editors

  • Published:
Journal of Logic, Language and Information Aims and scope Submit manuscript

Abstract

Controlled natural languages (CNL) with a direct mapping to formal logic have been proposed to improve the usability of knowledge representation systems, query interfaces, and formal specifications. Predictive editors are a popular approach to solve the problem that CNLs are easy to read but hard to write. Such predictive editors need to be able to “look ahead” in order to show all possible continuations of a given unfinished sentence. Such lookahead features, however, are difficult to implement in a satisfying way with existing grammar frameworks, especially if the CNL supports complex nonlocal structures such as anaphoric references. Here, methods and algorithms are presented for a new grammar notation called Codeco, which is specifically designed for controlled natural languages and predictive editors. A parsing approach for Codeco based on an extended chart parsing algorithm is presented. A large subset of Attempto Controlled English has been represented in Codeco. Evaluation of this grammar and the parser implementation shows that the approach is practical, adequate and efficient.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Adriaens, G., & Schreors, D. (1992). From COGRAM to ALCOGRAM: Toward a controlled English grammar checker. In: Proceedings of the 14th conference on computational linguistics, vol. 2, pp. 595–601. Association for Computational Linguistics, Morristown, NJ.

  • Angelov, K., & Ranta, A. (2010). Implementing controlled languages in GF. In: Proceedings of the workshop on controlled natural language (CNL 2009), Lecture Notes in Computer Science, vol. 5972, pp. 82–101. Berlin: Springer.

  • Bernstein, A., & Kaufmann, E. (2006). GINO—a guided input natural language ontology editor. In: The Semantic Web—ISWC 2006, Proceedings of the 5th international semantic web conference, lecture notes in computer science, vol. 4273, pp. 144–157. Berlin: Springer.

  • Chomsky N. (1980) On binding. Linguistic Inquiry 11(1): 1–46

    Google Scholar 

  • Clark, P., Chaw, S. Y., Barker, K., Chaudhri, V., Harrison, P., Fan, J., John, B., Porter, B., Spaulding, A., Thompson, J., & Yeh, P. (2007). Capturing and answering questions posed to a knowledge-based system. In: K-CAP ’07: Proceedings of the 4th international conference on knowledge capture, pp. 63–70. ACM.

  • Clark, P., Harrison, P., Jenkins, T., Thompson, J., & Wojcik, R.H. (2005). Acquiring and using world knowledge using a restricted subset of English. In: Proceedings of the eighteenth international Florida artificial intelligence research society conference (FLAIRS 2005), pp. 506–511. AAAI Press.

  • Cole, R., Mariani, J., Uszkoreit, H., Varile, G. B., Zaenen, A., Zampolli, A., Zue, V. (eds) (1997) Survey of the state of the art in human language technology. Cambridge University Press, Cambridg

    Google Scholar 

  • Covington M. A. (1994) Natural language processing for Prolog programmers. Prentice Hall, Englewood Cliffs, NJ, USA

    Google Scholar 

  • Dahl, V., Tarau, P., & Li, R. (1997). Assumption grammars for processing natural language. In: L. Naish (ed.) Proceedings of the fourteenth international conference on logic programming, pp. 256–270. Cambridge, MA : MIT Press.

  • Dimitrova, V., Denaux, R., Hart, G., Dolbear, C., Holt, I., & Cohn, A.G. (2008). Involving domain experts in authoring owl ontologies. The semantic web—proceedings of the 7th international semantic web conference (ISWC 2008) pp. 1–16.

  • Earley J. (1970) An efficient context-free parsing algorithm. Communications of the ACM 13(2): 94–102

    Article  Google Scholar 

  • Franconi, E., Guagliardo, P., Tessaris, S., & Trevisan, M. (2011). Quelo: an ontology-driven query interface. In: Proceedings of the 24th international workshop on description logics (DL 2011).

  • Fuchs, N. E., Kaljurand, K., & Kuhn, T. (2008). Attempto Controlled English for knowledge representation. In: Reasoning Web—4th international summer school 2008, lecture notes in computer science, vol. 5224, pp. 104–124. Berlin: Springer.

  • Fuchs, N. E., Schwertel, U., & Schwitter, R. (1990). Attempto Controlled English - not just another logic specification language. In: Proceedings of the 8th International Workshop on Logic Programming Synthesis and Transformation (LOPSTR ’98). Berlin: Springer.

  • Funk, A., Tablan, V., Bontcheva, K., Cunningham, H., Davis, B., & Handschuh, S. (2007). CLOnE: Controlled language for ontology editing. In: Proceedings of the 6th international semantic web conference and the 2nd Asian semantic web conference (ISWC 2007 + ASWC 2007), lecture notes in computer science, vol. 4825, pp. 142–155. Berlin: Springer.

  • Gazdar G. (1985) Generalized phrase structure grammar. Harvard University Press, Cambridge

    Google Scholar 

  • Gazdar G., Mellish C. (1989) Natural language processing in PROLOG. Addison-Wesley, Reading, MA

    Google Scholar 

  • Grune D., Jacobs C. J. (2008) Parsing techniques—a practical guide, second edn monographs in computer science. Springer Science+Business Media, New York, NY

    Google Scholar 

  • Hobbs J. R. (1978) Resolving pronoun references. Lingua 44(4): 311–338

    Article  Google Scholar 

  • Johnson, S.C. (1975). Yacc: Yet another compiler-compiler. Computer science technical report 32 Murray Hill, NJ: Bell Laboratories

  • Johnson, M., & Klein, E. (1986). Discourse, anaphora and parsing. In: Proceedings of the 11th coference on computational linguistics, COLING ’86, pp. 669–675. Association for computational linguistics, Stroudsburg, PA.

  • Joshi A. K., Levy L. S., Takahashi M. (1975) Tree adjunct grammars. Journal of Computer and System Sciences 10(1): 136–163

    Article  Google Scholar 

  • Kaplan R. M., Bresnan J. (1982) Lexical-functional grammar: A formal system for grammatical representation. In: Bresnan J. (eds) The mental representation of grammatical relations. MIT Press, Cambridge, MA, pp 173–281

    Google Scholar 

  • Knuth D. E. (1964) Backus normal form vs. backus naur form. Communications of the ACM 7(12): 735–736

    Article  Google Scholar 

  • Kuhn, T. (2007). AceRules: Executing rules in controlled natural language. In: Web reasoning and rule systems—first international conference (RR 2007), lecture notes in computer science, vol. 4524, pp. 299–308. Berlin: Springer.

  • Kuhn, T. (2009). AceWiki: A natural and expressive semantic wiki. In: Proceedings of the fifth international workshop on semantic web user interaction (SWUI 2008)—exploring HCI challenges, CEUR workshop proceedings, vol. 543. CEUR-WS. http://ceur-ws.org/Vol-543/kuhn_swui2008.pdf.

  • Kuhn, T. (2009). How controlled English can improve semantic wikis. In: Proceedings of the forth semantic wiki workshop (SemWiki 2009), CEUR Workshop Proceedings, vol. 464. CEUR-WS.

  • Kuhn, T. (2010). Controlled English for knowledge representation. Ph.D. thesis, faculty of economics, business administration and information technology of the University of Zurich.

  • Kuhn, T. (2012). Codeco: A practical notation for controlled English grammars in predictive editors. In: Proceedings of the second workshop on controlled natural language (CNL 2010), lecture notes in computer science, vol. 7175, pp. 95–114. Berlin: Springer.

  • Kuhn, T.: The understandability of OWL statements in controlled English. Semantic Web journal (to appear).

  • Kuhn, T., & Höfler, S.: Coral: Corpus access in controlled language. Corpora 7(2) (2012, to appear).

  • Kuhn, T., & Schwitter, R. (2008). Writing support for controlled natural languages. In: Proceedings of the Australasian language technology association workshop 2008, pp. 46–54.

  • Lappin S., Leass H. J. (1994) An algorithm for pronominal anaphora resolution. Computational Linguistics 20(4): 535–561

    Google Scholar 

  • Martin, P. (2002). Knowledge representation in CGLF, CGIF, KIF, Frame-CG and Formalized-English. In: Conceptual structures: Integration and interfaces—proceedings of the 10th international conference on conceptual structures (ICCS 2002), lecture notes in artificial intelligence, vol. 2393, pp. 77–91. Berlin: Springer.

  • Mueckstein, E. M. (1985). Controlled natural language interfaces: The best of three worlds. In: CSC ’85: Proceedings of the 1985 ACM thirteenth annual conference on computer science, pp. 176–178. ACM.

  • Naur P., Backus J. W., Bauer F. L., Green J., Katz C., McCarthy J., Perils A. J., Rutishauser H., Samelson K., Vauquois B., Wegstein J. H., van Wijngaarden A., Woodger M. (1963) Revised report on the algorithmic language ALGOL 60. Communications of the ACM 6(1): 1–17

    Article  Google Scholar 

  • Ogden, C. K. (1932). The A B C of basic English (in Basic). No. 43 in Psyche Miniatures General Series. K. Paul, Trench, Trubner, London.

  • Pereira, F., Warren, D. H. D. (1986). Definite clause grammars for language analysis. In: Readings in natural language processing, pp. 101–124. Los Altos, CA: Morgan Kaufmann Publishers.

  • Pollard C., Sag I. (1994) Head-driven phrase structure grammar. Studies in contemporary linguistics. Chicago University Press, Chicago

    Google Scholar 

  • Pool, J. (2006). Can controlled languages scale to the Web? In: Proceedings of the 5th international workshop on controlled language applications (CLAW 2006).

  • Power, R., Stevens, R., Scott, D., & Rector, A. (2009). Editing OWL through generated CNL. In: Pre-proceedings of the workshop on controlled natural language (CNL 2009), CEUR workshop proceedings, vol. 448. CEUR-WS.

  • Schwitter, R., Kaljurand, K., Cregan, A., Dolbear, C., & Hart, G. (2008). A comparison of three controlled natural languages for OWL 1.1. In: Proceedings of the fourth OWLED workshop on OWL: Experiences and directions, CEUR workshop proceedings, vol. 496. CEUR-WS.

  • Schwitter, R., Ljungberg, A., & Hood, D. (2003). ECOLE—a look-ahead editor for a controlled language. In: Controlled translation—proceedings of the joint conference combining the 8th international workshop of the European association for machine translation and the 4th controlled language application workshop (EAMT-CLAW03), pp. 141–150. Ireland: Dublin City University.

  • Shiffman, R. N., Michel, G., Krauthammer, M., Fuchs, N. E., Kaljurand, K., & Kuhn, T. (2010). Writing clinical practice guidelines in controlled natural language. In: Proceedings of the workshop on controlled natural language (CNL 2009), lecture notes in computer science, vol. 5972, pp. 265–280. Berlin: Springer.

  • Spreeuwenberg, S., & Anderson Healy, K. (2010). SBVR’s approach to controlled natural language. In: Proceedings of the workshop on controlled natural language (CNL 2009), lecture notes in computer science, vol. 5972, pp. 155–169. Berlin: Springer.

  • Steedman, M., & Baldridge, J. (2011). Combinatory categorial grammar. In: Non-transformational syntax, pp. 181–224. Wiley-Blackwell.

  • Sukkarieh, J. Z., & Pulman, S. G. (1999). Computer processable English and McLogic. In: Proceedings of the third international workshop on computational semantics, pp. 367–380.

  • Tennant, H. R., Ross, K. M., Saenz, R. M., Thompson, C. W., & Miller, J. R. (1983). Menu-based natural language understanding. In: Proceedings of the 21st annual meeting on association for computational linguistics, pp. 151–158. Association for Computational Linguistics.

  • Verbeke C. A. (1973) Caterpillar fundamental English. Training and Development Journal 27(2): 36–40

    Google Scholar 

  • Würsch, M., Ghezzi, G., Reif, G., & Gall, H. C. (2010). Supporting developers with natural language queries. In: ICSE ’10: Proceedings of the 32nd ACM/IEEE international conference on software engineering, pp. 165–174. ACM, New York, NY.

  • Wyner, A., Angelov, K., Barzdins, G., Damljanovic, D., Davis, B., & Fuchs, N. et al. (2010). On controlled natural languages: Properties and prospects. In: Proceedings of the workshop on controlled natural language (CNL 2009), lecture notes in computer science, vol. 5972, pp. 281–289. Berlin: Springer.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tobias Kuhn.

Additional information

The work presented here was funded by the research grant (Forschungskredit) programs 2006 and 2008 of the University of Zurich. I would like to thank Norbert E. Fuchs, George Herson, Stefan Höfler, Michael Hess, Kaarel Kaljurand, Marc Lutz, and Adam Wyner for comments, discussions and proof-reading on this paper and earlier versions thereof.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kuhn, T. A Principled Approach to Grammars for Controlled Natural Languages and Predictive Editors. J of Log Lang and Inf 22, 33–70 (2013). https://doi.org/10.1007/s10849-012-9167-z

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10849-012-9167-z

Keywords

Navigation