Using ontologies to interlink linguistic annotations and improve their accuracy

Written by
Peer-reviewed:
Permalink:
Language: English

© 2016 Antonio Pareja-Lora.


Crossref Similarity Check logo

Abstract

For the new approaches to language e-learning (e.g. language blended learning, language autonomous learning or mobile-assisted language learning) to succeed, some automatic functions for error correction (for instance, in exercises) will have to be included in the long run in the corresponding environments and/or applications. A possible way to achieve this is to use some Natural Language Processing (NLP) functions within language e-learning applications. These functions should be based on some truly reliable and wide-coverage linguistic annotation tools (e.g. a Part-Of-Speech (POS) tagger, a syntactic parser and/or a semantic tagger). However, linguistic annotation tools usually introduce a not insignificant rate of errors and ambiguities when tagging, which prevents them from being used ‘as is’ for this purpose. In this paper, we present an annotation architecture and methodology that has helped reduce the rate of errors in POS tagging, by making several POS taggers interoperate and supplement each other. We also introduce briefly the set of ontologies that have helped all these tools intercommunicate and collaborate in order to produce a more accurate joint POS tagging, and how these ontologies were used towards this end. The resulting POS tagging error rate is around 6%, which should allow this function to be included in language e-learning applications for the purpose aforementioned.

Keywords: ontology, interoperability, POS tagging, accuracy, linguistic annotation, tools.

References

Borst, W. N. (1997). Construction of engineering ontologies. PhD thesis. Enschede. Netherlands: University of Twente.

Gruber, T. R. (1993). A translation approach to portable ontologies. Journal on Knowledge Acquisition, 5(2), 199-220. Retrieved from http://dx.doi.org/10.1006/knac.1993.1008

Pareja-Lora, A. (2012a). Providing linked linguistic and semantic web annotations – The OntoTag hybrid annotation model. Saarbrücken: LAP – LAMBERT Academic Publishing.

Pareja-Lora, A. (2012b). OntoTag: a linguistic and ontological annotation model suitable for the semantic web. PhD thesis. Madrid: Universidad Politécnica De Madrid. Retrieved from http://oa.upm.es/13827/

Pareja-Lora, A., & Aguado de Cea, G. (2010). Ontology-based interoperation of linguistic tools for an improved lemma annotation in Spanish. In Proceedings of the 7th Conference on Language Resources and Evaluation (LREC 2010) (pp. 1476-1482). Valletta, Malta: ELDA.

Urbano-Mendaña, M., Corpas-Pastor, G., & Mitkov, R. (2013). NLP-enhanced self-study learning materials for quality healthcare in Europe. In Proceedings of the “Workshop on optimizing understanding in multilingual hospital encounters”. 10th International Conference on Terminology and Artificial Intelligence (TIA’2013) (pp. 29-32). Paris, France: Laboratoire d’Informatique de Paris Nord (LIPN).

How to cite

Citation is provided in standard text format below. For full citation export options, click Export citation.

Pareja-Lora, Antonio. (2016). Using ontologies to interlink linguistic annotations and improve their accuracy. In Antonio Pareja-Lora, Cristina Calle-Martínez, Pilar Rodríguez-Arancón (Eds), New perspectives on teaching and working with languages in the digital era (pp. 351-362). Research-publishing.net. https://doi.org/10.14705/rpnet.2016.tislid2014.447

Request permissions

This article is published under the Attribution-NonCommercial-NoDerivatives International 4.0 (CC BY-NC-ND 4.0) licence. Under this licence, the contents are freely available online (as PDF files) for anybody to read, download, copy, and redistribute provided that the AUTHOR(s), EDITORIAL TEAM and PUBLISHER are properly cited. Commercial use and derivative works are, however, not permitted.

Permission is not required for the republication of tables, figures or illustrations, as long as they are reproduced accurately and the source material is fully cited. It may be the case that the licence does not give you all of the permissions necessary for your intended use. If this is your current situation, please do feel free to ask the copyright holders, i.e. the authors; email addresses are listed on individual papers.

From the same author

Toward Mobile Assisted Language Learning Apps for Professionals that Integrate Learning into the Daily Routine
Pareja-Lora, Antonio; Arús-Hita, Jorge; Read, Timothy; Rodríguez-Arancón, Pilar; Calle-Martínez, Cristina; Pomposo, Lourdes; Martín-Monje, Elena; Bárcena, Elena.
doi:10.14705/rpnet.2013.000162
Preliminary conclusions after the design and implementation of mobile learning apps for professionals
Castrillo, María Dolores; Bárcena, Elena; Pareja-Lora, Antonio.
doi:10.14705/rpnet.2014.000192
State of the art of language learning design using mobile technology: sample apps and some critical reflection
Bárcena, Elena; Read, Timothy; Underwood, Joshua; Obari, Hiroyuki; Cojocnean, Diana; Koyama, Toshiko; Pareja-Lora, Antonio; Calle, Cristina; Pomposo, Lourdes; Talaván, Noa; Ávila-Cabrera, José; Ibañez, Ana; Vermeulen, Anna; Jordano, María; Arús-Hita, Jorge; Rodríguez, Pilar; Castrillo, María Dolores; Kétyi, Andras; Selwood, Jaime; Gaved, Mark; Kukulska-Hulme, Agnes.
doi:10.14705/rpnet.2015.000307
Acknowledgements
Pareja-Lora, Antonio; Calle-Martínez, Cristina; Rodríguez-Arancón, Pilar.
doi:10.14705/rpnet.2016.tislid2014.417
Applying information and communication technologies to language teaching and research: an overview
Pareja-Lora, Antonio; Rodríguez-Arancón, Pilar; Calle-Martínez, Cristina.
doi:10.14705/rpnet.2016.tislid2014.418
Design and implementation of BusinessApp, a MALL application to make successful business presentations
Calle-Martínez, Cristina; Pomposo Yanes, Lourdes; Pareja-Lora, Antonio.
doi:10.14705/rpnet.2016.tislid2014.437