OntoLex

OntoLex is the short name of a vocabulary for lexical resources in the web of data (OntoLex-Lemon) and the short name of the W3C community group that created it (W3C Ontology-Lexica Community Group).[1]

Applications

OntoLex-Lemon is widely used for lexical resources in the context of Linguistic Linked Open Data. Selected applications include

  • OASIS Lexicographic Infrastructure Data Model and API (LEXIDMA), a framework for internationally interoperable lexicographic work[14]
  • European public multilingual knowledge infrastructure [15][16]
  • Lex0, a collaborative web editor used for the creation and management of (multilingual) lexical and terminological resources as linked data resources[17]
  • VocBench, a web-based, multilingual, collaborative development platform for managing ontologies, thesauri, lexicons and RDF data[18][19][20]
  • The Lexicala API by K Dictionaries that provides access to cross-lingual lexical data of 50 languages and 150 language pairs.[21]
  • DiTMAO, a lexicographic editor developed for creating the Dictionary of Old Occitan medico-botanical terminology[22]
  • a series of Shared Tasks on Translation Inference Across Dictionaries (TIAD-2017[23][24], TIAD-2019[25][26], TIAD-2020[27])
  • DBnary, RDF edition of 16 language editions of Wiktionary[28][29]
  • PanLex, a large-scale lexical network of about 2,500 dictionaries and more than 500 languages[30]
  • Princeton WordNet 3.1, a large-scale, hierarchically and relationally structured lexical resource for English[31]
  • Global WordNet Association, a community effort to produce, maintain and interlink multilingual WordNets[32]
  • BabelNet, a large-scale multilingual lexical network[33][34]
  • LiLa, a knowledge base of linguistic resources for Latin based on a large lexicon consisting of a collection of citation forms[35][36][37]

OntoLex development is regularly addressed in scientific events dedicated to ontologies, linked data or lexicography. Since 2017, a designated workshop series on the OntoLex module is conducted biannually.[38]

Related vocabularies that focus on standardizing and publishing lexical resources include DICT (text-based format), the XML Dictionary eXchange Format, TEI-Dict (XML) and the Lexical Markup Framework (abstract model usually serialized in XML; the Lemon vocabulary originally evolved from an RDF serialization of LMF). OntoLex-Lemon differs from these earlier models in being a native Linked Open Data vocabulary that does not (just) formalize structure and semantics of machine-readable dictionaries, but is designed to facilitate information integration between them.

References

  1. "OntoLex community portal". W3C. Retrieved 6 December 2019.
  2. Cimiano, Phillip; McCrae, John P.; Buitelaar, Paul. "Lexicon Model for Ontologies: Community Report, 10 May 2016 Final Community Group Report 10 May 2016". W3C. Retrieved 6 December 2019.
  3. Julia Bosque-Gil, Jorge Gracia and Elena Montiel-Ponsoda (July 2017). "Towards a module for lexicography in OntoLex" (PDF). Kernerman Dictionary News (25). Retrieved 5 April 2020.
  4. McCrae, John; Spohr, Dennis; Cimiano, Philipp (2011). "Linking lexical resources and ontologies on the Semantic Web with Lemon". Proceedings of the Extended Semantic Web Conference (ESWC-2011), Iraklion, Greece: 245–259.
  5. Bosque-Gil, Julia; Gracia, Jorge. "The OntoLex Lemon Lexicography Module". W3C. Retrieved 6 December 2019.
  6. Fiorelli, Manuel; Stellato, Armando; McCrae, John P.; Cimiano, Philipp; Pazienza, Maria Teresa (2015). Gandon, Fabien; Sabou, Marta; Sack, Harald; d’Amato, Claudia; Cudré-Mauroux, Philippe; Zimmermann, Antoine (eds.). "LIME: The Metadata Module for OntoLex". The Semantic Web. Latest Advances and New Domains. Lecture Notes in Computer Science. Springer International Publishing. 9088: 321–336. doi:10.1007/978-3-319-18818-8_20. ISBN 978-3-319-18818-8.
  7. "Linguistic Linked Open Data. Information about the current status of the growing cloud of linguistic linked open data". Retrieved 10 December 2019.
  8. Bosque-Gil, Julia; Gracia, Jorge. "The OntoLex Lemon Lexicography Module Final Community Group Report 17 September 2019". W3C. Retrieved 10 December 2019.
  9. "Morphology". Retrieved 10 December 2019.
  10. Klimek, Bettina; McCrae, John P.; Bosque-Gil, Julia; Ionov, Maxim; Tauber, James K.; Chiarcos, Christian. Challenges for the Representation of Morphology in Ontology Lexicons, in: Kosem, I., Zingano Kuhn, T., Correia, M., Ferreria, J. P., Jansen, M., Pereira, I., Kallas, J., Jakubíček, M., Krek, S. & Tiberius, C. (eds.) 2019. Electronic lexicography in the 21st century. Proceedings of the eLex 2019 conference. 1-3 October 2019, Sintra, Portugal (PDF). Brno: Lexical Computing CZ, s.r.o. pp. 570–591.
  11. "Frequency, Attestation and Corpus Information". Retrieved 10 December 2019.
  12. Chiarcos, Christian; Ionov, Maxim. "OntoLex-Lemon Module for Frequency, Attestation and Corpus Information (draft specification)". Retrieved 9 April 2020.
  13. "LexInfo - Data Category Ontology for OntoLex-Lemon". Retrieved 4 January 2020.
  14. censign. "Call for Participation: OASIS Lexicographic Infrastructure Data Model and API (LEXIDMA) TC". OASIS. Retrieved 10 December 2019.
  15. Schmitz, P.; Francesconi, E.; Hajlaoui, N.; Batouche, B.; Stellato, A. (2018). Semantic Interoperability of Multilingual Language Resources by Automatic Mapping, In: International Conference on Electronic Government and the Information Systems Perspective. Cham: Springer. pp. 153–163.
  16. Batouche, Brahim; Schmitz, Peter; Francesconi, Enrico; Hajlaoui, Najeh (12/02/2018). PMKI–Public Multilingual Knowledge. Documentation of the PMKI data modelInfrastructure (PDF). European Technical Specification. Retrieved 10 December 2019. Check date values in: |date= (help)
  17. Lenardič, Jakob. "CLARIN-IT presents LexO: Where Lexicography Meets the Semantic Web". CLARIN. Retrieved 10 December 2019.
  18. The AIMS Team. "Version 4.0.2 of VocBench was released in August 2018". FAO of the United Nations in Italy. Retrieved 10 December 2019.
  19. Stellato, Armando; Rajbhandari, Sachit; Turbati, Andrea; Fiorelli, Manuel; Caracciolo, Caterina; Lorenzetti, Tiziano; Keizer, Johannes; Pazienza, Maria Teresa (2015). Gandon, Fabien; Sabou, Marta; Sack, Harald; d’Amato, Claudia; Cudré-Mauroux, Philippe; Zimmermann, Antoine (eds.). "VocBench: A Web Application for Collaborative Development of Multilingual Thesauri". The Semantic Web. Latest Advances and New Domains. Lecture Notes in Computer Science. Springer International Publishing. 9088: 38–53. doi:10.1007/978-3-319-18818-8_3. ISBN 978-3-319-18818-8.
  20. "VocBench 3: a Collaborative Semantic Web Editor for Ontologies, Thesauri and Lexicons | www.semantic-web-journal.net". semantic-web-journal.net. Retrieved 2020-01-17.
  21. Ilan Kernerman and Dorielle Lonke (July 2019). "Lexicala API: A new era in dictionary data" (PDF). Kernerman Dictionary News (27). Retrieved 5 April 2020.
  22. "Dictionary of Old Occitan medico-botanical terminology". Retrieved 10 December 2019.
  23. "TIAD-2017 Shared Task – Translation Inference Across Dictionaries. Call for Participation". Retrieved 10 December 2019.
  24. McCrae, John P.; Bond, Francis; Buitelaar, Paul; Cimiano, Philipp; Declerck, Thierry; Gracia, Jorge; Kernerman, Ilan; Montiel Ponsoda, Elena; Ordan, Noam; Piasacki, Maciej (June 18, 2017). Proceedings of the LDK 2017 Workshops: 1st Workshop on the OntoLex Model (OntoLex-2017), Shared Task on Translation Inference Across Dictionaries & Challenges for Wordnets. CEUR. Retrieved 10 December 2019.
  25. "TIAD 2019. 2nd Translation Inference Across Dictionaries (TIAD) Shared Task". Retrieved 10 December 2019.
  26. Gracia, Jorge; Kabashi, Besim; Kernerman, Ilan (May 20, 2019). Proceedings of TIAD-2019 Shared Task – Translation Inference Across Dictionaries. Leipzig, Germany: CEUR.
  27. "TIAD 2020 -- 2rd Translation Inference Across Dictionaries (TIAD) shared task".
  28. "Dbnary Wiktionary as Linguistic Linked Open Data". Retrieved 10 December 2019.
  29. Sérasset, Gilles (2016). "DBnary: Wiktionary as a Lemon-Based Multilingual Lexical Resource in RDF". Semantic Web. Retrieved 10 December 2019.
  30. Kamholz, David; Pool, Jonathan; Colowick, Susan M. (2014). PanLex: Building a Resource for Panlingual Lexical Translation, In Proceedings of the 9th Language Resource and Evaluation Conference (LREC-2014), Reykjavik, Iceland, May 2014. European Language Resource Association. pp. 3145–3150. Retrieved 10 December 2019.
  31. "Princeton WordNet 3.1. WordNet RDF". Retrieved 10 December 2019.
  32. "Global Wordnet Formats: RDF". Retrieved 10 December 2019.
  33. "BabelNet SPARQL endpoint". Retrieved 10 December 2019.
  34. Ehrmann, M.; Ceccioni, F.; Vanella, D.; McCrae, J.P.; Cimiano, P.; Navigli, R. Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0. In: Proceedings of the 9th Language Resource and Evaluation Conference (LREC-2014), Reykjavik, Iceland, May 2014. European Language Resource Association. pp. 401–408. Retrieved 10 December 2019.
  35. "LiLa SPARQL endpoint". Retrieved 4 April 2020.
  36. "LiLa query interface". Retrieved 4 April 2020.
  37. Passarotti, M.C.; Cecchini, F.M.; Franzini, G.; Litta, E.; Mambrini, F.; Ruffolo, P. LiLa: Linking Latin. A Knowledge Base of Linguistic Resources and NLP Tools. In: Proceedings of the 2nd Conference on Language, Data and Knowledge (LDK 2019), Leipzig, Germany, 20-23 May 2019. CEUR Workshop Proceedings. Retrieved 4 April 2020.
  38. Cimiano, Philipp (July 2017). "OntoLex 2017 – 1st workshop on the OntoLex model" (PDF). Kernerman Dictionary News (25). Retrieved 5 April 2020.
  • OntoLex-Lemon specification
  • OntoLex-Lemon lexicography module
  • OntoLex Github repository
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.