UD_Latgalian-Cairo is an example treebank to provide minimal dataset for Latgalian based on the Cairo sample sentences. Created by AI Lab at Institute of Mathematics and Computer Science, University of Latvia.
This treebank was developed as a proof-of-concept by the team developing Latvian UD Treebank (UD_Latvian-LVTB). It contains the 20 Cairo example sentences and is as far as we are know the only Latgalian treebank in existance.
This work was supported by the State Research Programme's project Research on Modern Latvian Language and Development of Language Technology under the grant agreement No. VPP-LETONIKA-2021/1-0006.
- Pretkalniņa L., Rituma L., Saulīte B. Deriving enhanced Universal Dependencies from a hybrid dependency-constituency treebank. Proceedings of the 21sh International Conference Text, Speech, and Dialogue, LNCS, Vol. 11107, Springer Link, 2018, pp. 95-105
- 2024-05-15 v2.14
- Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================ Data available since: UD v2.14 License: CC BY-SA 4.0 Includes text: yes Genre: grammar-examples Lemmas: manual native UPOS: converted from manual XPOS: manual native Features: converted from manual Relations: converted from manual Contributors: Pretkalniņa, Lauma; Nešpore-Bērzkalne, Gunta Contributing: elsewhere Contact: [email protected] ===============================================================================