Prise en compte de la variation dans l'annotation automatique morphosyntaxique de l'occitan

Type de ressource
Presentation
Auteurs/contributeurs
Title
Prise en compte de la variation dans l'annotation automatique morphosyntaxique de l'occitan
Abstract
Occitan is a Romance language of France, a little part of Italy and Spain. It includes many written variations, dialectal and spelling variations. Being able to take variation into account is a major challenge to provide the language. Automatic processing of Occitan has been developing over the last ten years. Resources and tools have been developed and are beginning to take dialectal variation into account in these works. However, graphical variation is rarely taken into account. Our research focuses on the automatic annotation into lemmas, parts of speech and verbal inflection of a corpus of texts containing these two types of variation. From this corpus we train robust automatic annotation tools on global variation in Occitan.
Date
2023-11
Accessed
02/08/2024 14:29
Extra
Pages: 15-22 Published: 5èmes journées du Groupement de Recherche CNRS “ Linguistique Informatique, Formelle et de Terrain ” LIFT
Notes

Poster

Référence
Poujade, C., Fort, K., Gardent, C., & Parmentier, Y. (2023, November). Prise en compte de la variation dans l’annotation automatique morphosyntaxique de l’occitan. https://hal.science/hal-04622672