Corpora with Part-of-Speech Annotations for Three Regional Languages of France: Alsatian, Occitan and Picard
Type de ressource
Conference Paper
Auteurs/contributeurs
- Bernhard, Delphine (Author)
- Ligozat, Anne-Laure (Author)
- Martin, Fanny (Author)
- Bras, Myriam (Author)
- Magistry, Pierre (Author)
- Vergez-Couret, Marianne (Author)
- Steiblé, Lucie (Author)
- Erhart, Pascale (Author)
- Hathout, Nabil (Author)
- Huck, Dominique (Author)
- Rey, Christophe (Author)
- Reynés, Philippe (Author)
- Rosset, Sophie (Author)
- Sibille, Jean (Author)
- Lavergne, Thomas (Author)
Title
Corpora with Part-of-Speech Annotations for Three Regional Languages of France: Alsatian, Occitan and Picard
Abstract
This article describes the creation of corpora with part-of-speech annotations for three regional languages of France: Alsatian, Occitan and Picard. These manual annotations were performed in the context of the RESTAURE project, whose goal is to develop resources and tools for these under-resourced French regional languages. The article presents the tagsets used in the annotation process as well as the resulting annotated corpora.
Date
2018-05
Proceedings Title
11th edition of the Language Resources and Evaluation Conference
Place
Miyazaki, Japan
Short Title
Corpora with Part-of-Speech Annotations for Three Regional Languages of France
Accessed
05/03/2024 14:06
Library Catalog
HAL Archives Ouvertes
Référence
Bernhard, D., Ligozat, A.-L., Martin, F., Bras, M., Magistry, P., Vergez-Couret, M., Steiblé, L., Erhart, P., Hathout, N., Huck, D., Rey, C., Reynés, P., Rosset, S., Sibille, J., & Lavergne, T. (2018, May). Corpora with Part-of-Speech Annotations for Three Regional Languages of France: Alsatian, Occitan and Picard. 11th Edition of the Language Resources and Evaluation Conference. https://hal.science/hal-01704806
Corpus
Langue
Lien vers cette notice