COLaF
COLaF
About
Funding
Partners
Team
Results
Publications
Bibliographic sources
Identified ressources
Contact
Light
Dark
Automatic
English
Français
Résultats
Tools
Deucalion
Lemmatization service
Pyrrha
Post-correction of lemmatized and morphosyntactic tagged corpora
Release the Kraken (RTK)
Task managament scripting librairy for image2tei pipeline
XML-TEI Schema
Schema and documentation for the text encoding in COLaF
Data
Layout Analysis Dataset with SegmOnto (LADaS)
Diachronic and multidocuments layout analysis dataset
Molyé
Modern texts corpus with creole and french variations
Picard Contest
Prose, poetry and drama written in picard
Works in progress
Forum Occitania
Occitan-spoken discussion forum
LADaS2TEI
Python script to convert OCR output into TEI
Nubis
Monographies from the Sorbonne Library (NuBIS)