Results

Tools

Deucalion

Lemmatization service

Pyrrha

Post-correction of lemmatized and morphosyntactically tagged corpora

Release the Kraken (RTK)

Task management scripting library for the image2tei pipeline

XML-TEI Schema

Schema and documentation for text encoding in COLaF

Data

Layout Analysis Dataset with SegmOnto (LADaS)

Diachronic, multi-document layout analysis dataset

Molyé

Corpus of modern texts with Creole and French variations

Picard Contest

Prose, poetry and drama written in Picard

Works in progress

Forum Occitania

Occitan-language discussion forum

LADaS2TEI

Python script to convert OCR output into TEI

Nubis

Monographs from the Sorbonne Library (NuBIS)