
Le Breton Usuel — Grammaire, Vocabulaire, Conversations – Parallel Corpus (Breton Vannetais / French)

Source: Zenodo. Updated: 2026-05-25. Indexed: 2026-05-26.
Download link:
https://zenodo.org/doi/10.5281/zenodo.20375475
Description:
This dataset contains a sentence-aligned Breton Vannetais / French parallel corpus derived from the second enlarged edition (1934) of Le Breton Usuel — Grammaire, Vocabulaire, Conversations by Loeiz Herrieu, published by Éditions Dihunamb in Lorient. The corpus belongs to the HERITAGE category within the IAgwened corpus framework and preserves the historical orthography and dialectal characteristics of the original printed source as faithfully as possible.

The digitized source originates from a historical copy formerly owned by Camille Brazideg (1922–?), later rector of Brec'h (Morbihan). The volume contains handwritten annotations and corrections produced by former readers, providing additional evidence of Breton language transmission and pedagogical practices during the twentieth century.

The dataset includes:
- sentence-aligned Breton / French text,
- annotated and normalized corpus variants,
- editorial metadata,
- documentation files,
- and scans of the original printed cover.

The resource was prepared for:
- corpus linguistics,
- Breton language preservation,
- digital humanities,
- natural language processing (NLP),
- machine translation,
- and research on under-resourced Celtic languages.

The dataset was curated within the IAgwened – Breton Language and AI Project at the Institut Culturel de Bretagne.
Provider:
Zenodo
Created:
2026-05-25