ParlaMintCAT corpus
Source: DataCite Commons. Updated: 2025-06-10. Indexed: 2024-07-13.
Download link:
https://dataverse.csuc.cat/citation?persistentId=doi:10.34810/data1137
Description:
Parliamentary speeches are of interest to several research areas because they are publicly available transcriptions, produced under controlled and regulated procedures that supply reliable sociodemographic data about the speakers, such as gender, age, and other details. Moreover, the speeches cover a wide range of topics and domains, and they are public domain data, not subject to copyright restrictions. The ParlaMint project (Towards Comparable Parliamentary Corpora) is developing a comparable and uniformly annotated multilingual corpus with data from 33 parliaments in Europe. This paper describes the building of the ParlaMintCAT corpus, for which the transcriptions of the Catalan Parliament General Assembly sessions from 2015 to 2022 have been compiled, processed and annotated.
Provider:
CORA.Repositori de Dades de Recerca
Created:
2024-02-26



