FrancophonIA/CorpusDRF
Source: Hugging Face. Updated 2025-03-30. Indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/FrancophonIA/CorpusDRF
Description:
CorpusDRF is an open-source, digitized collection of French regionalisms, their parts of speech, and their recognition rates, drawn from the Dictionnaire des régionalismes de France (DRF, Dictionary of Regionalisms of France). It makes the largest-scale study of French regionalisms in the 20th century available for visualization and analysis using publicly available data. The dataset records, in tabular form, the numerical recognition rate of each DRF entry in each of the 94 departments of continental France. Each CSV file has 95 rows (one header row plus one row for each of the 94 departments) and one column for each of the 936 DRF entries. The dataset comes in three versions, each handling missing values (NAs) differently.
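The table layout described above (one row per department, one column per DRF entry) can be sanity-checked with a short script. The following is a minimal sketch using only the Python standard library. The function name `summarize_recognition_table`, the assumption that the first column identifies the department, and the assumption that missing values appear as empty cells or the string `NA` are all illustrative and are not confirmed by the dataset description. The sample data is made up.

```python
import csv
import io


def summarize_recognition_table(handle, na_tokens=("", "NA")):
    """Return (n_departments, n_entries, n_missing) for one CSV table.

    Assumption: the first column identifies the department and every
    other column is one DRF entry. Cells matching na_tokens count as
    missing values.
    """
    reader = csv.reader(handle)
    header = next(reader)      # first row holds the column names
    entries = header[1:]       # skip the assumed department column
    n_departments = 0
    n_missing = 0
    for row in reader:
        if not row:            # ignore completely blank lines
            continue
        n_departments += 1
        n_missing += sum(1 for cell in row[1:] if cell.strip() in na_tokens)
    return n_departments, len(entries), n_missing


# Tiny made-up sample, not real DRF data: 2 departments, 2 entries.
sample = io.StringIO("department,entry_a,entry_b\n01,12.5,NA\n02,,40\n")
print(summarize_recognition_table(sample))
```

If the layout assumption holds, running the function on an opened file from the dataset (for example `open(path, newline="", encoding="utf-8")`, with `path` pointing at a locally downloaded CSV) should report 94 departments and 936 entries. The missing-value count would then be expected to differ across the three versions, depending on how each one encodes NAs.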
Provider:
FrancophonIA



