FrancophonIA/Benchmark_Database_Phonetic_Alignments
收藏Hugging Face2025-03-30 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/FrancophonIA/Benchmark_Database_Phonetic_Alignments
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为“ Benchmark Database for Phonetic Alignments”(语音对齐基准数据库,BDPA),它是一个提供不同语言变体中的同源词汇集合的新数据资源。与关注同源性和词汇变化的其他资源不同,BDPA以成对和对多的对齐形式表示数据。对齐是一个矩阵表示,其中两个或更多序列中的对应部分被放置在同一列中,不匹配部分产生的空单元格由间隙符号填充。目前,BDPA基于12种不同的语言和方言变体提供了总共750个对齐。
The dataset named Benchmark Database for Phonetic Alignments (BDPA) is a new data resource that offers collections of cognate words from different language varieties. Unlike other resources that focus on questions of cognacy and lexical change, the BDPA represents the data in the form of pairwise and multiple alignments. An alignment is a matrix representation of two or more sequences in which corresponding segments in the sequences are placed in the same column, with empty cells resulting from non-matching segments being filled by gap symbols. Currently, the BDPA provides a total of 750 alignments based on 12 different sources of language and dialect varieties.
提供机构:
FrancophonIA



