OKanishcheva/ASSETUK
收藏Hugging Face2024-06-22 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/OKanishcheva/ASSETUK
下载链接
链接失效反馈官方服务:
资源简介:
ASSETUK数据集是通过将ASSET数据集翻译成乌克兰语并手动检查翻译结果而获得的。该语料库由2000个验证句子和359个测试句子组成,每个句子由不同的注释者简化了10次。
The ASSETUK dataset, obtained by translating the ASSET dataset into Ukrainian and then manually checking the translation. The corpus is composed of 2000 validation and 359 test original sentences, each simplified 10 times by different annotators.
提供机构:
OKanishcheva
原始信息汇总
ASSETUK 数据集
概述
- 名称: ASSETUK
- 语言: 乌克兰语
- 许可: CC BY 4.0
- 大小: 10K < n < 100K
- 别名: Dataset for Ukrainian Text Simplification
数据组成
- 验证集: 2000 个原始句子
- 测试集: 359 个原始句子
- 简化版本: 每个原始句子由不同的标注者简化10次,文件格式为
dataset/asset.{valid,test}.simp.{0,1,2,3,4,5,6,7,8,9}
引用
Olha Kanishcheva. 2023. ASSETUK: a Dataset for Ukrainian Text Simplification, 10th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, April 21-23, 2023, Poznań, Poland, pp. 122-125.



