asusevski/check_repeated_tokens
Source: Hugging Face. Updated: 2024-10-20. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/asusevski/check_repeated_tokens
Description:
The dataset provides the following feature fields: id, language, target text, back-translated text, reference text, BERT scores (F1, precision, recall), a BLEU score (2-gram normalization), lengths, and a flag. It has a single training split of 3,906,896 samples, with a file size of 1,468,848,306 bytes (about 1.47 GB).
Provider:
asusevski
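
Example (a minimal sketch, not part of the original listing): the snippet below shows one way to inspect the dataset described above. It assumes the third-party `datasets` library is installed and that the dataset is reachable under the ID shown in the title. The helper names `average_bytes_per_sample` and `peek_rows` are hypothetical, introduced only for illustration. No column names are hard-coded, since the listing gives only descriptive field names.

```python
from itertools import islice

# Figures copied from the listing above.
DATASET_ID = "asusevski/check_repeated_tokens"
NUM_TRAIN_SAMPLES = 3_906_896
FILE_SIZE_BYTES = 1_468_848_306


def average_bytes_per_sample(size_bytes: int = FILE_SIZE_BYTES,
                             num_samples: int = NUM_TRAIN_SAMPLES) -> float:
    """Return the mean stored size of one sample, in bytes."""
    return size_bytes / num_samples


def peek_rows(n: int = 3) -> list:
    """Stream the first n rows of the train split.

    Streaming avoids downloading the full file up front.
    Assumption: the `datasets` package is installed and the network is available.
    """
    from datasets import load_dataset  # imported lazily; third-party dependency

    stream = load_dataset(DATASET_ID, split="train", streaming=True)
    return list(islice(stream, n))
```

Calling `average_bytes_per_sample()` gives roughly 376 bytes per sample. Calling `peek_rows(1)[0].keys()` should list the actual column names, which can then be matched against the fields in the description. To fetch through a mirror instead of the default Hub endpoint, the `HF_ENDPOINT` environment variable can be set before Python starts.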



