mkshing/xlsum_ja
收藏Hugging Face2023-06-20 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/mkshing/xlsum_ja
下载链接
链接失效反馈官方服务:
资源简介:
这是XL-Sum数据集的过滤后的日语子集,遵循了PaLM 2的研究。过滤条件为15-gram重叠。数据集包含训练集4215个样本(原为7113个),验证集758个样本(原为889个),测试集766个样本(原为889个)。
This is the filtered Japanese subset of the XL-Sum dataset, adhering to the filtering criteria used in the PaLM 2 research. The filtering is conducted based on 15-gram overlap. The dataset contains 4,215 training samples (originally 7,113), 758 validation samples (originally 889), and 766 test samples (originally 889).
提供机构:
mkshing



