LM-Polygraph/xsum
Source: Hugging Face. Updated 2025-07-16. Indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/LM-Polygraph/xsum
Description:
This is a preprocessed version of the xsum dataset, used for benchmarks in LM-Polygraph. It contains a subset named continuation, which corresponds to the main dataset used in LM-Polygraph. The data is split into training and test sets, each with two string columns, input and output, which hold the input and output for LM-Polygraph; the dataset card lists the byte size and number of examples for each split. The dataset exists to separate dataset creation code from benchmarking code. The source data comes from https://huggingface.co/datasets/xsum and is processed with the build_dataset.py script in the repository. The dataset inherits the biases, risks, and limitations of its source dataset.
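The structure described above can be checked with a short script. This is a minimal sketch, not the LM-Polygraph loading code: it assumes the Hugging Face `datasets` library is installed, that the subset is exposed as the configuration name `continuation`, and that the splits are named `train` and `test`. The helper names `check_rows` and `load_split` are hypothetical.

```python
from typing import Iterable, Mapping

DATASET_ID = "LM-Polygraph/xsum"
CONFIG_NAME = "continuation"          # assumed configuration name for the subset
EXPECTED_COLUMNS = ("input", "output")


def check_rows(rows: Iterable[Mapping[str, object]]) -> int:
    """Verify that every row has string `input` and `output` fields; return the row count."""
    count = 0
    for row in rows:
        for column in EXPECTED_COLUMNS:
            if not isinstance(row.get(column), str):
                raise ValueError(f"row {count} has no string column {column!r}")
        count += 1
    return count


def load_split(split: str = "test"):
    """Download one split from the Hugging Face Hub (requires network access)."""
    # Imported lazily so check_rows stays usable without the library installed.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, CONFIG_NAME, split=split)


if __name__ == "__main__":
    test_split = load_split("test")   # split name is an assumption
    print(check_rows(test_split), "test rows with string input/output columns")
```

Running the script downloads the test split and reports how many rows passed the column check; a missing or non-string column raises `ValueError` instead.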
Provider:
LM-Polygraph



