timaeus/pubmed_central_max_loss_delta_ablation_l0h5
收藏Hugging Face2025-03-18 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/timaeus/pubmed_central_max_loss_delta_ablation_l0h5
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含文本和元数据信息的训练数据集,其中文本字段包含文本内容,元数据字段包含数据集的pile_set_name信息。数据集分为训练集,共有10000个示例,总大小为79724706字节。
This is a training dataset containing text and metadata information, where the text field includes the text content, and the metadata field includes the pile_set_name information of the dataset. The dataset is split into a training set with a total of 10,000 examples and a total size of 79724706 bytes.
提供机构:
timaeus



