dgambettaphd/D_gen8_run2_llama2-7b_wiki_doc1000_real64_synt64
收藏Hugging Face2024-12-04 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/dgambettaphd/D_gen8_run2_llama2-7b_wiki_doc1000_real64_synt64
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两个特征:id(文档的唯一标识,数据类型为int64)和doc(文档内容,数据类型为string)。数据集被分割为训练集,包含1000个样本,总大小为586816字节。数据文件位于data/train-*路径下。
This dataset includes two features: id (a unique identifier for the document, dtype: int64) and doc (the document content, dtype: string). The dataset is split into a training set with 1000 examples, totaling 586816 bytes. The data files are located at data/train-*.
提供机构:
dgambettaphd



