Sasoribi/sample_data
Source: Hugging Face | Updated: 2025-11-02 | Indexed: 2025-11-15
Download link:
https://hf-mirror.com/datasets/Sasoribi/sample_data
Resource overview:
The dataset has four fields: text (text content, string), source (origin of the text, string), token_count (token count, integer), and __index_level_0__ (index, integer). It has a single split, train, containing 1,284,879 examples and occupying 8,555,372,905 bytes (about 8.56 GB), which is also the total size of the dataset. The download size is 4,469,124,020 bytes (about 4.47 GB).
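The schema and size figures above can be turned into a small sanity check. The following is a minimal sketch, not an official loader: the helper names (`check_record`, `stream_train`, `average_bytes_per_example`) are hypothetical, and it assumes that the Hugging Face repository ID is `Sasoribi/sample_data` (inferred from the mirror URL) and that the `datasets` library is installed if streaming is used.

```python
from typing import Any, Dict, Iterator

# Figures quoted in the listing above.
DATASET_SIZE_BYTES = 8_555_372_905
DOWNLOAD_SIZE_BYTES = 4_469_124_020
NUM_TRAIN_EXAMPLES = 1_284_879

# Field names and types as described in the listing.
EXPECTED_FIELDS = {
    "text": str,
    "source": str,
    "token_count": int,
    "__index_level_0__": int,
}


def check_record(record: Dict[str, Any]) -> bool:
    """Return True if the record has exactly the listed fields and types."""
    if set(record) != set(EXPECTED_FIELDS):
        return False
    return all(
        isinstance(record[name], expected)
        for name, expected in EXPECTED_FIELDS.items()
    )


def average_bytes_per_example() -> float:
    """Average in-memory size of one training example, in bytes."""
    return DATASET_SIZE_BYTES / NUM_TRAIN_EXAMPLES


def stream_train(repo_id: str = "Sasoribi/sample_data") -> Iterator[Dict[str, Any]]:
    """Iterate over the train split without downloading it in full.

    Assumption: repo_id matches the Hugging Face repository behind the
    mirror URL in the listing. Requires the `datasets` package and network
    access, so the import is deferred until the function is called.
    """
    from datasets import load_dataset

    return iter(load_dataset(repo_id, split="train", streaming=True))
```

Streaming avoids fetching the full download of roughly 4.47 GB just to inspect a few rows; for example, `check_record(next(stream_train()))` would validate the first record against the listed schema, provided the repository is reachable.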
Provider:
Sasoribi



