EthanHosier/chunked_text_piano_llm
Source: Hugging Face. Updated: 2025-06-24. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/EthanHosier/chunked_text_piano_llm
Description:
chunked_text_piano_llm is a text dataset with four fields: url (string), chunk_size (integer), chunk_index (integer), and tokens (string). It has a single split, train, with 1,357,739 examples totaling 24,436,133,411 bytes (about 24.4 GB). A default configuration is provided that includes the data files for the train split.
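The four-field schema described above can be sketched in Python as follows. This is a minimal illustration, not the dataset author's code: the helper names validate_record and stream_records and the sample record are hypothetical, and the meaning of chunk_size and chunk_index is not specified by the listing. Streaming assumes the third-party Hugging Face datasets package is installed and avoids downloading the full 24 GB up front.

```python
from typing import Any, Iterator, Mapping

DATASET_ID = "EthanHosier/chunked_text_piano_llm"

# Field names and types as stated in the description above.
EXPECTED_FIELDS = {
    "url": str,
    "chunk_size": int,
    "chunk_index": int,
    "tokens": str,
}


def validate_record(record: Mapping[str, Any]) -> bool:
    """Return True if the record has exactly the four expected fields with the expected types."""
    if set(record) != set(EXPECTED_FIELDS):
        return False
    return all(isinstance(record[name], t) for name, t in EXPECTED_FIELDS.items())


def stream_records(limit: int = 3) -> Iterator[Mapping[str, Any]]:
    """Yield the first `limit` records of the train split without a full download.

    Assumption: the `datasets` package is installed and the Hub is reachable.
    """
    from datasets import load_dataset  # imported lazily; third-party dependency

    ds = load_dataset(DATASET_ID, split="train", streaming=True)
    for i, record in enumerate(ds):
        if i >= limit:
            break
        yield record


# Hypothetical record, invented for illustration only (not taken from the dataset).
sample = {
    "url": "https://example.com/page",
    "chunk_size": 512,
    "chunk_index": 0,
    "tokens": "placeholder token text",
}
```

Calling validate_record(sample) checks a record against the schema offline; iterating over stream_records() and passing each record to validate_record would do the same against the hosted data.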
Provider:
EthanHosier



