fjxdaisy/sentence_split_finemath_4plus_part_3
收藏Hugging Face2025-02-26 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/fjxdaisy/sentence_split_finemath_4plus_part_3
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含多个字段信息的文本数据集,适用于文本处理和语言分析任务。它包含了如URL、抓取时间、MIME类型、文本内容、字符数、元数据等字段。数据集分为训练集,共有1000000个示例,总大小约为10.96GB。数据集的配置信息中指定了训练集的数据文件路径。
This dataset is a text dataset with multiple fields information, suitable for text processing and language analysis tasks. It includes fields such as URL, fetch time, MIME type, text content, character count, metadata, etc. The dataset is split into a training set with a total of 1,000,000 examples, with a total size of approximately 10.96GB. The configuration information of the dataset specifies the data file path for the training set.
提供机构:
fjxdaisy



