Aarushhh/finemath-refined
收藏Hugging Face2025-01-28 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/Aarushhh/finemath-refined
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含网页的URL、抓取时间、MIME类型、Warc文件名、文本内容、字符数、元数据、得分、整数量得分、抓取方式、快照类型、语言和语言得分等信息。数据集经过筛选,只保留了整数量得分为5的记录。训练集包含644,840个示例,数据大小为3,763,192,504.45字节。
The dataset includes information such as web page URL, fetch time, MIME type, Warc filename, text content, character count, metadata, score, integer score, crawling method, snapshot type, language, and language score. The dataset has been filtered to retain only records with an integer score of 5. The training set contains 644,840 examples, with a total data size of 3,763,192,504.45 bytes.
提供机构:
Aarushhh



