ChavyvAkvar/finemath-200K
收藏Hugging Face2025-02-02 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/ChavyvAkvar/finemath-200K
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含网页URL、抓取时间、MIME类型、Warc文件名、Warc记录偏移量、Warc记录长度、文本内容、词汇数、字符数、元数据、评分、整数评分、抓取方式、快照类型、语言和语言评分等字段。数据集分为训练集,包含大约200万个示例,总大小约为1.3GB。具体的数据集用途和内容描述在README文件中未提及。
The dataset includes fields such as web page URL, fetch time, MIME type, Warc filename, Warc record offset, Warc record length, text content, token count, character count, metadata, score, integer score, crawl method, snapshot type, language, and language score. The dataset is split into a training set with approximately 2 million examples, totaling about 1.3GB in size. The specific purpose and content description of the dataset are not mentioned in the README file.
提供机构:
ChavyvAkvar



