quinnlue/reasoning-llm
Source: Hugging Face | Updated: 2025-11-04 | Indexed: 2025-11-15
Download link:
https://hf-mirror.com/datasets/quinnlue/reasoning-llm
Description:
Each record holds a web page's URL, fetch time, MIME type, WARC filename, text content, token count, character count, metadata, score, integer score, crawler information, snapshot type, language, and language score. The dataset has a single training split of about 5.82 GB containing 997,978 samples. The dataset configuration gives the path to the data files for the training split.
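Because the training split is several gigabytes, it can help to look at a few records before downloading everything. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is publicly readable. The helper names `stream_train` and `column_names` are hypothetical and were chosen for this illustration. The exact column names are not listed on this page, so the sketch discovers them from the data instead of hard-coding them.

```python
from itertools import islice

# Repository name as given on this page.
REPO_ID = "quinnlue/reasoning-llm"


def stream_train(repo_id=REPO_ID):
    """Return a lazy iterator over the training split.

    streaming=True avoids downloading the full split up front.
    The import is local so the helper below works without `datasets`.
    """
    from datasets import load_dataset  # assumption: `pip install datasets`

    return load_dataset(repo_id, split="train", streaming=True)


def column_names(records, limit=5):
    """Collect the field names seen in the first `limit` records, in order."""
    seen = []
    for record in islice(records, limit):
        for key in record:
            if key not in seen:
                seen.append(key)
    return seen
```

Calling `column_names(stream_train())` should list the fields described above without fetching the whole split. When going through a mirror such as the one linked on this page, the `huggingface_hub` client reads the `HF_ENDPOINT` environment variable. Whether that mirror serves this particular dataset is an assumption to verify.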
Provider:
quinnlue



