
suzakuteam/token5000_cs_engineering_other_num10000

Source: Hugging Face. Updated: 2025-08-21. Indexed: 2025-09-13.
Download link:
https://hf-mirror.com/datasets/suzakuteam/token5000_cs_engineering_other_num10000
Description:
This dataset contains question-answer pairs. Each sample includes a question (question), a thought process (thinking), an answer (answer), and content-related information (content), along with metadata: the number of characters (str_count), the content length (content_length), and the number of tokens (num_tokens). Each sample is also labeled with its source (source). The dataset consists of a single training split (train) with 10,000 samples.
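The field list above can be turned into a small schema check. The following is a minimal sketch, assuming each sample is available as a plain Python mapping keyed by the field names in the description. The helper `missing_fields` and the placeholder record are hypothetical illustrations, and the field types are assumptions, since the description does not state them.

```python
from typing import Any, List, Mapping

# Field names taken from the dataset description; the description does not
# state their types, so none are enforced here.
EXPECTED_FIELDS = (
    "question", "thinking", "answer", "content",
    "str_count", "content_length", "num_tokens", "source",
)


def missing_fields(record: Mapping[str, Any]) -> List[str]:
    """Return the expected field names absent from `record`, in schema order."""
    return [name for name in EXPECTED_FIELDS if name not in record]


# Placeholder record for illustration only; NOT real data from the dataset.
example = {
    "question": "placeholder question",
    "thinking": "placeholder reasoning",
    "answer": "placeholder answer",
    "content": "placeholder content",
    "str_count": 0,
    "content_length": 0,
    "num_tokens": 0,
    "source": "placeholder source",
}

if __name__ == "__main__":
    # An empty list means every described field is present.
    print(missing_fields(example))
```

With the Hugging Face `datasets` library installed, records from the train split could presumably be obtained with `load_dataset("suzakuteam/token5000_cs_engineering_other_num10000", split="train")` and passed to the same check. That step is left out of the sketch so it runs without network access.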
Provider:
suzakuteam