suzakuteam/token5000_cs_engineering_other_num10000
收藏Hugging Face2025-08-21 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/suzakuteam/token5000_cs_engineering_other_num10000
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含问答对的数据集,其中包括问题(question)、思考过程(thinking)、回答(answer)以及与内容(content)相关的其他信息。数据集还包含了每个样本的字数(str_count)、内容长度(content_length)、token数量(num_tokens)等元数据信息。此外,数据集还标注了每个样本的来源(source)。整个数据集被划分为训练集(train),共有10000个样本。
This is a dataset containing question-answer pairs, which includes the question (question), the thought process (thinking), the answer (answer), and other information related to the content (content). The dataset also contains metadata such as the number of characters (str_count), content length (content_length), and number of tokens (num_tokens) for each sample. In addition, the dataset is labeled with the source (source) of each sample. The entire dataset is split into a training set (train) with a total of 10,000 samples.
提供机构:
suzakuteam



