open-thoughts-4-code-qwen3-32b-annotated

Name: open-thoughts-4-code-qwen3-32b-annotated
Creator: maas
Published: 2025-12-05 16:57:09
License: 暂无描述

魔搭社区2025-12-05 更新2025-12-06 收录

下载链接：

https://modelscope.cn/datasets/marin-community/open-thoughts-4-code-qwen3-32b-annotated

下载链接

链接失效反馈

官方服务：

资源简介：

# Dataset Card for Open-Thoughts-4-Code-Qwen3-32B-Annotated  This dataset is the Qwen3-32B annotated version of [mlfoundations-dev/hero_run_4_code](https://huggingface.co/datasets/mlfoundations-dev/hero_run_4_code) curated by the OpenThoughts4 team. We provide the responses from Qwen3-32B in the `generated_text` column. These samples were generated using temperature = 0.8 and max output tokens = 7,500. We note that many of the responses are truncated, so use this dataset wisely! ## Dataset Details ### Dataset Description  - **Curated by:** [More Information Needed] - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] ### Dataset Sources [optional]  - **Repository:** [More Information Needed] - **Paper [optional]:** [More Information Needed] - **Demo [optional]:** [More Information Needed] ## Uses  ### Direct Use  [More Information Needed] ### Out-of-Scope Use  [More Information Needed] ## Dataset Structure  [More Information Needed] ## Dataset Creation ### Curation Rationale  [More Information Needed] ### Source Data  #### Data Collection and Processing  [More Information Needed] #### Who are the source data producers?  [More Information Needed] ### Annotations [optional]  #### Annotation process  [More Information Needed] #### Who are the annotators?  [More Information Needed] #### Personal and Sensitive Information  [More Information Needed] ## Bias, Risks, and Limitations  [More Information Needed] ### Recommendations  Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations. ## Citation [optional]  **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional]  [More Information Needed] ## More Information [optional] [More Information Needed] ## Dataset Card Authors [optional] [More Information Needed] ## Dataset Card Contact [More Information Needed]

# 数据集卡片：Open-Thoughts-4-Code-Qwen3-32B-Annotated  本数据集为OpenThoughts4团队整理的[mlfoundations-dev/hero_run_4_code](https://huggingface.co/datasets/mlfoundations-dev/hero_run_4_code)的**通义千问3-32B（Qwen3-32B）**标注版本。我们在`generated_text`字段中提供了Qwen3-32B生成的回复内容。这批样本使用温度参数（temperature）= 0.8以及最大输出Token（Token）数=7500生成。请注意，多数回复存在截断情况，请谨慎使用本数据集！ ## 数据集详情 ### 数据集描述  - **整理方：** [需补充更多信息] - **资助方（可选）：** [需补充更多信息] - **共享方（可选）：** [需补充更多信息] - **自然语言（NLP）：** [需补充更多信息] - **许可协议：** [需补充更多信息] ### 数据集来源（可选）  - **代码仓库：** [需补充更多信息] - **论文（可选）：** [需补充更多信息] - **演示（可选）：** [需补充更多信息] ## 数据集用途  ### 直接使用场景  [需补充更多信息] ### 不适用使用场景  [需补充更多信息] ## 数据集结构  [需补充更多信息] ## 数据集构建 ### 整理动机  [需补充更多信息] ### 源数据  #### 数据收集与处理流程  [需补充更多信息] #### 源数据生产者是谁？  [需补充更多信息] ### 标注信息（可选）  #### 标注流程  [需补充更多信息] #### 标注者是谁？  [需补充更多信息] #### 个人与敏感信息  [需补充更多信息] ## 偏差、风险与局限性  ### 相关建议  用户应知晓该数据集存在的风险、偏差与局限性。如需进一步完善建议，请补充更多信息。 ## 引用信息（可选）  **BibTeX格式：** [需补充更多信息] **APA格式：** [需补充更多信息] ## 术语表（可选）  [需补充更多信息] ## 更多信息（可选） [需补充更多信息] ## 数据集卡片撰写者（可选） [需补充更多信息] ## 数据集卡片联系方式 [需补充更多信息]

提供机构：

maas

创建时间：

2025-11-21

搜集汇总

数据集介绍

背景与挑战

背景概述

该数据集是mlfoundations-dev/hero_run_4_code的Qwen3-32B标注版本，包含在generated_text列中的生成响应，这些响应使用温度0.8和最大输出标记7500生成，但许多被截断。数据集大小为21.03GB，采用Apache License 2.0许可证，更新于2025年11月21日。

以上内容由遇见数据集搜集并总结生成