marin-community/open-thoughts-4-code-qwen3-32b-annotated

Name: marin-community/open-thoughts-4-code-qwen3-32b-annotated
Creator: marin-community
Published: 2025-11-20 19:01:49
License: 暂无描述

Hugging Face2025-11-20 更新2025-12-20 收录

下载链接：

https://hf-mirror.com/datasets/marin-community/open-thoughts-4-code-qwen3-32b-annotated

下载链接

链接失效反馈

官方服务：

资源简介：

--- # For reference on dataset card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/datasetcard.md?plain=1 # Doc / guide: https://huggingface.co/docs/hub/datasets-cards {} --- # Dataset Card for Open-Thoughts-4-Code-Qwen3-32B-Annotated  This dataset is the Qwen3-32B annotated version of [mlfoundations-dev/hero_run_4_code](https://huggingface.co/datasets/mlfoundations-dev/hero_run_4_code) curated by the OpenThoughts4 team. We provide the responses from Qwen3-32B in the `generated_text` column. These samples were generated using temperature = 0.8 and max output tokens = 7,500. We note that many of the responses are truncated, so use this dataset wisely! ## Dataset Details ### Dataset Description  - **Curated by:** [More Information Needed] - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] ### Dataset Sources [optional]  - **Repository:** [More Information Needed] - **Paper [optional]:** [More Information Needed] - **Demo [optional]:** [More Information Needed] ## Uses  ### Direct Use  [More Information Needed] ### Out-of-Scope Use  [More Information Needed] ## Dataset Structure  [More Information Needed] ## Dataset Creation ### Curation Rationale  [More Information Needed] ### Source Data  #### Data Collection and Processing  [More Information Needed] #### Who are the source data producers?  [More Information Needed] ### Annotations [optional]  #### Annotation process  [More Information Needed] #### Who are the annotators?  [More Information Needed] #### Personal and Sensitive Information  [More Information Needed] ## Bias, Risks, and Limitations  [More Information Needed] ### Recommendations  Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations. ## Citation [optional]  **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional]  [More Information Needed] ## More Information [optional] [More Information Needed] ## Dataset Card Authors [optional] [More Information Needed] ## Dataset Card Contact [More Information Needed]

# 如需参考数据集卡片元数据规范，请参阅：https://github.com/huggingface/hub-docs/blob/main/datasetcard.md?plain=1 # 文档/指南：https://huggingface.co/docs/hub/datasets-cards {} --- # Open-Thoughts-4-Code-Qwen3-32B-Annotated 数据集卡片  本数据集由OpenThoughts4团队整理，是[mlfoundations-dev/hero_run_4_code](https://huggingface.co/datasets/mlfoundations-dev/hero_run_4_code)的Qwen3-32B标注版本。我们在`generated_text`（生成文本）列中提供了Qwen3-32B生成的回复，这些样本采用温度系数（temperature）为0.8、最大输出Token数为7500的配置生成。请注意，多数回复存在截断情况，请谨慎使用本数据集！ ## 数据集详情 ### 数据集描述  - **整理方**：[需补充更多信息] - **资助方[可选]**：[需补充更多信息] - **共享方[可选]**：[需补充更多信息] - **自然语言处理所用语言**：[需补充更多信息] - **授权协议**：[需补充更多信息] ### 数据集来源[可选]  - **代码仓库**：[需补充更多信息] - **相关论文[可选]**：[需补充更多信息] - **演示 Demo[可选]**：[需补充更多信息] ## 数据集用途 ### 直接用途  [需补充更多信息] ### 超出范围的使用场景  [需补充更多信息] ## 数据集结构  [需补充更多信息] ## 数据集构建 ### 构建初衷  [需补充更多信息] ### 源数据  #### 数据收集与处理流程  [需补充更多信息] #### 源数据生产者是谁？  [需补充更多信息] ### 标注信息[可选]  #### 标注流程  [需补充更多信息] #### 标注者是谁？  [需补充更多信息] #### 个人与敏感信息  [需补充更多信息] ## 偏差、风险与局限性  ### 建议  用户应知晓本数据集存在的风险、偏差与局限性。如需进一步的建议，需补充更多信息。 ## 引用信息[可选]  **BibTeX格式：** [需补充更多信息] **APA格式：** [需补充更多信息] ## 术语表[可选]  [需补充更多信息] ## 更多信息[可选] [需补充更多信息] ## 数据集卡片作者[可选] [需补充更多信息] ## 数据集卡片联系人 [需补充更多信息]

提供机构：

marin-community

5,000+

优质数据集

54 个

任务类型

进入经典数据集