nvidia/Nemotron-CrossThink
收藏Hugging Face2025-05-01 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/nvidia/Nemotron-CrossThink
下载链接
链接失效反馈官方服务:
资源简介:
Nemotron-CrossThink是一个多领域强化学习(RL)数据集,旨在提高大型语言模型(LLM)的一般目的和数学推理能力。该数据集包含高质量的问题-答案对,带有详细的推理轨迹,从CommonCrawl和高质量书籍中整理和合成。Nemotron-CrossThink专注于在STEM、人文和数学问题解决领域中构建多样化和可验证的推理示例。
Nemotron-CrossThink is a multi-domain reinforcement learning (RL) dataset designed to improve general-purpose and mathematical reasoning in large language models (LLMs). The dataset contains high-quality question-answer pairs with detailed reasoning traces, curated and synthesized from CommonCrawl and high-quality books. Nemotron-CrossThink focuses on building diverse and verifiable reasoning examples across STEM, humanities, and mathematical problem-solving domains.
提供机构:
nvidia



