amd/Instella-GSM8K-synthetic
Source: Hugging Face. Updated: 2025-11-14. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/amd/Instella-GSM8K-synthetic
Description:
Instella-GSM8K-synthetic is a synthetic dataset used in the second stage of pre-training for the Instella-3B model. Each new question-answer pair is created by replacing the numerical values in a GSM8K problem with alternative values that the same Python program can still solve.
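The value-substitution idea in the description can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the pipeline AMD actually used: the problem text, the `solve` program, the sampling ranges, and the non-negative-answer filter are all hypothetical and chosen only to show how one fixed Python program can answer many numeric variants of the same question.

```python
import random

# Hypothetical GSM8K-style problem, written as a template with numeric slots.
QUESTION_TEMPLATE = (
    "A baker sells {boxes} boxes of cookies. Each box holds {per_box} cookies. "
    "If {eaten} cookies are eaten, how many cookies are left?"
)


def solve(boxes: int, per_box: int, eaten: int) -> int:
    """Fixed solution program: valid for any values that fill the slots."""
    return boxes * per_box - eaten


def make_variant(rng: random.Random) -> dict:
    """Sample replacement values and keep only those with a sensible answer."""
    while True:
        values = {
            "boxes": rng.randint(2, 20),    # assumed sampling ranges
            "per_box": rng.randint(2, 12),
            "eaten": rng.randint(1, 30),
        }
        answer = solve(**values)
        if answer >= 0:  # assumed filter: reject negative cookie counts
            return {
                "question": QUESTION_TEMPLATE.format(**values),
                "answer": answer,
                "values": values,
            }


def generate(n: int, seed: int = 0) -> list:
    """Produce n new question-answer pairs from the single template."""
    rng = random.Random(seed)
    return [make_variant(rng) for _ in range(n)]


if __name__ == "__main__":
    for pair in generate(3):
        print(pair["question"], "->", pair["answer"])
```

Because the answer is always computed by running the program on the sampled values, every generated pair is consistent by construction; the only judgment call in this sketch is which sampled values to reject.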
Provider:
amd



