mamed0v/orca-math-word-problems-200k-turkmen
Source: Hugging Face. Updated: 2024-07-06. Indexed: 2024-07-22.
Download link:
https://hf-mirror.com/datasets/mamed0v/orca-math-word-problems-200k-turkmen
Resource summary:
This dataset is a Turkmen translation of the original microsoft/orca-math-word-problems-200k dataset, containing 200,000 high-quality math word problems and their solutions. The Turkmen version aims to extend the accessibility of math problem-solving datasets to the Turkmen language community. The dataset is provided in JSONL format, with each line representing a single JSON object containing the original English question and answer along with their Turkmen translations. The translations were performed using Google Translate, and users should be aware of potential inaccuracies or mistranslations.
Provider:
mamed0v
Original information summary
Turkmen Orca Math Word Problems 200k Dataset
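Overview
This dataset is a Turkmen translation of the original microsoft/orca-math-word-problems-200k dataset. The Orca Math Word Problems dataset contains 200,000 high-quality math word problems and their solutions. This Turkmen version aims to extend the accessibility of math problem-solving datasets to the Turkmen-speaking community.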
Dataset details
- Original dataset: microsoft/orca-math-word-problems-200k
- Languages: English and Turkmen
- Number of samples: 200,000
- Task type: math word problems and their solutions
- Translation method: Google Translate
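Because the translations come from Google Translate, a cheap sanity check can help surface suspicious records. The sketch below is an illustrative heuristic, not part of the dataset or its tooling: it compares the digit runs in an English string with those in its Turkmen counterpart (for instance the `answer` and `answer_tk` fields). The helper names `digit_runs` and `numbers_match` are hypothetical.

```python
import re

def digit_runs(text):
    """Return the sorted list of digit runs in text, e.g. '3.5 kg' -> ['3', '5']."""
    return sorted(re.findall(r"\d+", text))

def numbers_match(original, translated):
    """Heuristic: True when both strings contain the same digit runs."""
    return digit_runs(original) == digit_runs(translated)

# English and Turkmen answers taken from the dataset's own example record.
answer = "The number 9944 has four digits."
answer_tk = "9944 belgisinde dört san bar."
print(numbers_match(answer, answer_tk))  # True

# A deliberately corrupted copy, used only to show a flagged mismatch.
corrupted = answer_tk.replace("9944", "9949")
print(numbers_match(answer, corrupted))  # False
```

This only catches changed or dropped numerals. It can raise false alarms when one side groups thousands (`1,000` versus `1000`), and it says nothing about numbers written out as words or about the quality of the surrounding text.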
File format
The dataset is provided in JSONL (JSON Lines) format. Each line in the file is a single JSON object with the following structure:
```json
{
  "question": "Original English math word problem",
  "answer": "Original English answer",
  "question_tk": "Math word problem translated into Turkmen",
  "answer_tk": "Answer translated into Turkmen"
}
```
Example:
```json
{
  "question": "What is the largest number with a certain number of digits that is exactly divisible by 88? The answer is 9944. How many digits does this number have?",
  "answer": "The number 9944 has four digits.",
  "question_tk": "Belli bir san bilen 88-e bölünyän iň uly san haýsy? Jogap 9944. Bu sanyň näçe sany bar?",
  "answer_tk": "9944 belgisinde dört san bar."
}
```
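As a minimal sketch of reading this format, the following Python snippet parses JSONL text one line at a time using only the standard library and checks that each record carries the four documented keys. The helper name `parse_jsonl` and the inline sample are assumptions made for illustration; the actual file name inside the repository is not stated here.

```python
import json

# The four keys documented in the file format description.
REQUIRED_KEYS = {"question", "answer", "question_tk", "answer_tk"}

def parse_jsonl(text):
    """Parse JSONL text into a list of dicts, skipping blank lines."""
    records = []
    for line_no, line in enumerate(text.split("\n"), start=1):
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        missing = REQUIRED_KEYS - record.keys()
        if missing:
            raise ValueError(f"line {line_no}: missing keys {sorted(missing)}")
        records.append(record)
    return records

# One line of sample data, built from the example record.
sample = json.dumps({
    "question": "What is the largest number with a certain number of digits "
                "that is exactly divisible by 88? The answer is 9944. "
                "How many digits does this number have?",
    "answer": "The number 9944 has four digits.",
    "question_tk": "Belli bir san bilen 88-e bölünyän iň uly san haýsy? "
                   "Jogap 9944. Bu sanyň näçe sany bar?",
    "answer_tk": "9944 belgisinde dört san bar.",
}, ensure_ascii=False)

records = parse_jsonl(sample + "\n")
print(records[0]["answer_tk"])
```

For a real file, the same loop can run over an open file handle (opened with `encoding="utf-8"`) instead of an in-memory string, which avoids loading all 200,000 records at once.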



