five

Meng-Research/gsm8k-pr

收藏
Hugging Face2025-02-17 更新2025-11-29 收录
下载链接:
https://hf-mirror.com/datasets/Meng-Research/gsm8k-pr
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 --- ## Dataset Card for GSM8K-PR ### Dataset Summary The GSM8K-PR dataset is a modified version of the well-known GSM8K dataset, designed to enhance the evaluation of large language models (LLMs) in long-context settings. This dataset retains the original structure of GSM8K, but with one important modification: **all personal pronouns (such as “he,” “she,” “it,” “they,” etc.) have been replaced with the specific person, object, or thing they refer to**. The goal of this modification is to make the dataset more suitable for tasks that involve long-form text processing and reasoning. ### Potential Errors Although we have put considerable effort into curating and modifying the GSM8K-PR dataset, there may still be some errors that have not yet been identified. If you come across any issues, inconsistencies, or errors within the dataset, please do not hesitate to contact us.

--- 许可证:Apache-2.0 --- ## GSM8K-PR 数据集卡片 ### 数据集概述 GSM8K-PR数据集是知名数据集GSM8K的改进版本,旨在提升大语言模型(Large Language Model,LLM)在长上下文场景下的评估性能。本数据集保留了GSM8K的原始结构,但做出了一项关键改进:**所有人称代词(如“他”“她”“它”“他们”等)均已替换为其所指代的具体人物、物体或事物**。此项改进的目标是使本数据集更适配涉及长文本处理与推理的任务。 ### 潜在问题 尽管我们在整理与改进GSM8K-PR数据集时投入了大量精力,但仍可能存在尚未被发现的疏漏与错误。若您在使用本数据集时发现任何问题、不一致之处或错误,请随时与我们联系。
提供机构:
Meng-Research
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作