five

shuyuej/GSM8K-Consistency

收藏
Hugging Face2023-12-31 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/shuyuej/GSM8K-Consistency
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 --- # GSM8K-Consistency Benchmark **GSM8K-Consistency** is a benchmark database for analyzing the consistency of `Arithmetic Reasoning on GSM8K`. ## 🚀 The dataset is available on 🤗 Hugging Face! This is a math-problem-related semantics-preserving perturbation benchmark that can be very helpful for evaluating the consistency of arithmetic reasoning capability. ## 💻 Dataset Usage Run the following command to load the data: ```python from datasets import load_dataset dataset = load_dataset("shuyuej/GSM8K-Consistency") dataset = dataset['train'] print(dataset) ``` Dataset Description: ```python Dataset({ features: ['id', 'original_question', 'paraphrased_question', 'answer_detail', 'numerical_answer'], num_rows: 85225 }) ```
提供机构:
shuyuej
原始信息汇总

GSM8K-Consistency Benchmark

GSM8K-Consistency 是一个用于分析 GSM8K 算术推理 一致性的基准数据库。

数据集描述

该数据集是一个与数学问题相关的语义保持扰动基准,有助于评估算术推理能力的一致性。

数据集使用

使用以下命令加载数据: python from datasets import load_dataset

dataset = load_dataset("shuyuej/GSM8K-Consistency") dataset = dataset[train] print(dataset)

数据集结构

python Dataset({ features: [id, original_question, paraphrased_question, answer_detail, numerical_answer], num_rows: 85225 })

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作