DatologyAI/commonsense_qa_rephrased

Name: DatologyAI/commonsense_qa_rephrased
Creator: DatologyAI
Published: 2024-10-22 13:08:41
License: No description available

Hugging Face | Updated 2024-10-22 | Indexed 2025-04-12

Download link:

https://hf-mirror.com/datasets/DatologyAI/commonsense_qa_rephrased

Resource description:

---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
dataset_info:
  features:
  - name: id
    dtype: int64
  - name: prefix
    dtype: string
  - name: choices
    sequence: string
  - name: answerID
    dtype: int64
  splits:
  - name: train
    num_bytes: 209658
    num_examples: 1221
  download_size: 128995
  dataset_size: 209658
---

# commonsense_qa Dataset

## Overview

This repository contains the rephrased version of the commonsense_qa dataset. The dataset is formatted as a collection of multiple-choice questions: each example is a sentence prefix plus a list of candidate completions.

## Dataset Structure

Each example in the dataset contains the following fields:

```python
{
    "id": 0,
    "prefix": "A revolving door is convenient for two direction travel, but it also serves as a security measure at a",
    "choices": [
        "bank.",
        "library.",
        "department store.",
        "mall.",
        "New York."
    ],
    "answerID": 0
}
```

## Fields Description

- **id**: Unique identifier for each example
- **prefix**: The question or prompt text, written as a sentence prefix that the correct choice completes
- **choices**: List of possible answers
- **answerID**: Index of the correct answer in the choices list (0-based)

## Loading the Dataset

You can load this dataset using the Hugging Face datasets library:

```python
from datasets import load_dataset

# Load the dataset (it has a single "train" split)
dataset = load_dataset("DatologyAI/commonsense_qa_rephrased")

# Access the data
for example in dataset['train']:
    print(example)
```

## Example Usage

```python
from datasets import load_dataset

# Load the dataset
dataset = load_dataset("DatologyAI/commonsense_qa_rephrased")

# Get a sample question
sample = dataset['train'][0]

# Print the question; the prompt text is stored in the "prefix" field
print("Question:", sample['prefix'])
print("Choices:")
for i, choice in enumerate(sample['choices']):
    print(f"{i}. {choice}")
print("Correct Answer:", sample['choices'][sample['answerID']])
```

## Dataset Creation

This dataset was processed to ensure:

1. All entries are sorted by ID
2. All string values have been stripped of extra whitespace
3. Consistent JSON formatting
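
The three processing steps listed under Dataset Creation could look roughly like the sketch below. The source does not describe the actual pipeline, so this is only an illustration under stated assumptions: the function names `normalize_records` and `to_json_lines` are hypothetical, "consistent JSON formatting" is interpreted here as one JSON object per line with sorted keys, and the second record in `raw` is made-up sample data.

```python
import json


def normalize_records(records):
    """Strip surrounding whitespace from string values and sort records by id.

    Hypothetical helper; the real processing code is not given in the source.
    """
    cleaned = []
    for rec in records:
        cleaned.append({
            "id": int(rec["id"]),
            "prefix": rec["prefix"].strip(),
            "choices": [choice.strip() for choice in rec["choices"]],
            "answerID": int(rec["answerID"]),
        })
    # Step 1 of the list: entries sorted by ID
    return sorted(cleaned, key=lambda r: r["id"])


def to_json_lines(records):
    """Serialize records as one JSON object per line with a stable key order.

    This is an assumed reading of "consistent JSON formatting".
    """
    return "\n".join(
        json.dumps(rec, ensure_ascii=False, sort_keys=True) for rec in records
    )


# Unsorted input with stray whitespace. The id=1 record is invented for
# illustration; the id=0 record reuses the example shown in the README.
raw = [
    {
        "id": 1,
        "prefix": "  A made-up prompt that ends at the ",
        "choices": [" first. ", "second."],
        "answerID": 1,
    },
    {
        "id": 0,
        "prefix": "A revolving door is convenient for two direction travel, "
                  "but it also serves as a security measure at a ",
        "choices": ["bank.", " library.", "department store.", "mall.", "New York. "],
        "answerID": 0,
    },
]

normalized = normalize_records(raw)
print(to_json_lines(normalized))
```

Because `answerID` is an index into `choices`, the sketch strips each choice in place rather than reordering or deduplicating the list, so the index stays valid after cleaning.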

Provider:

DatologyAI
