
DatologyAI/commonsense_qa_rephrased

Source: Hugging Face. Updated 2024-10-22; indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/DatologyAI/commonsense_qa_rephrased
Description:
---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
dataset_info:
  features:
  - name: id
    dtype: int64
  - name: prefix
    dtype: string
  - name: choices
    sequence: string
  - name: answerID
    dtype: int64
  splits:
  - name: train
    num_bytes: 209658
    num_examples: 1221
  download_size: 128995
  dataset_size: 209658
---

# commonsense_qa Dataset

## Overview

This repository contains the rephrased version of the commonsense_qa dataset. The dataset is formatted as a collection of multiple-choice questions: each example gives a sentence prefix and a list of candidate completions.

## Dataset Structure

Each example in the dataset contains the following fields:

```python
{
    "id": 0,
    "prefix": "A revolving door is convenient for two direction travel, but it also serves as a security measure at a",
    "choices": [
        "bank.",
        "library.",
        "department store.",
        "mall.",
        "New York."
    ],
    "answerID": 0
}
```

## Fields Description

- **id**: Unique identifier for each example
- **prefix**: The prompt text, a sentence prefix that each choice completes
- **choices**: List of possible answers
- **answerID**: Index of the correct answer in the choices list (0-based)

## Loading the Dataset

You can load this dataset using the Hugging Face datasets library:

```python
from datasets import load_dataset

# Load the dataset (it has a single "train" split)
dataset = load_dataset("DatologyAI/commonsense_qa_rephrased")

# Access the data
for example in dataset['train']:
    print(example)
```

## Example Usage

```python
from datasets import load_dataset

# Load the dataset
dataset = load_dataset("DatologyAI/commonsense_qa_rephrased")

# Get a sample question
sample = dataset['train'][0]

# Print the prompt text; the field is named "prefix", not "question"
print("Question:", sample['prefix'])
print("Choices:")
for i, choice in enumerate(sample['choices']):
    print(f"{i}. {choice}")
print("Correct Answer:", sample['choices'][sample['answerID']])
```

## Dataset Creation

This dataset was processed to ensure that:

1. All entries are sorted by ID
2. All string values have been stripped of extra whitespace
3. JSON formatting is consistent
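
Because each `prefix` in the sample record is an unfinished sentence and each choice is a possible ending, one way to use an example is to join the prefix with every choice and treat the results as candidate sentences. The sketch below runs offline on the sample record from this card. The helper names `completions` and `is_clean` are hypothetical and not part of the dataset or the `datasets` library, and the `answerID` range check is an extra sanity check that the card does not list.

```python
from typing import TypedDict


class Example(TypedDict):
    """Record layout as described in the card's dataset_info."""
    id: int
    prefix: str
    choices: list[str]
    answerID: int


def completions(example: Example) -> list[str]:
    """Join the prefix with each choice to form candidate sentences."""
    return [f"{example['prefix']} {choice}" for choice in example["choices"]]


def is_clean(examples: list[Example]) -> bool:
    """Check the properties listed under Dataset Creation."""
    ids = [ex["id"] for ex in examples]
    sorted_by_id = ids == sorted(ids)
    # Every string value should already be stripped of extra whitespace.
    stripped = all(
        s == s.strip()
        for ex in examples
        for s in [ex["prefix"], *ex["choices"]]
    )
    # Extra sanity check (an assumption, not stated in the card):
    # answerID must be a valid 0-based index into choices.
    valid_answers = all(0 <= ex["answerID"] < len(ex["choices"]) for ex in examples)
    return sorted_by_id and stripped and valid_answers


# The sample record shown in the dataset card.
sample: Example = {
    "id": 0,
    "prefix": "A revolving door is convenient for two direction travel, "
              "but it also serves as a security measure at a",
    "choices": ["bank.", "library.", "department store.", "mall.", "New York."],
    "answerID": 0,
}

candidates = completions(sample)
print(candidates[sample["answerID"]])
print(is_clean([sample]))
```

The same functions should work on rows returned by `load_dataset`, since each row is a plain dictionary with these four keys.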
Provider:
DatologyAI