LM-Polygraph/trivia_qa_tiny

Name: LM-Polygraph/trivia_qa_tiny
Creator: LM-Polygraph
Published: 2024-11-04 13:30:17
License: 暂无描述

Hugging Face2024-11-04 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/LM-Polygraph/trivia_qa_tiny

下载链接

链接失效反馈

官方服务：

资源简介：

--- language: - en dataset_info: config_name: continuation features: - name: input dtype: string - name: output dtype: string splits: - name: train num_bytes: 7657 num_examples: 100 - name: test num_bytes: 7657 num_examples: 100 download_size: 15360 dataset_size: 15314 configs: - config_name: continuation data_files: - split: train path: continuation/train-* - split: test path: continuation/test-* --- # Dataset Card for trivia_qa_tiny  This is a preprocessed version of trivia_qa_tiny dataset for benchmarks in LM-Polygraph. ## Dataset Details ### Dataset Description  - **Curated by:** https://huggingface.co/LM-Polygraph - **License:** https://github.com/IINemo/lm-polygraph/blob/main/LICENSE.md ### Dataset Sources [optional]  - **Repository:** https://github.com/IINemo/lm-polygraph ## Uses  ### Direct Use  This dataset should be used for performing benchmarks on LM-polygraph. ### Out-of-Scope Use  This dataset should not be used for further dataset preprocessing. ## Dataset Structure  This dataset contains the "continuation" subset, which corresponds to main dataset, used in LM-Polygraph. It may also contain other subsets, which correspond to instruct methods, used in LM-Polygraph. Each subset contains two splits: train and test. Each split contains two string columns: "input", which corresponds to processed input for LM-Polygraph, and "output", which corresponds to processed output for LM-Polygraph. ## Dataset Creation ### Curation Rationale  This dataset is created in order to separate dataset creation code from benchmarking code. ### Source Data  #### Data Collection and Processing  Data is collected from https://huggingface.co/datasets/SpeedOfMagic/trivia_qa_tiny and processed by using build_dataset.py script in repository. #### Who are the source data producers?  People who created https://huggingface.co/datasets/SpeedOfMagic/trivia_qa_tiny ## Bias, Risks, and Limitations  This dataset contains the same biases, risks, and limitations as its source dataset https://huggingface.co/datasets/SpeedOfMagic/trivia_qa_tiny ### Recommendations  Users should be made aware of the risks, biases and limitations of the dataset.

提供机构：

LM-Polygraph

5,000+

优质数据集

54 个

任务类型

进入经典数据集