nightingal3/fig-qa

Name: nightingal3/fig-qa
Creator: nightingal3
Published: 2023-06-10 18:13:33
License: 暂无描述

Hugging Face2023-06-10 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/nightingal3/fig-qa

下载链接

链接失效反馈

官方服务：

资源简介：

--- annotations_creators: - expert-generated - crowdsourced language_creators: - crowdsourced language: - en license: - mit multilinguality: - monolingual pretty_name: Fig-QA size_categories: - 10K<n<100K source_datasets: - original task_categories: - multiple-choice task_ids: - multiple-choice-qa --- # Dataset Card for Fig-QA ## Table of Contents - [Table of Contents](#table-of-contents) - [Dataset Description](#dataset-description) - [Dataset Summary](#dataset-summary) - [Supported Tasks and Leaderboards](#supported-tasks-and-leaderboards) - [Languages](#languages) - [Dataset Structure](#dataset-structure) - [Data Splits](#data-splits) - [Considerations for Using the Data](#considerations-for-using-the-data) - [Discussion of Biases](#discussion-of-biases) - [Additional Information](#additional-information) - [Licensing Information](#licensing-information) - [Citation Information](#citation-information) ## Dataset Description - **Repository:** https://github.com/nightingal3/Fig-QA - **Paper:** https://arxiv.org/abs/2204.12632 - **Leaderboard:** https://explainaboard.inspiredco.ai/leaderboards?dataset=fig_qa - **Point of Contact:** emmy@cmu.edu ### Dataset Summary This is the dataset for the paper [Testing the Ability of Language Models to Interpret Figurative Language](https://arxiv.org/abs/2204.12632). Fig-QA consists of 10256 examples of human-written creative metaphors that are paired as a Winograd schema. It can be used to evaluate the commonsense reasoning of models. The metaphors themselves can also be used as training data for other tasks, such as metaphor detection or generation. ### Supported Tasks and Leaderboards You can evaluate your models on the test set by submitting to the [leaderboard](https://explainaboard.inspiredco.ai/leaderboards?dataset=fig_qa) on Explainaboard. Click on "New" and select `qa-multiple-choice` for the task field. Select `accuracy` for the metric. You should upload results in the form of a system output file in JSON or JSONL format. ### Languages This is the English version. Multilingual version can be found [here](https://huggingface.co/datasets/cmu-lti/multi-figqa). ### Data Splits Train-{S, M(no suffix), XL}: different training set sizes Dev Test (labels not provided for test set) ## Considerations for Using the Data ### Discussion of Biases These metaphors are human-generated and may contain insults or other explicit content. Authors of the paper manually removed offensive content, but users should keep in mind that some potentially offensive content may remain in the dataset. ## Additional Information ### Licensing Information MIT License ### Citation Information If you found the dataset useful, please cite this paper: @misc{https://doi.org/10.48550/arxiv.2204.12632, doi = {10.48550/ARXIV.2204.12632}, url = {https://arxiv.org/abs/2204.12632}, author = {Liu, Emmy and Cui, Chen and Zheng, Kenneth and Neubig, Graham}, keywords = {Computation and Language (cs.CL), Artificial Intelligence (cs.AI), FOS: Computer and information sciences, FOS: Computer and information sciences}, title = {Testing the Ability of Language Models to Interpret Figurative Language}, publisher = {arXiv}, year = {2022}, copyright = {Creative Commons Attribution Share Alike 4.0 International} }

提供机构：

nightingal3

原始信息汇总

数据集概述

数据集名称

名称: Fig-QA

数据集基本信息

语言: 英语 (en)
许可证: MIT License
多语言性: 单语种
大小: 10K<n<100K
来源: 原始数据
任务类别: 多项选择
任务ID: 多项选择问答 (multiple-choice-qa)

数据集描述

概述: Fig-QA包含10256个人类编写的创意隐喻示例，这些示例作为Winograd模式配对，用于评估模型的常识推理能力。这些隐喻也可用于其他任务，如隐喻检测或生成。
支持的任务和排行榜: 可通过提交模型到Explainaboard的排行榜来评估模型在测试集上的表现。

数据集结构

数据分割: 训练集（不同大小）、开发集、测试集（测试集标签未提供）

使用数据的考虑

偏见讨论: 数据集中的隐喻由人类生成，可能包含侮辱或其他明确内容。虽然论文作者手动移除了攻击性内容，但用户应注意可能仍存在潜在的攻击性内容。

附加信息

许可证信息: MIT License
引用信息: 如果发现数据集有用，请引用相关论文。

5,000+

优质数据集

54 个

任务类型

进入经典数据集