ANALOGICAL

Name: ANALOGICAL
Creator: 人工智能研究所，南卡罗来纳大学，美国
Published: 2023-05-26 04:38:17
License: 暂无描述

arXiv2023-05-26 更新2024-08-06 收录

下载链接：

http://arxiv.org/abs/2305.05050v3

下载链接

链接失效反馈

官方服务：

资源简介：

ANALOGICAL是一个用于评估大型语言模型在长文本类比评估中的新基准。该数据集包含13个子数据集，涵盖从单词到句子的六个不同复杂度级别的类比，包括单词、单词与句子、句法、否定、蕴涵和隐喻。这些数据集用于评估八种大型语言模型在语义向量空间中识别类比对的能力。通过这些数据集，研究者发现随着类比复杂度的增加，大型语言模型识别类比的能力逐渐下降。

ANALOGICAL is a novel benchmark for evaluating large language models in long-text analogical assessment. This dataset comprises 13 sub-datasets, covering analogies across six distinct complexity levels ranging from individual words to complete sentences, including word-level, word-sentence, syntactic, negation, entailment, and metaphorical analogies. These sub-datasets are used to evaluate the capability of eight large language models to identify analogy pairs within semantic vector spaces. Through this benchmark, researchers have found that the ability of large language models to recognize analogies gradually declines as the complexity of the analogies increases.

提供机构：

人工智能研究所，南卡罗来纳大学，美国

创建时间：

2023-05-09

搜集汇总

数据集介绍