five

Knowledge Base and RAG Evaluation Dataset

收藏
DataCite Commons2025-02-19 更新2025-04-09 收录
下载链接:
https://datahub.tec.mx/citation?persistentId=doi:10.57687/FK2/UEFAPU
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset consists of two tabs: 1. Knowledge Base: This tab contains key terms (hypernyms), their associated skills (hyponyms), definitions and KSA-O classifications. It follows the structure of the dataset "Automotive Industry Skills Taxonomy" . 2. Testing Set: This tab is used to evaluate a RAG (Retrieval-Augmented Generation) model that has access to the knowledge base from the first tab. Structure of the Testing Set (Tab 2): * Text: A hyponym (variation) of a skill. * Label: Categorizes the text into one of three unique values based on its presence in the knowledge base:     -New hypernym: A key term that does not exist in the knowledge base.      -Existing hyponym : A variation of a skill that already exists in the knowledge base.      -New hyponym: A new variation that can be mapped into an existing hypernym. * Original_key: Original hypernym associated with the hyponyms in the "Text" column. The terms in this column correspond to existing pairings in the taxonomy from "Automotive Industry Skills Taxonomy".
提供机构:
Tecnológico de Monterrey
创建时间:
2025-02-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作