交叉学科专项知识库评测数据集
收藏国家数据集管理服务平台2026-04-28 更新2026-04-29 收录
下载链接:
https://www.ndsms.cn/dataRetrieval/datasetDetail/?id=45ded58f73288cbba76a58e979df91ec
下载链接
链接失效反馈官方服务:
资源简介:
本评测集面向从事社会科学、历史文献研究、媒体传播及生物医学等交叉领域研究的科研团队、学术机构及项目研发人员,旨在解决交叉领域语料库评测中缺乏系统性分类体系、关键信息验证无标准、评测维度不全面等行业痛点。每个评测对均经过人工评测检验,确保专业可靠。评测集共包含1,014条优质评测内容,构建了完善的系统性评测分类体系,每条评测内容均包含评测维度、评测问题、标准回答、思考过程、验证目标、引证来源及难度等级七大核心要素,能够对交叉领域语料库的适用性、准确性、信息完整性进行全面且精准的评测。与传统的主观判断或单一维度评测不同,本评测集提供了可复用的标准化框架,将“该语料是否可用于某项交叉研究”这一模糊问题拆解为多维度可量化指标。
This benchmark dataset is targeted at research teams, academic institutions and project R&D personnel engaged in interdisciplinary research such as social sciences, historical document research, media communication and biomedicine. It aims to address industry pain points in interdisciplinary corpus evaluation, including the lack of systematic classification systems, absence of standards for key information verification, and incomplete evaluation dimensions. Each evaluation pair has undergone manual review and verification to ensure professional reliability. The benchmark dataset contains a total of 1,014 high-quality evaluation entries, and has established a complete systematic evaluation classification system. Each evaluation entry includes seven core elements: evaluation dimensions, evaluation questions, standard answers, thinking processes, verification targets, citation sources, and difficulty levels, which can conduct comprehensive and accurate evaluations of the applicability, accuracy, and information completeness of interdisciplinary corpora. Unlike traditional subjective judgments or single-dimensional evaluations, this benchmark dataset provides a reusable standardized framework that breaks down the vague question of "whether this corpus can be used for a certain interdisciplinary study" into multi-dimensional quantifiable indicators.
提供机构:
上海库帕思科技有限公司
创建时间:
2026-04-27
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集是一个面向社会科学、历史文献、媒体传播及生物医学等交叉学科研究的专项评测集,旨在解决相关语料库评测中缺乏系统性标准和全面维度的问题。它包含1014条经过人工检验的评测内容,每条均涵盖评测维度、问题、标准答案等七大核心要素,构建了可复用的标准化多维度评测框架。该框架能够将语料库的适用性评估转化为可量化的指标,用于全面、精准地评测其准确性、完整性与适用性。
以上内容由遇见数据集搜集并总结生成



