trais-lab/DCA-Bench
Source: Hugging Face | Updated: 2025-05-31 | Indexed: 2025-11-01
Download link:
https://hf-mirror.com/datasets/trais-lab/DCA-Bench
Resource description:
DCA-Benchmark aims to provide a comprehensive benchmark for evaluating the ability of LLM agents to discover data quality issues on online dataset platforms, which is the first step of the data curation pipeline. We collected 221 representative samples from 8 online dataset platforms and classified them into 4 types with 18 tags according to their content and difficulty.
Provider:
trais-lab



