GOBY Benchmark
收藏arXiv2025-09-30 收录
下载链接:
https://goby-benchmark.github.io/
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个旨在评估大型语言模型(LLM)在企业数据整合背景下的性能的基准数据集。它突显了以往模型和基准测试的局限性。此外,该数据集用于将LLM与公开基准测试进行比较,揭示性能差距,并为提升LLM在企业数据任务中的应用提供洞见。该数据集的任务是对语义列类型进行注释。
This dataset is a benchmark designed to evaluate the performance of Large Language Models (LLMs) in the context of enterprise data integration. It highlights the limitations of existing models and benchmark datasets. Furthermore, this dataset is utilized to compare LLMs against public benchmarks, uncover performance gaps, and provide insights for enhancing the application of LLMs in enterprise data tasks. The task of this dataset is semantic column type annotation.
提供机构:
GOBY Benchmark Team



