ECIR2020-dataset-search
收藏arXiv2020-01-28 更新2024-06-21 收录
下载链接:
https://github.com/Zhiyu-Chen/ECIR2020-dataset-search
下载链接
链接失效反馈官方服务:
资源简介:
数据集ECIR2020-dataset-search由理海大学创建,包含2417个由美国联邦政府发布的资源,涵盖多种主题。每个资源包括一个或多个CSV格式的数据表及其元数据。数据集的创建旨在通过生成可能的架构标签来增强数据集检索任务,解决现有数据集搜索引擎依赖于数据集描述匹配查询的问题。该数据集的应用领域包括数据共享和重用,特别是在需要高效数据管理和检索的场景中。
The ECIR2020-dataset-search dataset was created by Lehigh University, containing 2,417 resources released by the U.S. federal government and covering a wide range of topics. Each resource includes one or more CSV-formatted data tables along with their associated metadata. This dataset was developed to enhance dataset retrieval tasks by generating potential schema tags, addressing the limitation that existing dataset search engines solely rely on matching between dataset descriptions and user queries. This dataset finds applications in data sharing and reuse, especially in scenarios requiring efficient data management and retrieval.
提供机构:
理海大学
创建时间:
2020-01-28



