11 Benchmark Clean-Clean ER datasets in CSV format

Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/13946188
Description:
Contains:
D1: Restaurant descriptions, first introduced in OAEI 2010.
D2: Duplicate products from Abt.com and Buy.com.
D3: Product descriptions matched between Amazon and Google Base.
D4: Bibliographic data from DBLP and ACM.
D5, D6, D7: Descriptions of television shows and movies from TheTVDB, IMDb, and TMDb.
D8: Product descriptions matched between Walmart and Amazon.
D9: Bibliographic data from DBLP and Google Scholar.
D10: Movie descriptions linked between IMDb and DBpedia.
D11: A large-scale dataset with millions of heterogeneous entities from two DBpedia versions spanning a 3-year gap.
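
For readers new to the task, the following is a minimal sketch of what Clean-Clean ER over two CSV sources looks like: each source is assumed to be duplicate-free, and matches are sought only across the two sources. The toy records, the column names (`id`, `title`, `name`), the Jaccard token similarity, and the 0.5 threshold are all illustrative assumptions; they are not taken from the dataset files, whose actual layout should be checked after download.

```python
import csv
import io

# Hypothetical toy data standing in for two CSV sources.
# The real files' names, delimiters, and columns may differ.
SOURCE_A = """id,title
a1,Canon PowerShot A95 Digital Camera
a2,Sony Bravia 40 inch LCD TV
"""
SOURCE_B = """id,name
b1,sony bravia lcd tv 40 inch
b2,canon powershot a95 camera
b3,apple ipod nano 4gb
"""


def read_rows(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


def tokens(value):
    """Lowercase a string and split it into a set of whitespace tokens."""
    return set(value.lower().split())


def jaccard(x, y):
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = x | y
    return len(x & y) / len(union) if union else 0.0


def match(rows_a, rows_b, field_a, field_b, threshold=0.5):
    """Compare every record of A with every record of B (never A with A
    or B with B) and keep the pairs whose similarity reaches the threshold."""
    pairs = []
    for a in rows_a:
        tokens_a = tokens(a[field_a])
        for b in rows_b:
            if jaccard(tokens_a, tokens(b[field_b])) >= threshold:
                pairs.append((a["id"], b["id"]))
    return pairs


matches = match(read_rows(SOURCE_A), read_rows(SOURCE_B), "title", "name")
print(matches)
```

The exhaustive pairwise comparison is only workable for toy inputs. For a collection on the scale described for D11, a blocking or filtering step would normally be needed before any pairwise matching.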
Created:
2025-03-19