Abt-Buy
收藏OpenDataLab2026-05-17 更新2024-05-09 收录
下载链接:
https://opendatalab.org.cn/OpenDataLab/Abt-Buy
下载链接
链接失效反馈官方服务:
资源简介:
用于实体解析的 Abt-Buy 数据集来自在线零售商 Abt.com 和 Buy.com。该数据集包含来自 abt.com 的 1081 个实体和来自 buy.com 的 1092 个实体,以及两个数据源之间具有 1097 个匹配记录对的黄金标准(完美映射)。两个数据源之间的共同属性是:产品名称、产品描述和产品价格。该数据集最初发布在莱比锡大学数据库组的存储库中:https://dbs.uni-leipzig.de/research/projects/object_matching/benchmark_datasets_for_entity_resolution 以实现结果的可重复性和性能的可比性Abt-Buy 匹配任务中的不同匹配器,数据集被分成固定的训练集、验证集和测试集。 CompERBench 存储库中提供了固定拆分:http://data.dws.informatik.uni-mannheim.de/benchmarkmatchingtasks/index.html
The Abt-Buy dataset for entity resolution is sourced from the online retailers Abt.com and Buy.com. It contains 1081 entities from abt.com and 1092 entities from buy.com, along with a gold standard (perfect mapping) of 1097 matched record pairs between the two data sources. The common attributes shared by the two sources are product name, product description, and product price. This dataset was originally published in the repository of the Database Group at the University of Leipzig (https://dbs.uni-leipzig.de/research/projects/object_matching/benchmark_datasets_for_entity_resolution) to enable result reproducibility and performance comparability across different matchers for the Abt-Buy matching task. The dataset is split into fixed training, validation, and test sets, and the fixed splits are provided in the CompERBench repository: http://data.dws.informatik.uni-mannheim.de/benchmarkmatchingtasks/index.html
提供机构:
OpenDataLab
创建时间:
2022-05-23
搜集汇总
数据集介绍

背景与挑战
背景概述
Abt-Buy是一个用于实体解析的基准数据集,包含来自Abt.com和Buy.com的产品数据及匹配记录对,由莱比锡大学发布用于评估不同匹配器的性能。
以上内容由遇见数据集搜集并总结生成



