Web Data Commons - Product Corpus
收藏webdatacommons.org2025-03-25 收录
下载链接:
http://webdatacommons.org/productcorpus/index.html
下载链接
链接失效反馈官方服务:
资源简介:
Generating a Product Data Catalog out of the Web. The crawler was forced to retrieve data from 32 different PLDs which were chosen based on their containment of marked up annotations as well as their traffic rankings as reported by Alexa
从网络中生成产品数据目录。爬虫被迫从32个不同的产品列表数据源(PLDs)中检索数据,这些数据源的选择基于其包含的标记化注释以及Alexa报告的流量排名。
提供机构:
webdatacommons.org



