Long document similarity dataset, Wikipedia excerptions for wine collections
Source: Mendeley Data. Updated: 2024-01-31. Indexed: 2024-06-27.
Download link:
https://zenodo.org/record/4812960
Resource description:
Wine-related articles extracted from Wikipedia. For all articles, the figures and tables have been filtered out, as well as the categories and "see also" sections. The article structure, particularly the subtitles and paragraphs, is kept in these datasets.
Wines: The Wikipedia wines dataset consists of 1635 articles from the wine domain. It is a non-trivial mixture of articles covering different wine categories, brands, wineries, grape types, and more. The ground-truth recommendations were crafted by a human sommelier, who annotated 92 source articles with ~10 ground-truth recommendations each. Examples of expert-based ground-truth recommendations are Dom Pérignon - Moët & Chandon and Pinot Meunier - Chardonnay.
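As a rough illustration of how the sommelier's ground-truth recommendations might be used, the sketch below scores a ranked list of candidate articles with recall@k. Everything about the data layout is an assumption: the listing does not describe the file format, so the annotations are shown as a plain in-memory dictionary, and the `predictions` lists (including the candidate titles "Champagne", "Pinot noir", and "Merlot") are invented placeholders, not part of the dataset. Only the two source/recommendation pairs come from the description.

```python
def recall_at_k(ranked, relevant, k):
    """Fraction of the relevant titles that appear among the first k ranked titles."""
    if not relevant:
        return 0.0
    top_k = set(ranked[:k])
    return len(top_k & set(relevant)) / len(relevant)


# Assumed in-memory form of the annotations: source title -> recommended titles.
# The two pairs are the examples given in the dataset description.
ground_truth = {
    "Dom Pérignon": ["Moët & Chandon"],
    "Pinot Meunier": ["Chardonnay"],
}

# Hypothetical model output (placeholder titles): source title -> candidates, best first.
predictions = {
    "Dom Pérignon": ["Champagne", "Moët & Chandon", "Chardonnay"],
    "Pinot Meunier": ["Pinot noir", "Merlot", "Chardonnay"],
}

# Per-source recall@2, then the mean over all annotated sources.
scores = {
    source: recall_at_k(predictions[source], relevant, k=2)
    for source, relevant in ground_truth.items()
}
mean_recall = sum(scores.values()) / len(scores)
print(scores, mean_recall)
```

With roughly 10 recommendations per source article, a larger k (for example 10) would likely be a more natural cutoff than the k=2 used here for brevity.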
Created:
2024-01-31



