Long document similarity dataset, Wikipedia excerptions for movies collections
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7019172
下载链接
链接失效反馈官方服务:
资源简介:
Movies-related articles extracted from Wikipedia.
For all articles, the figures and tables have been filtered out, as well as the categories and "see also" sections.
The article structure, and particularly the sub-titles and paragraphs are kept in these datasets
Movies
The Wikipedia Movies dataset consists of 100,371 articles describing various movies. Each article may consist of text passages describing the plot, cast, production, reception, soundtrack, and more.
创建时间:
2023-01-20



