five

Long document similarity dataset, Wikipedia excerptions for movies collections

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7019172
下载链接
链接失效反馈
官方服务:
资源简介:
Movies-related articles extracted from Wikipedia. For all articles, the figures and tables have been filtered out, as well as the categories and "see also" sections. The article structure, and particularly the sub-titles and paragraphs are kept in these datasets   Movies The Wikipedia Movies dataset consists of 100,371 articles describing various movies. Each article may consist of text passages describing the plot, cast, production, reception, soundtrack, and more.
创建时间:
2023-01-20
二维码
社区交流群
二维码
科研交流群
商业服务