nyuuzyou/cdnpdf-presentations-part1
Source: Hugging Face. Updated 2024-11-01. Indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/nyuuzyou/cdnpdf-presentations-part1
Description:
The cdnpdf Educational Materials Dataset (Part 1) contains metadata and original files for 101,022 educational presentations from the cdnpdf.com platform. The presentations are primarily in Russian, with some in English, Kazakh, Ukrainian, and Belarusian. Metadata is stored in JSON Lines format and covers each presentation's title, description, URL, download URL, and file path. The original files are provided as PPTX presentations; all PPT files were converted to PPTX for better compatibility and smaller file size. The dataset is released under the Creative Commons Zero (CC0) license, which permits any use, modification, or distribution without attribution.
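Since the metadata is described as JSON Lines, a minimal Python sketch of reading it might look like the following. The field names (`title`, `description`, `url`, `download_url`, `filepath`) are assumptions inferred from the description above, not confirmed key names, and the sample records are invented for illustration only; check the actual metadata file before relying on them.

```python
import io
import json
from typing import Iterator, TextIO


def iter_records(stream: TextIO) -> Iterator[dict]:
    """Yield one dict per non-empty line of a JSON Lines stream."""
    for line in stream:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        yield json.loads(line)


# Hypothetical sample data; keys and values are assumptions, not real records.
SAMPLE = "\n".join([
    json.dumps({
        "title": "Example A",
        "description": "Illustrative record",
        "url": "https://example.invalid/a",
        "download_url": "https://example.invalid/a.pptx",
        "filepath": "files/a.pptx",
    }),
    "",
    json.dumps({
        "title": "Example B",
        "description": "Another illustrative record",
        "url": "https://example.invalid/b",
        "download_url": "https://example.invalid/b.pptx",
        "filepath": "files/b.pptx",
    }),
])

records = list(iter_records(io.StringIO(SAMPLE)))

# Collect the paths of records that point at PPTX files.
pptx_paths = [r["filepath"] for r in records if r["filepath"].lower().endswith(".pptx")]
print(len(records), pptx_paths)
```

For the real dataset, the same `iter_records` function could be given an open file handle (for example `open(path, encoding="utf-8")`) in place of the in-memory sample; reading line by line keeps memory use low across the roughly 101,022 records.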
Provider:
nyuuzyou



