sleeping-ai/wattpad-complete

Hugging Face | Updated 2024-11-29 | Indexed 2024-12-14
Download link:
https://hf-mirror.com/datasets/sleeping-ai/wattpad-complete
Description:
The Wattpad Complete dataset contains approximately 20,000 links to stories on the Wattpad platform, collected for dataset creation. The links point to stories accessible to standard users and are organized by the browsable categories available on the website. The dataset is distributed as a tar archive containing one text file per category, each listing the links for that category. Two supplementary datasets are also included for the Hot and New story categories, each containing approximately 1,500 links and corresponding to the smaller Wattpad-Small dataset. The repository is publicly accessible.
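The archive layout described above (one text file per category, one link per line) could be read along the following lines. This is a minimal sketch, not the repository's own tooling: the `.txt` extension, the one-link-per-line layout, and the use of the file name as the category name are assumptions, and the helper names `read_links_by_category` and `build_demo_archive` are hypothetical. The demo archive uses made-up `example.com` links rather than real dataset contents.

```python
import io
import tarfile
from pathlib import PurePosixPath


def read_links_by_category(archive):
    """Map each category (assumed to be the text file's stem) to its links.

    `archive` is a seekable binary file object holding a tar archive.
    """
    links = {}
    with tarfile.open(fileobj=archive, mode="r:*") as tar:
        for member in tar.getmembers():
            # Assumption: category files are regular files ending in .txt.
            if not member.isfile() or not member.name.endswith(".txt"):
                continue
            text = tar.extractfile(member).read().decode("utf-8", errors="replace")
            category = PurePosixPath(member.name).stem
            # Assumption: one link per line; blank lines are skipped.
            links[category] = [line.strip() for line in text.splitlines() if line.strip()]
    return links


def build_demo_archive():
    """Build a small in-memory tar archive with made-up categories and links."""
    samples = {
        "romance.txt": "https://example.com/story/1\nhttps://example.com/story/2\n",
        "fantasy.txt": "https://example.com/story/3\n",
    }
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w") as tar:
        for name, text in samples.items():
            data = text.encode("utf-8")
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    buffer.seek(0)
    return buffer
```

To try it on a downloaded copy, one would open the archive in binary mode and pass the file object to `read_links_by_category`; the actual archive file name is not stated in the description, so it has to be checked in the repository first.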
Provider:
sleeping-ai