nyuuzyou/womanru-posts
收藏Hugging Face2024-08-21 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/nyuuzyou/womanru-posts
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含1,308,238个来自Woman.ru论坛的帖子,Woman.ru是一个流行的俄语信息和娱乐门户网站,也是Runet(俄罗斯互联网)中最受欢迎的女性网站之一。数据集涵盖了从2005年到2024年的帖子,提供了近二十年来平台上讨论的全面视图。内容包括原始帖子和各种主题的回复,提供了该网站主要女性用户群体的兴趣、关注点和互动的见解。数据集主要用于文本生成任务,特别是语言建模。数据字段包括URL、标题、原始帖子内容、日期以及回复列表。数据集以俄语为主,且所有数据都在一个单一的分割中。数据集采用CC0许可证,允许自由使用、修改和分发。
This dataset contains 1,308,238 forum posts from Woman.ru, a popular Russian-language information and entertainment portal. Woman.ru is one of the most visited womens sites in Runet (Russian Internet). The dataset covers posts from around 2005 to 2024, providing a comprehensive view of discussions on the platform over nearly two decades. The content includes original posts and replies on various topics, offering insights into the interests, concerns, and interactions of the sites predominantly female user base. The dataset is primarily used for text-generation tasks, particularly language modeling. Data fields include URL, title, original post content, date, and a list of replies. The dataset is primarily in Russian and all examples are in a single split. The dataset is licensed under CC0, allowing free use, modification, and distribution.
提供机构:
nyuuzyou



