DataSynGen/RUwiki
Hugging Face | Updated 2025-02-02 | Indexed 2025-02-15
Download link:
https://hf-mirror.com/datasets/DataSynGen/RUwiki
Description:
The RuWiki dataset consists of 1,000 articles extracted from the Russian Wikipedia. The articles were obtained with a modified version of the ruWiki-web-scraper tool, and each article is enclosed in <s_text> and </s_text> tags to simplify text processing.
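
Because each article is wrapped in <s_text> and </s_text> tags, the raw text can be split back into individual articles with a regular expression. The following is a minimal sketch, not the dataset author's own tooling: the function name `split_articles` and the sample string are hypothetical, and it assumes the tags are never nested and always appear in matched pairs.

```python
import re

# Assumption: every article is delimited as <s_text>...</s_text>,
# tags are not nested, and an article may span several lines.
ARTICLE_PATTERN = re.compile(r"<s_text>(.*?)</s_text>", re.DOTALL)


def split_articles(raw_text: str) -> list[str]:
    """Return the whitespace-stripped body of each tagged article."""
    return [body.strip() for body in ARTICLE_PATTERN.findall(raw_text)]


# Hypothetical sample input, made up for illustration only.
sample = (
    "<s_text>Москва является столицей России.</s_text>\n"
    "<s_text>Волга это река\nв европейской части России.</s_text>"
)
articles = split_articles(sample)
print(len(articles))
```

The non-greedy `(.*?)` together with `re.DOTALL` keeps each match inside a single pair of tags while still allowing line breaks within an article.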
Provider:
DataSynGen



