opendatalab/WanJuan-Russian
收藏Hugging Face2025-04-22 更新2025-04-08 收录
下载链接:
https://hf-mirror.com/datasets/opendatalab/WanJuan-Russian
下载链接
链接失效反馈官方服务:
资源简介:
万卷丝路-俄语数据集是一个超过280GB的大型语料库,包含7个主要类别和34个子类别,内容覆盖历史、政治、文化、房地产、购物、天气、餐饮、百科全书和专业知识等多个领域。该数据集适用于文本生成任务,并支持俄语。
The WanJuan-Russian corpus is a large-scale dataset exceeding 280GB, comprising 7 main categories and 34 subcategories, covering a wide range of topics such as history, politics, culture, real estate, shopping, weather, dining, encyclopedias, and professional knowledge. It is designed for text generation tasks and supports the Russian language.
提供机构:
opendatalab



