Kazakhstani and Russian news corpus
Source: NIAID Data Ecosystem (indexed 2026-03-12)
Download link:
https://data.mendeley.com/datasets/2vz7vtbhn2
Description:
The presented news corpus consists of 6,261,953 documents from open Kazakhstani and Russian news media. The dataset is distributed as a zip archive containing 26 CSV (comma-separated values) files, with the documents split into files of up to 250,000 documents each.
Each document (row) consists of the following fields:
ID
Title
Text
Source
URL
Datetime
Number of views
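
A minimal sketch of how the archive could be read, assuming the CSV files carry a header row whose column names match the field list above, are UTF-8 encoded, and sit inside the zip with a `.csv` extension. None of this is confirmed by the dataset description, and `iter_documents` and `FIELDS` are hypothetical names introduced only for illustration. It uses only the Python standard library and streams rows one at a time, so the whole corpus never has to fit in memory.

```python
import csv
import io
import zipfile
from typing import Dict, Iterator

# Assumed header names, copied from the field list above; the real CSV
# headers may be spelled differently.
FIELDS = ["ID", "Title", "Text", "Source", "URL", "Datetime", "Number of views"]


def iter_documents(archive_path: str, encoding: str = "utf-8") -> Iterator[Dict[str, str]]:
    """Yield one dict per document (row) from every CSV file in the zip archive."""
    # Article texts can be longer than the csv module's default field size limit.
    csv.field_size_limit(2**31 - 1)
    with zipfile.ZipFile(archive_path) as archive:
        for name in sorted(archive.namelist()):
            if not name.lower().endswith(".csv"):
                continue  # skip anything in the archive that is not a CSV file
            with archive.open(name) as raw:
                # newline="" lets the csv module handle line breaks inside quoted text.
                text = io.TextIOWrapper(raw, encoding=encoding, newline="")
                yield from csv.DictReader(text)
```

Iterating over `iter_documents(path)`, where `path` points to the downloaded archive, yields dictionaries keyed by the header names. All values arrive as strings, so fields such as Datetime and Number of views would still need to be parsed by the caller.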
Created:
2020-12-18



