Kazakhstani and Russian news corpus
收藏Mendeley Data2024-01-31 更新2024-06-26 收录
下载链接:
https://data.mendeley.com/datasets/2vz7vtbhn2
下载链接
链接失效反馈官方服务:
资源简介:
The presented news corpora consists of 6261953 documents from open Kazakhstani and Russian news media. The dataset is presented in a form of zip archive containing 26 CSV (comma-separated values) files with the dataset split into 250 000 documents in each file. Each document (row) consists of the following fields: ID Title Text Source URL Datetime Number of views
创建时间:
2024-01-31



