eduge
收藏Opencsg2024-07-19 更新2025-05-03 收录
下载链接:
https://www.opencsg.com/datasets/AIWizards/eduge
下载链接
链接失效反馈官方服务:
资源简介:
Eduge是一个新闻分类数据集,主要用于训练新闻分类器。它包含7.5万篇蒙古语新闻文章,分为9个类别:艺术文化、经济、健康、法律、政治、体育、科技、教育和环境。每条数据包括新闻文本和对应的类别标签,数据集被划分为训练集和测试集,可用于多类别文本分类任务。该数据集来源于Eduge.mn,整合了多个新闻网站的内容。
Eduge is a news classification dataset designed primarily for training news classification models. It contains 75,000 Mongolian news articles, categorized into 9 classes: art and culture, economy, health, law, politics, sports, technology, education, and environment. Each sample in the dataset includes the news text and its corresponding category label. The dataset is split into a training set and a test set, and can be applied to multi-class text classification tasks. This dataset is sourced from Eduge.mn, integrating content from multiple news websites.
创建时间:
2024-07-19



