spandyie/nepali-news-dataset

Hugging Face | Updated 2026-02-23 | Indexed 2026-03-29
Download link:
https://hf-mirror.com/datasets/spandyie/nepali-news-dataset
Resource description:
---
language:
- ne
license: other
task_categories:
- text-classification
tags:
- news
- nepali
- news-classification
pretty_name: Nepali News Dataset
size_categories:
- 1M<n<10M
---

# Nepali News Dataset

A large collection of Nepali news articles scraped from [Baahrakhari](https://baahrakhari.com) and other Nepali news sources, with cleaned category labels.

## Dataset Summary

- **Language:** Nepali (ne)
- **Total articles:** ~2.76 million
- **Format:** Parquet (snappy compressed)

## Features

| Column | Type | Description |
|--------|------|-------------|
| `category` | string | Original Nepali category label |
| `heading` | string | Article headline |
| `text` | string | Full article body |
| `date` | string | Publication date (Nepali calendar format) |
| `link` | string | Source URL |
| `clean_categories` | string | English category label |

## Category Distribution

| Category | Count |
|----------|-------|
| news | 702,150 |
| misc | 402,847 |
| politics | 250,668 |
| national | 225,204 |
| society | 214,447 |
| economy | 213,954 |
| sports | 179,669 |
| global | 125,672 |
| health | 64,707 |
| entertainment | 62,847 |
| ... | ... |
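
With roughly 2.76 million articles, it may be more practical to stream rows than to download every Parquet shard up front. The following is a minimal sketch, not part of the original card. It assumes the Hugging Face `datasets` library is installed and that the repository exposes a split named `train` (the card does not state split names). The helper names `stream_rows`, `filter_by_category`, `count_categories`, and `demo` are hypothetical and introduced only for illustration.

```python
from collections import Counter
from itertools import islice
from typing import Dict, Iterable, Iterator


def stream_rows(split: str = "train") -> Iterator[Dict[str, str]]:
    """Lazily yield rows from the Hub instead of downloading all shards.

    Assumption: a split called "train" exists; the card does not confirm this.
    """
    # Imported here so the pure-Python helpers below work without `datasets`.
    from datasets import load_dataset

    dataset = load_dataset(
        "spandyie/nepali-news-dataset", split=split, streaming=True
    )
    return iter(dataset)


def filter_by_category(
    rows: Iterable[Dict[str, str]], category: str
) -> Iterator[Dict[str, str]]:
    """Keep only rows whose English label (`clean_categories`) matches."""
    for row in rows:
        if row.get("clean_categories") == category:
            yield row


def count_categories(rows: Iterable[Dict[str, str]], limit: int) -> Counter:
    """Tally `clean_categories` over the first `limit` rows only."""
    return Counter(row.get("clean_categories") for row in islice(rows, limit))


def demo() -> None:
    """Print three sports headlines; requires network access to the Hub."""
    sports = filter_by_category(stream_rows(), "sports")
    for row in islice(sports, 3):
        print(row["heading"], row["link"])
```

Calling `demo()` streams from the Hub and prints the first three articles labeled `sports`. Filtering on `clean_categories` rather than `category` keeps the comparison in English, since `category` holds the original Nepali label.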
Provider:
spandyie