spandyie/nepali-news-dataset
收藏Hugging Face2026-02-23 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/spandyie/nepali-news-dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- ne
license: other
task_categories:
- text-classification
tags:
- news
- nepali
- news-classification
pretty_name: Nepali News Dataset
size_categories:
- 1M<n<10M
---
# Nepali News Dataset
A large collection of Nepali news articles scraped from [Baahrakhari](https://baahrakhari.com) and other Nepali news sources, with cleaned category labels.
## Dataset Summary
- **Language:** Nepali (ne)
- **Total articles:** ~2.76 million
- **Format:** Parquet (snappy compressed)
## Features
| Column | Type | Description |
|--------|------|-------------|
| `category` | string | Original Nepali category label |
| `heading` | string | Article headline |
| `text` | string | Full article body |
| `date` | string | Publication date (Nepali calendar format) |
| `link` | string | Source URL |
| `clean_categories` | string | English category label |
## Category Distribution
| Category | Count |
|----------|-------|
| news | 702,150 |
| misc | 402,847 |
| politics | 250,668 |
| national | 225,204 |
| society | 214,447 |
| economy | 213,954 |
| sports | 179,669 |
| global | 125,672 |
| health | 64,707 |
| entertainment | 62,847 |
| ... | ... |
提供机构:
spandyie



