ranjitraut/nepal-section-wise-act-datasets
Source: Hugging Face. Updated 2026-03-17; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/ranjitraut/nepal-section-wise-act-datasets
Resource description:
---
license: fair-noncommercial-research-license
task_categories:
- question-answering
- text-generation
- text-to-speech
language:
- ne
tags:
- legal
pretty_name: Nepal Section-wise Act Datasets
size_categories:
- 100K<n<1M
---
# Nepal Section-wise Act Datasets
## Dataset Description
This dataset contains section-wise legal acts and laws of Nepal, organized for easy access and analysis. It is designed to support legal research, natural language processing (NLP) tasks, and the development of legal tech applications in Nepal.
### Features
- **Section-wise organization**: Each act is broken down by its sections, making it easy to reference specific legal provisions.
- **Coverage of major acts**: Includes major acts and laws of Nepal; coverage is not exhaustive.
### Use Cases
- Legal research and analysis
- NLP model training (e.g., legal chatbots, document classification)
- Educational purposes for law students and practitioners
- Development of legal tech tools
## Dataset Structure
### Data Fields
| Field | Type | Description |
|---------------|-----------|----------------------------------------------|
| act_name | string | Name of the act/law |
| section | string | Section number and title |
| content | string | Full text of the section |
### Example
```json
{
"act_name": "रेडियोधर्मी पदार्थ (उपयोग तथा नियमन) ऐन, २०७७",
"section": " परिच्छेद-१",
"content": "रेडियोधर्मी पदार्थ एवं प्रविधिको सुरक्षित तथा शान्तिपूर्ण उपयोग गर्न, अध्ययन अनुसन्धान गर्न र आयनीकरण गर्ने बिकिरणवाट पर्न सक्ने प्रतिकूल प्रभाववाट सर्वसाधारणको जीउधनको सुरक्षा तथा वातावरण संरक्षण गर्ने सम्बन्धमा आवश्यक कानूनी व्यवस्था गर्न बाञ्छुनीय भएकोले, सदृधीय संसदले यो ऐन बनाएको छ।"
}
```
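As a quick sanity check on the schema above, the following is a minimal sketch that parses one record and confirms the three documented fields are present and are strings. The helper name `validate_record` is hypothetical and not part of the dataset or any library, and the `content` value is shortened here for brevity.

```python
import json

# The three fields documented in the Data Fields table.
EXPECTED_FIELDS = ("act_name", "section", "content")


def validate_record(record):
    """Return a list of problems found in one record (empty list means OK)."""
    problems = []
    for field in EXPECTED_FIELDS:
        if field not in record:
            problems.append(f"missing field: {field}")
        elif not isinstance(record[field], str):
            problems.append(f"field is not a string: {field}")
    return problems


# One record in the documented shape; "content" is truncated for brevity.
raw = """
{
  "act_name": "रेडियोधर्मी पदार्थ (उपयोग तथा नियमन) ऐन, २०७७",
  "section": " परिच्छेद-१",
  "content": "रेडियोधर्मी पदार्थ एवं प्रविधिको सुरक्षित तथा शान्तिपूर्ण उपयोग गर्न"
}
"""
record = json.loads(raw)
print(validate_record(record))

# The example's "section" value carries a leading space, so stripping
# whitespace before comparing or grouping may be worthwhile.
section_label = record["section"].strip()
print(section_label)
```

Stripping whitespace on read, rather than assuming clean values, is a defensive choice based only on the single example shown above.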
## Usage
Load the dataset with the Hugging Face `datasets` library:
```python
from datasets import load_dataset
# Full dataset
dataset = load_dataset("ranjitraut/nepal-section-wise-act-datasets", split="train")
# Streaming for large-scale use
streamed = load_dataset("ranjitraut/nepal-section-wise-act-datasets", split="train", streaming=True)
for example in streamed:
    # Print the act name and the first 100 characters of each section
    print(example["act_name"], example["content"][:100])
```
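Because every row is a single section, a common first step is to regroup rows by act. The sketch below assumes only the three documented fields; `group_sections_by_act` is a hypothetical helper, and the sample rows are placeholders rather than real dataset content. Any iterable of records, such as `dataset` or `streamed` from the snippet above, could be passed in instead.

```python
from collections import defaultdict


def group_sections_by_act(records):
    """Map each act name to the list of its section labels, in input order."""
    grouped = defaultdict(list)
    for record in records:
        # Strip surrounding whitespace from the section label before storing it.
        grouped[record["act_name"]].append(record["section"].strip())
    return dict(grouped)


# Placeholder rows in the documented shape (not real dataset content).
sample_rows = [
    {"act_name": "Act A", "section": " 1", "content": "first section text"},
    {"act_name": "Act A", "section": "2", "content": "second section text"},
    {"act_name": "Act B", "section": "1", "content": "another act's text"},
]

sections_by_act = group_sections_by_act(sample_rows)
for act_name, sections in sections_by_act.items():
    print(act_name, len(sections), sections)
```

With a streamed dataset this walks the whole split once, so it may be worth stopping early while exploring.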
## Creation Methodology
- Sourced from official Nepali government legal repositories (e.g., lawcommission.gov.np).
- Processed via text extraction, section-wise splitting, and cleaning.
- Language: Primarily Nepali (Devanagari script); some bilingual entries.
- No PII or sensitive data included; anonymized where needed.
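The card names the processing steps but does not publish the pipeline. The following is only an illustrative sketch of section-wise splitting and cleaning, not the author's method. It assumes that headings sit on their own line in the form `परिच्छेद-<Devanagari digits>`, as in the example record; the pattern, the function names, and the sample text are all hypothetical.

```python
import re
import unicodedata

# Assumed heading shape: a line holding only "परिच्छेद-" plus Devanagari digits.
HEADING_RE = re.compile(r"^\s*(परिच्छेद\s*-\s*[०-९]+)\s*$", re.MULTILINE)


def clean_text(text):
    """Normalize to Unicode NFC and collapse runs of whitespace."""
    text = unicodedata.normalize("NFC", text)
    return re.sub(r"\s+", " ", text).strip()


def split_into_sections(act_name, raw_text):
    """Split raw act text into records with the documented three fields."""
    records = []
    matches = list(HEADING_RE.finditer(raw_text))
    for i, match in enumerate(matches):
        # A section body runs from the end of its heading to the next heading.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(raw_text)
        records.append({
            "act_name": act_name,
            "section": clean_text(match.group(1)),
            "content": clean_text(raw_text[match.end():end]),
        })
    return records


# Hypothetical raw text, used only to exercise the sketch.
sample_text = "परिच्छेद-१\nपहिलो   पाठ।\n\nपरिच्छेद-२\nदोस्रो पाठ।\n"
sections = split_into_sections("नमूना ऐन", sample_text)
for row in sections:
    print(row["section"], "|", row["content"])
```

Text that appears before the first heading is dropped by this sketch; a real pipeline would need to decide how to keep preambles and how to handle other heading styles.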
## Dataset Creation
### Curation Rationale
The dataset was created to address the lack of structured, accessible legal data in Nepal. By providing section-wise legal texts, we aim to facilitate research, education, and innovation in the legal domain.
### Source Data
- **Initial Data Collection**: Collected from official government publications and legal portals.
- **Processing**: Cleaned, structured, and validated by legal experts and data scientists.
## Considerations
### Social Impact
- Promotes transparency and accessibility of legal information in Nepal.
- Supports the development of AI tools for legal aid and education.
### Limitations
- Covers only a subset of Nepalese laws, focusing on major acts; it is not exhaustive.
- Legal texts may be subject to amendments; always verify with official sources.
- Scanned source documents may contain OCR errors.
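Since OCR errors are possible, it may help to flag records for manual review before training on them. The sketch below is a rough heuristic under stated assumptions, not a validated OCR-quality check: it flags empty content, the Unicode replacement character, and a low share of Devanagari characters. The function names and the `min_ratio` threshold of 0.5 are hypothetical, and legitimately bilingual entries could be flagged too.

```python
def devanagari_ratio(text):
    """Share of non-whitespace characters in the Devanagari block (U+0900 to U+097F)."""
    chars = [ch for ch in text if not ch.isspace()]
    if not chars:
        return 0.0
    in_block = sum(1 for ch in chars if "\u0900" <= ch <= "\u097f")
    return in_block / len(chars)


def flag_suspect_record(record, min_ratio=0.5):
    """Return the reasons a record looks suspicious (empty list means none)."""
    content = record.get("content", "")
    if not content.strip():
        return ["empty content"]
    reasons = []
    if "\ufffd" in content:
        # U+FFFD usually signals a decoding or extraction failure.
        reasons.append("replacement character")
    if devanagari_ratio(content) < min_ratio:
        # Assumed threshold; tune it for bilingual entries.
        reasons.append("low Devanagari ratio")
    return reasons


print(flag_suspect_record({"content": "यो ऐन बनाएको छ।"}))
print(flag_suspect_record({"content": "abc \ufffd"}))
```

Flagged records are candidates for checking against the official source, not confirmed errors.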
### Recommendations
- Use this dataset as a starting point for legal research or NLP projects.
- Cross-reference with official legal documents for critical applications.
## Citation
If you use this dataset, please cite it as:
```
@dataset{ranjitraut_nepal_section_wise_act_datasets,
author={Ranjit Raut},
title={Nepal Section-wise Act Datasets},
year={2026},
url={https://huggingface.co/datasets/ranjitraut/nepal-section-wise-act-datasets}
}
```
Provider:
ranjitraut



