five

ranjitraut/nepal-section-wise-act-datasets

收藏
Hugging Face2026-03-17 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/ranjitraut/nepal-section-wise-act-datasets
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: fair-noncommercial-research-license task_categories: - question-answering - text-generation - text-to-speech language: - ne tags: - legal pretty_name: Nepal Section-wise Act Datasets size_categories: - 100K<n<1M --- # Nepal Section-wise Act Datasets ## Dataset Description This dataset contains section-wise legal acts and laws of Nepal, organized for easy access and analysis. It is designed to support legal research, natural language processing (NLP) tasks, and the development of legal tech applications in Nepal. ### Features - **Section-wise organization**: Each act is broken down by its sections, making it easy to reference specific legal provisions. - **Comprehensive coverage**: Includes major acts and laws relevant to Nepal. ### Use Cases - Legal research and analysis - NLP model training (e.g., legal chatbots, document classification) - Educational purposes for law students and practitioners - Development of legal tech tools ## Dataset Structure ### Data Fields | Field | Type | Description | |---------------|-----------|----------------------------------------------| | act_name | string | Name of the act/law | | section | string | Section number and title | | content | string | Full text of the section | ### Example ```json { "act_name": "रेडियोधर्मी पदार्थ (उपयोग तथा नियमन) ऐन, २०७७", "section": " परिच्छेद-१", "content": "रेडियोधर्मी पदार्थ एवं प्रविधिको सुरक्षित तथा शान्तिपूर्ण उपयोग गर्न, अध्ययन अनुसन्धान गर्न र आयनीकरण गर्ने बिकिरणवाट पर्न सक्ने प्रतिकूल प्रभाववाट सर्वसाधारणको जीउधनको सुरक्षा तथा वातावरण संरक्षण गर्ने सम्बन्धमा आवश्यक कानूनी व्यवस्था गर्न बाञ्छुनीय भएकोले, सदृधीय संसदले यो ऐन बनाएको छ।", } ``` ## Usage Load with Hugging Face Datasets library: ```python from datasets import load_dataset # Full dataset dataset = load_dataset("ranjitraut/nepal-section-wise-act-datasets", split="train") # Streaming for large-scale use streamed = load_dataset("ranjitraut/nepal-section-wise-act-datasets", split="train", streaming=True) for example in streamed: print(example["act_name"], example["content"][:100]) ``` ## Creation Methodology - Sourced from official Nepali government legal repositories (e.g., lawcommission.gov.np). - Processed via text extraction, section-wise splitting, and cleaning. - Language: Primarily Nepali (Devanagari script); some bilingual entries. - No PII or sensitive data included; anonymized where needed. ## Limitations ## Dataset Creation ### Curation Rationale The dataset was created to address the lack of structured, accessible legal data in Nepal. By providing section-wise legal texts, we aim to facilitate research, education, and innovation in the legal domain. ### Source Data - **Initial Data Collection**: Collected from official government publications and legal portals. - **Processing**: Cleaned, structured, and validated by legal experts and data scientists. ## Considerations ### Social Impact - Promotes transparency and accessibility of legal information in Nepal. - Supports the development of AI tools for legal aid and education. ### Limitations - Covers only a subset of Nepalese laws; not exhaustive. - Legal texts may be subject to amendments; always verify with official sources. - Potential OCR errors in scanned documents. - Not exhaustive; focuses on major acts. ### Recommendations - Use this dataset as a starting point for legal research or NLP projects. - Cross-reference with official legal documents for critical applications. ## Citation If you use this dataset, please cite it as: ``` @dataset{ranjitraut_nepal_section_wise_act_datasets, author={Ranjit Raut}, title={Nepal Section-wise Act Datasets}, year={2026}, url={https://huggingface.co/datasets/ranjitraut/nepal-section-wise-act-datasets} } ```
提供机构:
ranjitraut
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作