five

dheerajpabolu/Indian_Laws_Structured_Legal_Dataset

收藏
Hugging Face2026-04-04 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/dheerajpabolu/Indian_Laws_Structured_Legal_Dataset
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit --- # 📚 Indian Legal Acts Dataset (Structured Sections) ## 🧾 Overview This dataset provides structured, machine-readable legal text from major Indian statutes, including: - Bharatiya Nyaya Sanhita, 2023 (BNS) - Code of Criminal Procedure, 1973 (CrPC) - Code of Civil Procedure, 1908 (CPC) - Indian Evidence Act, 1872 (IEA) - Negotiable Instruments Act, 1881 (NIA) - Motor Vehicles Act, 1988 (MVA) - Indian Divorce Act, 1869 (IDA) Each entry represents a **section or chunk of a section**, enriched with metadata for efficient retrieval and NLP applications. --- ## 🎯 Intended Use This dataset is designed for: - Legal question answering systems - Semantic search and document retrieval - Retrieval-Augmented Generation (RAG) pipelines - Legal text classification - Training domain-specific language models --- ## 🏗️ Data Structure Each record contains: - Act-level metadata (name, code, effective date) - Structural hierarchy (chapter, section) - Legal content (full section text) - Search-optimized text (`search_text`) - Chunking support for long sections --- ## 🧠 Key Features - ✅ Structured legal hierarchy - ✅ Chunked for LLM compatibility - ✅ Optimized for vector search - ✅ Multi-domain legal coverage - ✅ High-quality statutory text --- ## 🚀 Example Use Cases - Build legal chatbots - Create semantic legal search engines - Implement RAG pipelines with LLMs - Perform legal document classification --- ## ⚠️ Limitations - Does not include judicial interpretations or case law - Legal language may require domain expertise - Some sections are split into multiple chunks --- ## 📜 License This dataset contains publicly available legal texts. Users should ensure compliance with applicable laws and data usage policies in their jurisdiction. --- ## 🤝 Contributions Contributions are welcome: - Add more acts - Improve annotations - Include case law or summaries ---
提供机构:
dheerajpabolu
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作