IDTheftCase-JudgmentCorpus: Indonesian Theft Case Judgment Corpus - Levels of Court
收藏DataCite Commons2025-04-09 更新2025-04-16 收录
下载链接:
https://data.mendeley.com/datasets/48x9xm7rkf/1
下载链接
链接失效反馈官方服务:
资源简介:
IDTheftCase-JudgmentCorpus: Indonesian Theft Case Judgment Corpus – Levels of Court is a dataset containing the full-text documents of written judgments handed down by Indonesian courts in criminal theft cases at three levels: the court of first instance, the appellate court, and the cassation court. The dataset was created to support research and development activities in information extraction and natural language processing, specifically about the processing and understanding the legal texts and court documents.
The dataset includes the full text of judgments with information about defendants, judges, types of punishment, hearing dates, and other relevant data for analysis. The dataset is organized into several files:
1. Manually Annotated Judgment Documents
• 1-pertama.json: Annotated judgments from the court of first instance.
• 2-banding.json: Annotated judgments from the appellate court.
• 3-kasasi.json: Annotated judgments from the cassation court.
2. Non-Annotated Judgment Documents
• 1-pertama-not-annotated.json: Non-annotated judgments from the court of first instance.
3. Metadata File metadata.csv : Contains contextual and hierarchical information about the judgment documents, structured into the following columns:
• Id: Unique case identifier.
• Id Putusan: Original ID of the document, distinguishing records from the Supreme Court’s website.
• Tingkat Proses: Indicates the court level (First Instance, Appellate, Cassation).
• Jenis Lembaga Pengadilan: Type of judicial institution handling the case (e.g., PN, PT, MA).
• Lembaga Pengadilan: Name of the judicial institution.
• Tahun: The year the judgment was issued.
• Inckrah: Indicates the case’s final resolution level (First Instance, Appellate, or Cassation).
• Amar: Type of decision issued (e.g., acquittal, conviction).
• Id Pertama: Related first-instance document ID.
• Id Banding: Related appellate document ID.
• Id Kasasi: Related cassation document ID.
All documents in this dataset were obtained from public records on the official website of the Supreme Court of the Republic of Indonesia (https://putusan3.mahkamahagung.go.id/). As such, the dataset represents real-world cases and reflects the legal form of Indonesian court documents.
IDTheftCase-JudgmentCorpus is an essential dataset for research in named entity recognition and extraction, punishment imposition pattern analysis, and automatic document classification in the Indonesian legal context. Moreover, the dataset is useful for developers and researchers who aim to build and implement machine learning-based models to extract, group, and analyze judgment documents at different court levels.
提供机构:
Mendeley Data
创建时间:
2024-12-09



