ZySec-AI/data-extraction

Name: ZySec-AI/data-extraction
Creator: ZySec-AI
Published: 2025-09-12 13:08:11
License: 暂无描述

Hugging Face2025-09-12 更新2025-09-13 收录

下载链接：

https://hf-mirror.com/datasets/ZySec-AI/data-extraction

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是一个全面的训练语言模型的数据集，用于法律和调查背景下的问答生成、原子关系提取和意图检测。数据集包括三个主要部分：1）问题生成和原子关系数据，包含来自复杂法律文件的训练数据，如案例研究、话题覆盖等；2）意图提取训练数据，专注于提高法律执法和法律背景下的用户意图检测；3）数据格式部分详细描述了JSONL文件的结构。数据集适用于法律AI系统、调查工具、语言模型训练以及学术研究。

This dataset is a comprehensive training resource for language models, designed for question generation, atomic relations extraction, and intent detection in legal and investigative contexts. It includes three main components: 1) Question Generation and Atomic Relations data, with training data from complex legal documents such as case studies and topic coverage; 2) Intent Extractions Training Data, focused on improving user intent detection in legal enforcement and legal contexts; 3) Data Format details the structure of the JSONL files. The dataset is suitable for Legal AI Systems, Investigation Tools, Language Model Training, and Academic Research.

提供机构：

ZySec-AI

5,000+

优质数据集

54 个

任务类型

进入经典数据集