paraloq/json_data_extraction
收藏Hugging Face2024-03-25 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/paraloq/json_data_extraction
下载链接
链接失效反馈官方服务:
资源简介:
Diverse Restricted JSON Data Extraction数据集是一个用于受限JSON数据提取的多样化数据集,涵盖了多个主题领域,如医疗、电子商务、商业、旅行、媒体、技术和制造等。数据集的主要用途包括基准测试受限JSON数据提取、微调数据提取模型和JSON模式检索模型。数据集的结构包括标题、主题、JSON模式、数据实例、媒介和文本。数据是通过Google的Gemini-Pro模型生成的,旨在提供多样化的数据实例。数据集可能包含来自Google的Gemini-Pro模型的偏见和风险。
The Diverse Restricted JSON Data Extraction dataset is a diversified dataset designed for restricted JSON data extraction tasks, covering multiple thematic domains such as healthcare, e-commerce, business, travel, media, technology, and manufacturing. Its primary applications include benchmarking restricted JSON data extraction tasks, fine-tuning data extraction models and JSON schema retrieval models. The dataset consists of components including title, theme, JSON schema, data instances, medium, and text. The data was generated using Google's Gemini-Pro model with the goal of providing diverse data instances. The dataset may contain biases and risks associated with Google's Gemini-Pro model.
提供机构:
paraloq
原始信息汇总
数据集概述
本数据集仅供研究目的使用。



