llm-jp/extraction-wiki-ja

Name: llm-jp/extraction-wiki-ja
Creator: llm-jp
Published: 2025-05-30 04:58:55
License: 暂无描述

Hugging Face2025-05-30 更新2025-05-31 收录

下载链接：

https://hf-mirror.com/datasets/llm-jp/extraction-wiki-ja

下载链接

链接失效反馈

官方服务：

资源简介：

这是一个为信息抽取和结构化任务定制的日语指令微调数据集，由LLM-jp项目开发。数据集包含了从日本维基百科文章自动生成的指令-响应对，这些指令和响应都经过了一个特定模型的筛选。数据集分为三个子集，分别是v0.1、v0.2和v0.3，其中v0.1和v0.2是两轮对话格式（指令+响应），而v0.3是四轮对话格式（指令+响应+指令+响应）。

This is a Japanese instruction-tuning dataset tailored for information extraction and structuring tasks, developed by the LLM-jp project. The dataset consists of automatically generated instruction-response pairs from Japanese Wikipedia articles, which have been filtered by a specific model to ensure quality. The dataset is divided into three subsets: v0.1, v0.2, and v0.3, with v0.1 and v0.2 in a two-turn dialogue format (instruction + response) and v0.3 in a four-turn dialogue format (instruction + response + instruction + response).

提供机构：

llm-jp

5,000+

优质数据集

54 个

任务类型

进入经典数据集