
huytd189/japanese-grammar-correction

Source: Hugging Face. Updated 2025-10-11; indexed 2025-10-25.
Download link:
https://hf-mirror.com/datasets/huytd189/japanese-grammar-correction
Description:
The Japanese Grammar Correction Dataset is designed to train language models to identify and correct a wide range of grammatical errors and stylistic issues in Japanese text. It consists of pairs of incorrect and correct sentences, along with metadata that classifies the error type and provides additional context. The dataset was created through both manual curation of discussions in Japanese learning communities and synthetic generation with an LLM. It is stored in CSV format with the columns incorrect_text, correct_text, error_type, target_word, comment, and difficulty.
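The column layout described above can be read with Python's standard `csv` module. The following is a minimal sketch, not the dataset author's tooling: the helper name `read_pairs` is hypothetical, and the sample row (including the values `particle` and `beginner`) is invented for illustration and is not taken from the dataset. Only the six column names come from the description.

```python
import csv
import io

# Column names as listed in the dataset description.
EXPECTED_COLUMNS = [
    "incorrect_text", "correct_text", "error_type",
    "target_word", "comment", "difficulty",
]

# Invented sample row for illustration only; NOT taken from the dataset.
# The error_type and difficulty values are assumptions about plausible labels.
SAMPLE_CSV = (
    "incorrect_text,correct_text,error_type,target_word,comment,difficulty\n"
    "私は学校に勉強します。,私は学校で勉強します。,particle,で,"
    "The place where an action occurs takes で,beginner\n"
)


def read_pairs(handle):
    """Read correction pairs from an open CSV handle (hypothetical helper).

    Raises ValueError if any of the described columns is absent.
    """
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


rows = read_pairs(io.StringIO(SAMPLE_CSV))
for row in rows:
    # Print each incorrect/correct pair with its error category.
    print(f"{row['incorrect_text']} -> {row['correct_text']} [{row['error_type']}]")
```

To use this on the real data, one would pass a handle opened with `open(path, encoding="utf-8", newline="")` on the downloaded CSV file; the actual file name is not stated in the listing, so it is left to the reader.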
Provider:
huytd189