GPTeacher-General-Instruct
Source: ModelScope community. Updated 2026-01-07. Indexed 2025-12-06.
Download link:
https://modelscope.cn/datasets/teknium/GPTeacher-General-Instruct
Description:
GPTeacher General-Instruct is a GPT-4-generated self-instruct dataset.
There are multiple versions, each with a different degree of similarity reduction.
The dedupe-only version contains 18,194 entries; the more similarity is reduced, the fewer entries a version contains.
The format is identical to Alpaca's, with a variable mix of Instruction/Input/Response and Instruction/NullInput/Response records.
Learn more on GitHub:
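Because some records carry an input and others do not, any code that turns records into prompts has to handle both shapes. The following is a minimal sketch under stated assumptions: the lowercase keys `instruction`, `input`, and `response`, the `build_prompt` helper, the section headers, and the two sample records are all hypothetical illustrations, not taken from the dataset or its repository.

```python
def build_prompt(record: dict) -> str:
    """Render one record as a prompt, omitting the input section when it is empty.

    Assumes hypothetical keys "instruction" and "input"; the real
    column names in the dataset files may differ.
    """
    instruction = record["instruction"].strip()
    # A missing, None, or blank input is treated as "no input".
    context = (record.get("input") or "").strip()

    parts = [f"### Instruction:\n{instruction}"]
    if context:
        parts.append(f"### Input:\n{context}")
    parts.append("### Response:\n")
    return "\n\n".join(parts)


# Made-up sample records covering both shapes.
with_input = {
    "instruction": "Translate to French.",
    "input": "Good morning",
    "response": "Bonjour",
}
without_input = {
    "instruction": "Name a primary color.",
    "input": "",
    "response": "Red",
}

print(build_prompt(with_input))
print(build_prompt(without_input))
```

The `response` field is left out of the prompt on purpose, since it is the target the model is expected to produce.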
https://github.com/teknium1/GPTeacher
Provider:
maas
Created:
2025-11-18



