jhonparra18/distill-gpt4-eng-chat_1024tk
收藏Hugging Face2025-03-30 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/jhonparra18/distill-gpt4-eng-chat_1024tk
下载链接
链接失效反馈官方服务:
资源简介:
这是一个用于微调GPT-2大型模型的文本生成数据集,包含了请求(request)和响应(response)文本对,以及一个表示问题-答案长度的布尔字段(len_q_a)。数据集由训练集(train)组成,总大小约为962MB,包含411,490个示例。文本超过1024个token的记录已被过滤。
This is a text generation dataset for fine-tuning GPT-2 large model, which includes request (request) and response (response) text pairs, and a boolean field (len_q_a) indicating the length of question-answer. The dataset consists of a training set (train), with a total size of approximately 962MB, containing 411,490 examples. Records with more than 1024 tokens of text have been filtered out.
提供机构:
jhonparra18



