marcuscedricridia/Open-Critic-GPT-deepclean-sharegpt
收藏Hugging Face2025-04-04 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/marcuscedricridia/Open-Critic-GPT-deepclean-sharegpt
下载链接
链接失效反馈官方服务:
资源简介:
Vezora/Open-Critic-GPT数据集是一个用于自然语言处理任务的数据集,具体应用于训练或评估GPT模型。数据集包含两列输入,分别是人类输入和助手输入,输出格式为sharegpt,即包含对话信息的格式。经过一系列的数据清洗步骤,包括去除重复数据、长度过滤、语言过滤等,最终数据集大小为1604条记录。
The Vezora/Open-Critic-GPT dataset is a natural language processing dataset designed for training or evaluating GPT models. The dataset includes two input columns, one for human input and the other for assistant input, with an output format of sharegpt, which contains conversation information. After a series of data cleaning steps, including deduplication, length filtering, language filtering, etc., the final dataset size is reduced to 1604 records.
提供机构:
marcuscedricridia



