CJJones/Cosmopedia_QA_RAG_JSON_SQLite
收藏Hugging Face2025-11-01 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/CJJones/Cosmopedia_QA_RAG_JSON_SQLite
下载链接
链接失效反馈官方服务:
资源简介:
此数据集包含了一个专门在Cosmopedia数据集上微调的GPT模型的生成输出。它包含35,000+模型交互,并且数据集大小持续增长。数据集设计用于训练和评估会话AI系统、指令遵循模型和文本生成系统。数据集以英语为语言,遵循CC BY-SA 4.0许可证。数据结构包括输入文本、模型输出、处理时间、成功标志、验证标志、Cosmopedia标识符、来源标题、内容类别、原始数据源、合成数据归属、段落在源中的位置、源中的总段落数、处理时间戳和模型标识符。
This dataset consists of output generations from a specialized GPT model fine-tuned on the Cosmopedia dataset. It includes over 35,000 model interactions and is continuously growing. The dataset is designed for training and evaluating conversational AI systems, instruction-following models, and text generation systems. The dataset is in English and is licensed under CC BY-SA 4.0. The data structure includes input text, model-generated output, processing time, success flag, validation flag, Cosmopedia ID, source title, content category, original data source, synthetic data attribution, paragraph index in the source, total paragraphs in the source, processing timestamp, and model identifier.
提供机构:
CJJones



