seangogo/processed_tldr_sft_dataset_20251029_044328
Source: Hugging Face | Updated: 2025-10-29 | Indexed: 2025-11-15
Download link:
https://hf-mirror.com/datasets/seangogo/processed_tldr_sft_dataset_20251029_044328
Description:
This dataset is built on OpenAI's Summarize from Feedback task. Each record contains the post's unique identifier, subreddit, title, post body, and summary, among other fields. Preprocessing adds query and response fields for summarization, along with their tokenized versions and token lengths. The data is used to train models to generate summaries from feedback.
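The preprocessing described above can be sketched roughly as follows. This is a minimal illustration, not the dataset author's actual pipeline: the prompt template, the output field names (`query_token`, `response_token`, and so on), and the whitespace tokenizer are all assumptions made for the example. A real pipeline would most likely use a model-specific tokenizer.

```python
from typing import Callable, Dict, List


def build_sft_example(
    post: Dict[str, str],
    tokenize: Callable[[str], List[str]] = str.split,
) -> Dict[str, object]:
    """Turn one raw post into query/response fields with token info.

    Assumed input keys: id, subreddit, title, post, summary.
    The default tokenizer splits on whitespace and is only a stand-in.
    """
    # Hypothetical prompt template; the listing does not state the real one.
    query = (
        f"SUBREDDIT: r/{post['subreddit']}\n\n"
        f"TITLE: {post['title']}\n\n"
        f"POST: {post['post']}\n\n"
        "TL;DR:"
    )
    # The leading space separates the response from the "TL;DR:" marker.
    response = " " + post["summary"].strip()

    query_tokens = tokenize(query)
    response_tokens = tokenize(response)

    return {
        "id": post["id"],
        "query": query,
        "response": response,
        "query_token": query_tokens,
        "response_token": response_tokens,
        "query_token_len": len(query_tokens),
        "response_token_len": len(response_tokens),
    }
```

Passing a different `tokenize` callable (for example, a subword tokenizer's encode function) changes the token fields and lengths without touching the query and response text.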
Provided by:
seangogo



