Aipresso/MEGA-cleaned-prompts
收藏Hugging Face2025-10-16 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/Aipresso/MEGA-cleaned-prompts
下载链接
链接失效反馈官方服务:
资源简介:
清洗提示巨数据集是一个包含2.7百万个经过清洗的英语提示的全面集合,这些提示经过精心处理,用于训练高级语言模型和AI系统。数据集已经去重、过滤为仅包含英语内容、移除了不必要的引号和格式化元素,并且为每个提示计算了token数量,移除了空白和低质量条目。
Cleaned Prompts Mega Dataset is a comprehensive collection of 2.7 million cleaned English prompts, meticulously processed for training advanced language models and AI systems. The dataset has been deduplicated, filtered for English-only content, stripped of unnecessary quotes and formatting artifacts, token counts calculated for each prompt, and removed blank and low-quality entries.
提供机构:
Aipresso



