natong19/lmsys-chat-1m-filtered
收藏Hugging Face2025-12-15 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/natong19/lmsys-chat-1m-filtered
下载链接
链接失效反馈官方服务:
资源简介:
这是一个过滤版本的lmsys-chat-1m数据集,包含与各种大型语言模型(LLMs)的真实对话。原始数据集有100万条对话,经过多步过滤处理(包括移除敏感信息、格式验证、多种去重方法等)后,最终保留约25.9万条高质量的对话样本。
Filtered version of lmsys-chat-1m dataset, containing real-world conversations with various LLMs. The original 1M samples were processed through multiple filtering steps (including PII removal, format validation, various deduplication methods, etc.), resulting in ~259k high-quality conversation samples.
提供机构:
natong19



