Nitral-AI/RCSI-v2_ShareGPT
收藏Hugging Face2025-02-01 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/Nitral-AI/RCSI-v2_ShareGPT
下载链接
链接失效反馈官方服务:
资源简介:
Reddit评论风格指令数据集v2,来源于HuggingFace的nbeerbower/reddit-dpo,包含英语评论。数据集经过清洗,去除了错误、拒绝的内容、过度常见的n-gram和拒绝等,通过字符串匹配和最小哈希进行了去重,通过分类器模型和字符串匹配去除了无用的部分,并进行了拼写和语法修正。
Reddit Comment Style Instruct-v2 dataset sourced from HuggingFaces nbeerbower/reddit-dpo, containing English comments. The dataset has been cleaned, removing errors, rejections, over-prevalent n-grams, and refusals, deduplicated through string match and min-hash, deslopped through classifier model and string match, and spelling and grammar have been corrected.
提供机构:
Nitral-AI



