bigstupidhats/wildchat_conversations
收藏Hugging Face2024-11-23 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/bigstupidhats/wildchat_conversations
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多语言对话数据,涵盖了阿拉伯语、德语、英语、西班牙语、印地语、俄语、斯瓦希里语和中文。数据集的字段包括对话ID、模型、时间戳、对话内容、语言、是否经过编辑、角色、是否含有毒性内容等。此外,还包含了OpenAI和Detoxify的审核结果,以及对话的指令和输出。数据集提供了每种语言的字节大小和示例数量,适用于多语言对话分析和毒性内容检测等任务。
This dataset contains multilingual conversation data, recording detailed information about the conversations, including conversation ID, model, timestamp, conversation content, language, whether it has been redacted, role, and whether it contains toxicity. The dataset also includes moderation results for the conversations, such as OpenAI and Detoxify moderation categories and scores, and whether they are flagged as toxic. The dataset is divided into multiple languages including Arabic, German, English, Spanish, Hindi, Russian, Swahili, and Chinese.
提供机构:
bigstupidhats



