ai2-adapt-dev/tulu_v3.9_wildchat_100k_english
收藏Hugging Face2024-11-22 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/ai2-adapt-dev/tulu_v3.9_wildchat_100k_english
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含对话数据,每个对话包含哈希值、模型、时间戳、对话内容、语言、国家、IP地址哈希、用户代理、消息内容等信息。此外,数据集还包含了OpenAI和Detoxify的审核结果,如骚扰、仇恨、自残、性内容、暴力等类别的标记和分数。数据集分为训练集,包含62784个样本,总大小为1511785315.30176字节。
This dataset contains conversation data, each including a hash value, model, timestamp, conversation content, language, country, hashed IP address, user agent, message content, and more. Additionally, the dataset includes moderation results from OpenAI and Detoxify, such as flags and scores for categories like harassment, hate, self-harm, sexual content, and violence. The dataset is divided into a training set containing 62,784 samples, with a total size of 1,511,785,315.30176 bytes.
提供机构:
ai2-adapt-dev



