mtimur/distill-gpt4-eng-chat
收藏Hugging Face2024-12-12 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/mtimur/distill-gpt4-eng-chat
下载链接
链接失效反馈官方服务:
资源简介:
该数据集由GPT4对用户请求的回答构成,这些请求来源于[allenai/WildChat-1M](https://huggingface.co/datasets/allenai/WildChat-1M)和[causal-lm/instructions](https://huggingface.co/datasets/causal-lm/instructions)两个数据集。文本(请求和响应)在四种情况下被删除:包含非英文字母和特殊符号、包含http链接、包含html块以及复杂度超过1.5倍四分位距加上第三四分位数的文本(在某些情况下使用句子的平均复杂度值或最大值)。
The dataset consists of GPT-4 answers to user requests, with queries sourced from allenai/WildChat-1M and causal-lm/instructions. Texts (requests and responses) were deleted in three cases: containing non-English letters and special symbols, containing http-links, containing html blocks, and in some cases, perplexity values of sentences or maximum values exceeding 1.5*IQR + third quantile.
提供机构:
mtimur



