AI-Secure/DecodingTrust

Name: AI-Secure/DecodingTrust
Creator: AI-Secure
Published: 2024-02-18 06:29:12
License: 暂无描述

Hugging Face2024-02-18 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/AI-Secure/DecodingTrust

下载链接

链接失效反馈

官方服务：

资源简介：

DecodingTrust数据集旨在帮助研究人员更好地理解部署最先进的大型语言模型（LLMs）时的能力、局限性和潜在风险。该数据集涵盖了八个主要领域的可信度评估，包括毒性、刻板印象和偏见、对抗性鲁棒性、分布外鲁棒性、隐私、对抗性演示的鲁棒性、机器伦理和公平性。

The DecodingTrust dataset is designed to help researchers better understand the capabilities, limitations, and potential risks of deploying state-of-the-art Large Language Models (LLMs). This dataset covers trustworthiness evaluation across eight major domains, including toxicity, stereotypes and bias, adversarial robustness, out-of-distribution robustness, privacy, robustness against adversarial demonstrations, machine ethics, and fairness.

提供机构：

AI-Secure

原始信息汇总