gallifantjack/bigbio_pubmed_qa_N_A
收藏Hugging Face2024-12-11 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/gallifantjack/bigbio_pubmed_qa_N_A
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含bigbio_pubmed_qa的评估结果,带有标签列N_A,以及各种模型性能指标和样本。数据集的特征包括样本的唯一标识符、用户查询、助手响应、预期输出、响应得分、得分解释、输入列、标签列、模型名称和原始数据集名称。数据集的用途包括评估模型在不同任务中的鲁棒性、评估模型响应中的潜在偏见以及模型性能的监控和分析。
This dataset contains evaluation results for bigbio_pubmed_qa with label column N_A, with various model performance metrics and samples. The dataset features include a unique identifier for the sample, user query/content, assistant response, expected output, score of the assistants response, explanation of the score, input column and label column in the original dataset, model name used in evaluation, and name of the original dataset used. This dataset can be used for evaluating model robustness across various tasks, assessing potential biases in model responses, and model performance monitoring and analysis.
提供机构:
gallifantjack



