arena-human-preference-100k
收藏魔搭社区2026-01-06 更新2025-02-22 收录
下载链接:
https://modelscope.cn/datasets/AI-ModelScope/arena-human-preference-100k
下载链接
链接失效反馈官方服务:
资源简介:
## Overview
This dataset contains leaderboard conversation data collected between June 2024 and August 2024.
It includes English human preference evaluations used to develop [Arena Explorer](http://lmarena.ai/explore).
Additionally, we provide an embedding file, which contains precomputed embeddings for the English conversations.
These embeddings are used in the topic modeling pipeline to categorize and analyze these conversations.
For a detailed exploration of the dataset and analysis methods, refer to the [notebook](https://colab.research.google.com/drive/1chzqjePYnpq08fA3KzyKvSkuzCjojyiE?usp=sharing) and
[blogpost](https://blog.lmarena.ai/blog/2025/arena-explorer/),
which provides a step-by-step workflow for processing data and insights derived.
## License
User prompts are licensed under CC-BY-4.0, and model outputs are governed by the terms of use set by the respective model providers.
## Citation
```bibtex
@misc{tang2025explorer,
title={Arena Explorer: A Topic Modeling Pipeline for LLM Evals & Analytics},
author={Kelly Tang and Wei-Lin Chiang and Anastasios N. Angelopoulos}
year={2025},
}
@misc{chiang2024chatbot,
title={Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference},
author={Wei-Lin Chiang and Lianmin Zheng and Ying Sheng and Anastasios Nikolas Angelopoulos and Tianle Li and Dacheng Li and Hao Zhang and Banghua Zhu and Michael Jordan and Joseph E. Gonzalez and Ion Stoica},
year={2024},
eprint={2403.04132},
archivePrefix={arXiv},
primaryClass={cs.AI}
}
```
## 概述
本数据集包含2024年6月至2024年8月期间采集的排行榜对话数据。
其包含用于开发Arena Explorer(http://lmarena.ai/explore)的英文人类偏好评估数据。此外,本数据集附带嵌入文件(embedding file),其中包含针对上述英文对话的预计算嵌入向量(precomputed embedding);这些嵌入向量被用于主题建模(topic modeling)流程,以对对话进行分类与分析。
若需了解本数据集与分析方法的详细探索内容,请参考配套的[Jupyter Notebook](https://colab.research.google.com/drive/1chzqjePYnpq08fA3KzyKvSkuzCjojyiE?usp=sharing)与[博客文章](https://blog.lmarena.ai/blog/2025/arena-explorer/),其中详细给出了数据处理流程与衍生洞察的分步指南。
## 授权协议
用户提示词采用CC-BY-4.0协议授权,模型输出内容则受各模型服务商的使用条款约束。
## 引用
bibtex
@misc{tang2025explorer,
title={Arena Explorer: A Topic Modeling Pipeline for LLM Evals & Analytics},
author={Kelly Tang and Wei-Lin Chiang and Anastasios N. Angelopoulos}
year={2025},
}
@misc{chiang2024chatbot,
title={Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference},
author={Wei-Lin Chiang and Lianmin Zheng and Ying Sheng and Anastasios Nikolas Angelopoulos and Tianle Li and Dacheng Li and Hao Zhang and Banghua Zhu and Michael Jordan and Joseph E. Gonzalez and Ion Stoica},
year={2024},
eprint={2403.04132},
archivePrefix={arXiv},
primaryClass={cs.AI}
}
提供机构:
maas
创建时间:
2025-02-20



