SeaBench
收藏SeaBench 数据集概述
基本信息
- 许可证: Apache 2.0
- 语言:
- 越南语 (vi)
- 印度尼西亚语 (id)
- 泰语 (th)
- 配置:
- 配置名称: Question
- 数据文件: public-questions.jsonl
- 任务类别: 文本生成
- 数据规模: n<1K
数据集描述
SeaBench 数据集旨在评估大型语言模型 (LLMs) 在东南亚语言中的能力,特别是通过精心设计的评估任务来评估模型在印度尼西亚语、泰语和越南语中的多轮对话和指令跟随能力。
引用
如果您发现 SeaBench 对您的研究有用,请考虑引用以下论文:
@article{damonlp2024seallm3, author = {Wenxuan Zhang*, Hou Pong Chan*, Yiran Zhao*, Mahani Aljunied*, Jianyu Wang*, Chaoqun Liu, Yue Deng, Zhiqiang Hu, Weiwen Xu, Yew Ken Chia, Xin Li, Lidong Bing}, title = {SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages}, year = {2024}, url = {https://arxiv.org/abs/2407.19672} }
@article{damonlpsg2023seallm, author = {Xuan-Phi Nguyen*, Wenxuan Zhang*, Xin Li*, Mahani Aljunied*, Zhiqiang Hu, Chenhui Shen, Yew Ken Chia, Xingxuan Li, Jianyu Wang, Qingyu Tan, Liying Cheng, Guanzheng Chen, Yue Deng, Sen Yang, Chaoqun Liu, Hang Zhang, Lidong Bing}, title = {SeaLLMs - Large Language Models for Southeast Asia}, year = {2024}, booktitle = {ACL 2024 System Demonstrations}, url = {https://arxiv.org/pdf/2312.00738}, }




