AlpacaEval
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/tatsu-lab/alpaca_eval
下载链接
链接失效反馈官方服务:
资源简介:
该数据集旨在用于引导Guanaco-7b模型分析水印技术,同时它也用于评估在大规模语言模型上水印技巧的性能。该数据集主要针对自然语言处理领域,特别是自由格式生成任务。
This dataset is designed to enable the Guanaco-7b model to analyze watermarking techniques, and it is also utilized to evaluate the performance of watermarking strategies on large language models (LLMs). It primarily targets the field of natural language processing, with a specific focus on free-form generation tasks.



