GEMv2
arXiv · Updated 2022-06-24 · Indexed 2024-07-24
Download link:
https://gem-benchmark.com/
Resource description:
GEMv2 is a multilingual benchmark for natural language generation (NLG), developed under the lead of Google Research. It aims to advance NLG through standardized evaluation workflows and supports 51 languages. The benchmark covers a range of tasks, including data-to-text generation, summarization, and response generation, with datasets of varying size and complexity. Its design emphasizes modularity and extensibility, so that newly developed datasets can be integrated continuously. GEMv2 targets the evaluation challenges in NLG, with the goals of improving model performance and diversity and of supporting research and development in multilingual settings.
Provided by:
Google Research
Created:
2022-06-23



