AVATAR
收藏arXiv2023-05-05 更新2024-06-21 收录
下载链接:
https://github.com/wasiahmad/AVATAR
下载链接
链接失效反馈官方服务:
资源简介:
AVATAR数据集由加利福尼亚大学洛杉矶分校创建,包含9,515个编程问题及其Java和Python解决方案。该数据集从竞赛编程网站、在线平台和开源存储库收集,特别包括250个示例的单元测试,以评估程序翻译的功能正确性。AVATAR数据集的创建过程涉及数据收集、预处理和过滤,确保数据的多样性和质量。该数据集主要应用于编程语言间的自动翻译,旨在解决软件开发中跨语言迁移的效率和成本问题。
The AVATAR dataset was created by the University of California, Los Angeles. It contains 9,515 programming problems along with their corresponding Java and Python solutions. This dataset is collected from competitive programming websites, online platforms and open-source repositories, and specifically includes unit tests for 250 examples to evaluate the functional correctness of program translation. The creation process of the AVATAR dataset involves data collection, preprocessing and filtering, ensuring the diversity and quality of the dataset. This dataset is mainly applied to automatic translation between programming languages, aiming to address the efficiency and cost issues of cross-language migration in software development.
提供机构:
加利福尼亚大学洛杉矶分校
创建时间:
2021-08-26



