Large-Scale Software Observatorium (LASSO)
收藏arXiv2025-09-30 收录
下载链接:
https://softwareobservatorium.github.io/
下载链接
链接失效反馈官方服务:
资源简介:
该数据集由LASSO引入了三种新的数据结构,用于存储观测数据:序列表、刺激-响应矩阵(SRMs)和刺激-响应超立方体(SRHs)。这些结构以结构化的格式捕捉刺激-响应交互,便于在软件工程领域对生成式人工智能模型进行训练和评估。该数据集支持生成式人工智能在多个应用领域的发展,包括训练、增强生成、提示以及测试驱动的软件实验。任务领域涉及软件工程和生成式人工智能。
This dataset features three novel data structures introduced by LASSO for storing observational data: sequence tables, stimulus-response matrices (SRMs), and stimulus-response hypercubes (SRHs). These structures capture stimulus-response interactions in a structured format, facilitating the training and evaluation of generative AI models in the field of software engineering. This dataset supports the advancement of generative AI across multiple application domains, including training, augmented generation, prompting, and test-driven software experimentation. Its task domains span software engineering and generative artificial intelligence.
提供机构:
LASSO project team



