five

Malva

收藏
Figshare2026-03-25 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Malva/31851544
下载链接
链接失效反馈
官方服务:
资源简介:
Malva is a defect library for agentic GUI software, a class of systems in which large language model (LLM) agents autonomously operate graphical user interfaces to complete user-specified tasks without relying on programmatic APIs.This dataset supports an empirical study of defects in agentic GUI applications. It contains a benchmark of 40 open-source agentic GUI projects collected from GitHub, spanning web, desktop, and mobile environments. For each project, we record metadata including repository links, commit identifiers, deployment environments, and the LLM and perception mechanisms used.The dataset also includes a curated defect library that documents defects identified in these applications. For each defect instance, the dataset records the defect type, explanation, root cause, consequences, source-code location, and defect-triggering tests. Multiple cases of the same defect type are documented to capture different manifestations of similar issues.The artifact includes two main files: (1) application.csv, which describes the benchmark applications, and (2) defect.csv, which contains the organized defect library used in the study.
创建时间:
2026-03-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作