five

Hydrangea

收藏
DataCite Commons2025-02-12 更新2025-05-07 收录
下载链接:
https://figshare.com/articles/dataset/Hydrangea/28262426/2
下载链接
链接失效反馈
官方服务:
资源简介:
Hydrangea is a defect library for LLM-enabled software. Hydrangea has 4 main petals, each corresponding to one of the major components where defects often arise: LLM agent, vector database, software component, and system.What is LLM-enabled software?It is software that integrates LLMs (large language models) with RAG (retrieval-augmented generation ) support to realize intelligence features.It contains four components:<b>LLM agent</b> that manages LLM interfaces, constructs prompts, and invokes the LLM<b>Vector database</b> that supports RAG algorithm and enhances the LLM agent<b>Software component</b> that interacts with the first two components to perform certain tasks<b>System</b> that manages resources and privileges to carry out the executionWhat's inside the artifact:For enhanced availability and reusability, we offer an organized defect library utilized in our manual studies.Below are details of what is included in each part:Application benchmarkA suite of 100 non-trivial projects which tightly integrates LLMs and vector databases in their workflow.We have uploaded <code>application.csv</code>, it contains:software project nameGitHub link and commit IDclassificationused LLM and vector databaseHydrangea Defect LibraryThe result of TABLE Ⅱ in our paper can be reproduced by this organized defect library.In the uploaded <code>defect.csv</code>, we have documented different cases for the same defect type, as defects can manifest in various ways. For each distinct case of the same defect, we have separated them with a blank line and labeled them as case 1, case 2, and so on, according to the specific circumstances.It contains:A collection of defects in these projects (involves 100 projects),containingthe defect type and its detailed explanationthe exact file and source-code line location of the defectthe consequences of defectthe defect-triggering tests
提供机构:
figshare
创建时间:
2025-01-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作