Problem-Solution Dataset
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/cablelabs/llmdata
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是由OpenAI API生成的,包含了从400家顶级软件公司中提取的313个问题-解决方案映射。该数据集是通过大型语言模型(LLM)的提示创建的,用于对模型进行微调,以辅助内部创新。其规模涵盖了313个映射,任务是将问题映射到解决方案,反之亦然,这通过微调后的LLM模型来实现。
This dataset was generated via the OpenAI API, and contains 313 problem-solution mappings extracted from 400 leading software companies. Developed using prompts from Large Language Models (LLMs), it is designed for model fine-tuning to support internal innovation. With 313 mappings in total, the core task of this dataset is to perform bidirectional mapping between problems and solutions, which is implemented using the fine-tuned LLM models.
提供机构:
OpenAI, using GPT-3.5-turbo



