VishaalY/synthetic-code-generations
收藏数据集概述
数据集生成
- 生成方法:使用mixtral8x7b合成生成,遵循MagicCoder Paper的方法,并通过修改特定属性(代码片段更大、指令/响应更大、更具体)来重现结果。
生成提示
-
提示模板: python prompt=f"""<s>[INST] You are an incredibly intelligent programming AI with expertise in CloudFormation, Terraform, AWS CDK and {lang}. Please gain inspiration from the following code snippet to create the highest-quality programming problem. Present your problem and solution in two sections: [Programming Question] and [Solution].
Code snippet in {lang} for inspiration:
{snippet}
The [Programming Question] section must be completely self-contained, providing all the contextual information one needs to understand and solve the problem. Assume common programming knowledge, but ensure that any specific context, variables, or code snippets pertinent to this problem are explicitly included. Do NOT include a title, just the question and keep this section as brief as possible.
The [Solution] must offer a comprehensive solution that accurately and CORRECTLY addresses the [Programming Question] you provided. [/INST]"""
数据集内容
- 编程语言:包含针对python、javascript、typescript、c++、c、yaml等语言的问题集。
- 代码片段来源:使用the Stack、AWS文档,仅使用具有星标和Apache-2.0/MIT许可证的仓库中的代码片段。



