five

RNDRandoM/paul-graham-essays-qa

收藏
Hugging Face2024-04-29 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/RNDRandoM/paul-graham-essays-qa
下载链接
链接失效反馈
官方服务:
资源简介:
--- size_categories: - 1K<n<10K --- A Q&A dataset based on (almost) every Paul Graham's essay. Contains 15 questions about startups for each essay (however, can be less for those essays that are small).\ Political and non-startup-related essays were not considered in the dataset. \ The dataset was created synthetically using the `gpt-4-turbo-2024-04-09` model from the OpenAI API using the following prompt: ``` Essay: {essay} Generate 15 insightful question and answer pairs that encapsulate universal principles, strategic insights, and major ideas relevant to startups and entrepreneurship BASED ON THIS ESSAY. These Q&A pairs should be crafted in such a way that they can be understood and informative without access to any specific document or article, but cover its contents. Each answer must be detailed, clearly explaining the concepts in depth while ensuring relevance to the given essay. Avoid any reference that necessitates having prior access to a particular text. FOR EACH MISTAKE THAT INCLUDES ANY MENTION REQUIRING SPECIFIC SOURCE CONTEXT, I WILL KILL ONE KITTEN If the essay is not related to startups or too political, return an empty json. Give the answer in a JSON format - here's an example: {"questions_and_answers": [{"question": "What color is the sky?", "answer": "Blue"}, ...]} ``` Contains **3144** rows, comprised of **1537105** characters.
提供机构:
RNDRandoM
原始信息汇总

数据集概述

数据集内容

  • 主题:基于Paul Graham的创业相关论文的问答数据集。
  • 排除内容:政治和非创业相关的论文未被包含在内。

数据集生成

  • 生成方式:使用OpenAI的gpt-4-turbo-2024-04-09模型合成创建。
  • 生成规则:每篇论文生成15个关于创业的深入问题和答案,确保答案无需特定文档即可理解和获取信息。

数据集规模

  • 数据量:包含3144行。
  • 字符数:总计1537105个字符。

数据格式

  • 输出格式:JSON格式,示例: json {"questions_and_answers": [{"question": "What color is the sky?", "answer": "Blue"}, ...]}
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作