RNDRandoM/paul-graham-essays-qa
收藏Hugging Face2024-04-29 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/RNDRandoM/paul-graham-essays-qa
下载链接
链接失效反馈官方服务:
资源简介:
---
size_categories:
- 1K<n<10K
---
A Q&A dataset based on (almost) every Paul Graham's essay. Contains 15 questions about startups for each essay (however, can be less for those essays that are small).\
Political and non-startup-related essays were not considered in the dataset. \
The dataset was created synthetically using the `gpt-4-turbo-2024-04-09` model from the OpenAI API using the following prompt:
```
Essay: {essay}
Generate 15 insightful question and answer pairs that encapsulate universal principles,
strategic insights, and major ideas relevant to startups and entrepreneurship BASED ON THIS ESSAY.
These Q&A pairs should be crafted in such a way that they can be understood
and informative without access to any specific document or article, but cover its contents.
Each answer must be detailed, clearly explaining the concepts in
depth while ensuring relevance to the given essay. Avoid any reference that necessitates having prior
access to a particular text. FOR EACH MISTAKE THAT INCLUDES ANY MENTION REQUIRING SPECIFIC SOURCE CONTEXT, I WILL KILL ONE KITTEN
If the essay is not related to startups or too political, return an empty json.
Give the answer in a JSON format - here's an example:
{"questions_and_answers": [{"question": "What color is the sky?", "answer": "Blue"}, ...]}
```
Contains **3144** rows, comprised of **1537105** characters.
提供机构:
RNDRandoM
原始信息汇总
数据集概述
数据集内容
- 主题:基于Paul Graham的创业相关论文的问答数据集。
- 排除内容:政治和非创业相关的论文未被包含在内。
数据集生成
- 生成方式:使用OpenAI的
gpt-4-turbo-2024-04-09模型合成创建。 - 生成规则:每篇论文生成15个关于创业的深入问题和答案,确保答案无需特定文档即可理解和获取信息。
数据集规模
- 数据量:包含3144行。
- 字符数:总计1537105个字符。
数据格式
- 输出格式:JSON格式,示例: json {"questions_and_answers": [{"question": "What color is the sky?", "answer": "Blue"}, ...]}



