so_stacksample
收藏huggingface.co2025-03-22 收录
下载链接:
https://huggingface.co/datasets/community-datasets/so_stacksample
下载链接
链接失效反馈官方服务:
资源简介:
Dataset with the text of 10% of questions and answers from the Stack Overflow programming Q&A website.
This is organized as three tables:
Questions contains the title, body, creation date, closed date (if applicable), score, and owner ID for all non-deleted Stack Overflow questions whose Id is a multiple of 10.
Answers contains the body, creation date, score, and owner ID for each of the answers to these questions. The ParentId column links back to the Questions table.
Tags contains the tags on each of these questions.
本数据集收录了 Stack Overflow 编程问答网站中 10% 的问题与答案文本。数据组织分为三张表格:
- 问题表包含标题、正文、创建日期、关闭日期(如适用)、评分和所有者 ID,涉及所有非删除状态且 ID 为 10 的倍数的 Stack Overflow 问题。
- 答案表包含每个问题的正文、创建日期、评分和所有者 ID,其中 ParentId 列用于指向问题表。
- 标签表包含上述每个问题的标签。
提供机构:
Community Datasets



