P1ayer-1/stack-exchange-preferences-code-v2
收藏Hugging Face2023-06-07 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/P1ayer-1/stack-exchange-preferences-code-v2
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: qid
dtype: int64
- name: question
dtype: string
- name: answers
list:
- name: answer_id
dtype: int64
- name: author
dtype: string
- name: author_id
dtype: int64
- name: author_profile
dtype: string
- name: pm_score
dtype: int64
- name: selected
dtype: bool
- name: text
dtype: string
- name: date
dtype: string
- name: metadata
sequence: string
splits:
- name: Stackoverflow.com
num_bytes: 16527798152
num_examples: 7365699
- name: ai.stackexchange.com
num_bytes: 613928
num_examples: 379
- name: arduino.stackexchange.com
num_bytes: 13062539
num_examples: 4995
- name: askubuntu.com
num_bytes: 145519144
num_examples: 90833
- name: bioinformatics.stackexchange.com
num_bytes: 2463986
num_examples: 1117
- name: codegolf.stackexchange.com
num_bytes: 90341915
num_examples: 11914
- name: codereview.stackexchange.com
num_bytes: 154014482
num_examples: 30853
- name: computergraphics.stackexchange.com
num_bytes: 641463
num_examples: 291
- name: cs.stackexchange.com
num_bytes: 4425422
num_examples: 2796
- name: cseducators.stackexchange.com
num_bytes: 717869
num_examples: 386
- name: cstheory.stackexchange.com
num_bytes: 648274
num_examples: 382
- name: datascience.stackexchange.com
num_bytes: 7320028
num_examples: 3929
- name: dba.stackexchange.com
num_bytes: 59618569
num_examples: 25712
- name: devops.stackexchange.com
num_bytes: 1667039
num_examples: 972
- name: drupal.stackexchange.com
num_bytes: 32743998
num_examples: 19325
- name: dsp.stackexchange.com
num_bytes: 4572839
num_examples: 2282
- name: emacs.stackexchange.com
num_bytes: 9574939
num_examples: 6138
- name: elementaryos.stackexchange.com
num_bytes: 2073881
num_examples: 1601
- name: ethereum.stackexchange.com
num_bytes: 15163266
num_examples: 8235
- name: gamedev.stackexchange.com
num_bytes: 23440255
num_examples: 10565
- name: gis.stackexchange.com
num_bytes: 47994011
num_examples: 23390
- name: magento.stackexchange.com
num_bytes: 67498065
num_examples: 28969
- name: math.stackexchange.com
num_bytes: 27912632
num_examples: 16773
- name: mathematica.stackexchange.com
num_bytes: 62273649
num_examples: 29947
- name: meta.askubuntu.com
num_bytes: 765182
num_examples: 494
- name: meta.serverfault.com
num_bytes: 432170
num_examples: 260
- name: meta.stackoverflow.com
num_bytes: 8353413
num_examples: 3454
- name: meta.superuser.com
num_bytes: 457731
num_examples: 262
- name: networkengineering.stackexchange.com
num_bytes: 5378499
num_examples: 2624
- name: opendata.stackexchange.com
num_bytes: 601530
num_examples: 451
- name: opensource.stackexchange.com
num_bytes: 473900
num_examples: 306
- name: or.stackexchange.com
num_bytes: 741848
num_examples: 291
- name: quantumcomputing.stackexchange.com
num_bytes: 1027544
num_examples: 607
- name: raspberrypi.stackexchange.com
num_bytes: 14094960
num_examples: 7854
- name: retrocomputing.stackexchange.com
num_bytes: 2504144
num_examples: 1400
- name: reverseengineering.stackexchange.com
num_bytes: 3691408
num_examples: 1736
- name: robotics.stackexchange.com
num_bytes: 960069
num_examples: 448
- name: rus.stackexchange.com
num_bytes: 588180
num_examples: 471
- name: salesforce.stackexchange.com
num_bytes: 52133354
num_examples: 23521
- name: scicomp.stackexchange.com
num_bytes: 2465183
num_examples: 1090
- name: serverfault.com
num_bytes: 130901651
num_examples: 71060
- name: sharepoint.stackexchange.com
num_bytes: 32663707
num_examples: 17250
- name: sitecore.stackexchange.com
num_bytes: 5648428
num_examples: 2646
- name: softwareengineering.stackexchange.com
num_bytes: 41562683
num_examples: 20664
- name: softwarerecs.stackexchange.com
num_bytes: 2858851
num_examples: 1937
- name: stackapps.com
num_bytes: 860269
num_examples: 282
- name: stats.stackexchange.com
num_bytes: 28418367
num_examples: 14404
- name: superuser.com
num_bytes: 125405835
num_examples: 92628
- name: tex.stackexchange.com
num_bytes: 165178808
num_examples: 69895
- name: unix.stackexchange.com
num_bytes: 128049310
num_examples: 76183
- name: vi.stackexchange.com
num_bytes: 5271702
num_examples: 3792
- name: webapps.stackexchange.com
num_bytes: 5888224
num_examples: 4882
- name: webmasters.stackexchange.com
num_bytes: 8794039
num_examples: 6647
- name: wordpress.stackexchange.com
num_bytes: 54736694
num_examples: 26821
download_size: 7771040557
dataset_size: 18133008028
---
# Dataset Card for "stack-exchange-preferences-code-v2"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
P1ayer-1
原始信息汇总
数据集概述
数据集特征
- qid: 整数类型 (int64)
- question: 字符串类型 (string)
- answers: 列表类型,包含以下字段:
- answer_id: 整数类型 (int64)
- author: 字符串类型 (string)
- author_id: 整数类型 (int64)
- author_profile: 字符串类型 (string)
- pm_score: 整数类型 (int64)
- selected: 布尔类型 (bool)
- text: 字符串类型 (string)
- date: 字符串类型 (string)
- metadata: 序列类型,字符串 (sequence: string)
数据集分割
- Stackoverflow.com: 16527798152 字节, 7365699 示例
- ai.stackexchange.com: 613928 字节, 379 示例
- arduino.stackexchange.com: 13062539 字节, 4995 示例
- askubuntu.com: 145519144 字节, 90833 示例
- bioinformatics.stackexchange.com: 2463986 字节, 1117 示例
- codegolf.stackexchange.com: 90341915 字节, 11914 示例
- codereview.stackexchange.com: 154014482 字节, 30853 示例
- computergraphics.stackexchange.com: 641463 字节, 291 示例
- cs.stackexchange.com: 4425422 字节, 2796 示例
- cseducators.stackexchange.com: 717869 字节, 386 示例
- cstheory.stackexchange.com: 648274 字节, 382 示例
- datascience.stackexchange.com: 7320028 字节, 3929 示例
- dba.stackexchange.com: 59618569 字节, 25712 示例
- devops.stackexchange.com: 1667039 字节, 972 示例
- drupal.stackexchange.com: 32743998 字节, 19325 示例
- dsp.stackexchange.com: 4572839 字节, 2282 示例
- emacs.stackexchange.com: 9574939 字节, 6138 示例
- elementaryos.stackexchange.com: 2073881 字节, 1601 示例
- ethereum.stackexchange.com: 15163266 字节, 8235 示例
- gamedev.stackexchange.com: 23440255 字节, 10565 示例
- gis.stackexchange.com: 47994011 字节, 23390 示例
- magento.stackexchange.com: 67498065 字节, 28969 示例
- math.stackexchange.com: 27912632 字节, 16773 示例
- mathematica.stackexchange.com: 62273649 字节, 29947 示例
- meta.askubuntu.com: 765182 字节, 494 示例
- meta.serverfault.com: 432170 字节, 260 示例
- meta.stackoverflow.com: 8353413 字节, 3454 示例
- meta.superuser.com: 457731 字节, 262 示例
- networkengineering.stackexchange.com: 5378499 字节, 2624 示例
- opendata.stackexchange.com: 601530 字节, 451 示例
- opensource.stackexchange.com: 473900 字节, 306 示例
- or.stackexchange.com: 741848 字节, 291 示例
- quantumcomputing.stackexchange.com: 1027544 字节, 607 示例
- raspberrypi.stackexchange.com: 14094960 字节, 7854 示例
- retrocomputing.stackexchange.com: 2504144 字节, 1400 示例
- reverseengineering.stackexchange.com: 3691408 字节, 1736 示例
- robotics.stackexchange.com: 960069 字节, 448 示例
- rus.stackexchange.com: 588180 字节, 471 示例
- salesforce.stackexchange.com: 52133354 字节, 23521 示例
- scicomp.stackexchange.com: 2465183 字节, 1090 示例
- serverfault.com: 130901651 字节, 71060 示例
- sharepoint.stackexchange.com: 32663707 字节, 17250 示例
- sitecore.stackexchange.com: 5648428 字节, 2646 示例
- softwareengineering.stackexchange.com: 41562683 字节, 20664 示例
- softwarerecs.stackexchange.com: 2858851 字节, 1937 示例
- stackapps.com: 860269 字节, 282 示例
- stats.stackexchange.com: 28418367 字节, 14404 示例
- superuser.com: 125405835 字节, 92628 示例
- tex.stackexchange.com: 165178808 字节, 69895 示例
- unix.stackexchange.com: 128049310 字节, 76183 示例
- vi.stackexchange.com: 5271702 字节, 3792 示例
- webapps.stackexchange.com: 5888224 字节, 4882 示例
- webmasters.stackexchange.com: 8794039 字节, 6647 示例
- wordpress.stackexchange.com: 54736694 字节, 26821 示例
数据集大小
- 下载大小: 7771040557 字节
- 数据集大小: 18133008028 字节



