trelent/the-stack-dedup-python-docstrings-1.0-percent-unified
收藏Hugging Face2023-02-26 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/trelent/the-stack-dedup-python-docstrings-1.0-percent-unified
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: body_hash
dtype: string
- name: body
dtype: string
- name: docstring
dtype: string
- name: path
dtype: string
- name: name
dtype: string
- name: repository_name
dtype: string
- name: repository_stars
dtype: float64
- name: lang
dtype: string
- name: body_without_docstring
dtype: string
- name: unified
dtype: string
splits:
- name: train
num_bytes: 680876286
num_examples: 237074
download_size: 247316903
dataset_size: 680876286
---
# Dataset Card for "the-stack-dedup-python-docstrings-1.0-percent-unified"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
trelent
原始信息汇总
数据集概述
数据集名称
"the-stack-dedup-python-docstrings-1.0-percent-unified"
数据集特征
- body_hash: 字符串类型
- body: 字符串类型
- docstring: 字符串类型
- path: 字符串类型
- name: 字符串类型
- repository_name: 字符串类型
- repository_stars: 浮点数类型
- lang: 字符串类型
- body_without_docstring: 字符串类型
- unified: 字符串类型
数据集分割
- train:
- 字节数: 680876286
- 示例数: 237074
数据集大小
- 下载大小: 247316903字节
- 数据集大小: 680876286字节



