bdice/rapids-codegen
收藏Hugging Face2023-11-27 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/bdice/rapids-codegen
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: repo_id
dtype: string
- name: file_path
dtype: string
- name: content
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 378315950
num_examples: 16827
download_size: 151107014
dataset_size: 378315950
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
# Dataset Card for "rapids-codegen"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
The dataset includes four features: repo_id (repository ID), file_path (file path), content (content), and __index_level_0__ (index level). The dataset is split into a training set (train) with 16827 examples, totaling 378315950 bytes. The download size of the dataset is 151107014 bytes, and the total dataset size is 378315950 bytes. The dataset configuration is default, with the training data file path being data/train-*.
提供机构:
bdice
原始信息汇总
数据集概述
数据集信息
-
特征列表:
repo_id: 字符串类型file_path: 字符串类型content: 字符串类型__index_level_0__: 整数类型
-
数据分割:
train: 包含378,315,950字节,16,827个样本
-
数据大小:
- 下载大小: 151,107,014字节
- 数据集大小: 378,315,950字节
配置信息
- 默认配置:
- 数据文件路径:
data/train-*
- 数据文件路径:



