osouza/code_contests_pt
收藏Hugging Face2024-05-13 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/osouza/code_contests_pt
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: name
dtype: string
- name: description
dtype: string
- name: public_tests
sequence:
- name: input
dtype: string
- name: output
dtype: string
- name: private_tests
sequence:
- name: input
dtype: string
- name: output
dtype: string
- name: generated_tests
sequence:
- name: input
dtype: string
- name: output
dtype: string
- name: source
dtype:
class_label:
names:
'0': UNKNOWN_SOURCE
'1': CODECHEF
'2': CODEFORCES
'3': HACKEREARTH
'4': CODEJAM
'5': ATCODER
'6': AIZU
- name: difficulty
dtype:
class_label:
names:
'0': UNKNOWN_DIFFICULTY
'1': EASY
'2': MEDIUM
'3': HARD
'4': HARDER
'5': HARDEST
'6': EXTERNAL
'7': A
'8': B
'9': C
'10': D
'11': E
'12': F
'13': G
'14': H
'15': I
'16': J
'17': K
'18': L
'19': M
'20': N
'21': O
'22': P
'23': Q
'24': R
'25': S
'26': T
'27': U
'28': V
- name: solutions
sequence:
- name: language
dtype:
class_label:
names:
'0': UNKNOWN_LANGUAGE
'1': PYTHON
'2': CPP
'3': PYTHON3
'4': JAVA
- name: solution
dtype: string
- name: incorrect_solutions
sequence:
- name: language
dtype:
class_label:
names:
'0': UNKNOWN_LANGUAGE
'1': PYTHON
'2': CPP
'3': PYTHON3
'4': JAVA
- name: solution
dtype: string
- name: cf_contest_id
dtype: int64
- name: cf_index
dtype: string
- name: cf_points
dtype: float32
- name: cf_rating
dtype: int32
- name: cf_tags
sequence: string
- name: is_description_translated
dtype: bool
- name: untranslated_description
dtype: string
- name: time_limit
struct:
- name: seconds
dtype: int64
- name: nanos
dtype: int64
- name: memory_limit_bytes
dtype: int64
- name: input_file
dtype: string
- name: output_file
dtype: string
- name: description_pt
dtype: string
splits:
- name: train
num_bytes: 19073060554
num_examples: 13328
- name: test
num_bytes: 182608706
num_examples: 165
- name: valid
num_bytes: 167483300
num_examples: 117
download_size: 7638681386
dataset_size: 19423152560
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
- split: valid
path: data/valid-*
---
提供机构:
osouza
原始信息汇总
数据集概述
数据集特征
- name: 字符串类型
- description: 字符串类型
- public_tests: 序列类型,包含
- input: 字符串类型
- output: 字符串类型
- private_tests: 序列类型,包含
- input: 字符串类型
- output: 字符串类型
- generated_tests: 序列类型,包含
- input: 字符串类型
- output: 字符串类型
- source: 分类标签类型,包含
- UNKNOWN_SOURCE
- CODECHEF
- CODEFORCES
- HACKEREARTH
- CODEJAM
- ATCODER
- AIZU
- difficulty: 分类标签类型,包含
- UNKNOWN_DIFFICULTY
- EASY
- MEDIUM
- HARD
- HARDER
- HARDEST
- EXTERNAL
- A 至 V
- solutions: 序列类型,包含
- language: 分类标签类型,包含
- UNKNOWN_LANGUAGE
- PYTHON
- CPP
- PYTHON3
- JAVA
- solution: 字符串类型
- language: 分类标签类型,包含
- incorrect_solutions: 序列类型,包含
- language: 分类标签类型,包含
- UNKNOWN_LANGUAGE
- PYTHON
- CPP
- PYTHON3
- JAVA
- solution: 字符串类型
- language: 分类标签类型,包含
- cf_contest_id: 整数类型
- cf_index: 字符串类型
- cf_points: 浮点数类型
- cf_rating: 整数类型
- cf_tags: 序列类型,字符串类型
- is_description_translated: 布尔类型
- untranslated_description: 字符串类型
- time_limit: 结构类型,包含
- seconds: 整数类型
- nanos: 整数类型
- memory_limit_bytes: 整数类型
- input_file: 字符串类型
- output_file: 字符串类型
- description_pt: 字符串类型
数据集分割
- train: 13328个示例,19073060554字节
- test: 165个示例,182608706字节
- valid: 117个示例,167483300字节
数据集大小
- 下载大小: 7638681386字节
- 数据集大小: 19423152560字节



