rootacess/pie-perf
收藏Hugging Face2023-04-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/rootacess/pie-perf
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: user_id
dtype: string
- name: problem_id
dtype: string
- name: language
dtype: string
- name: submission_id_v0
dtype: string
- name: submission_id_v1
dtype: string
- name: cpu_time_v0
dtype: int64
- name: cpu_time_v1
dtype: int64
- name: memory_v0
dtype: int64
- name: memory_v1
dtype: int64
- name: status_v0
dtype: string
- name: status_v1
dtype: string
- name: improvement_frac
dtype: float64
- name: input
dtype: string
- name: target
dtype: string
- name: code_v0_loc
dtype: int64
- name: code_v1_loc
dtype: int64
- name: code_v0_num_chars
dtype: int64
- name: code_v1_num_chars
dtype: int64
- name: code_v0_no_empty_lines
dtype: string
- name: code_v1_no_empty_lines
dtype: string
- name: code_same
dtype: bool
- name: relative_loc_diff_percent
dtype: float64
- name: diff
sequence: string
- name: diff_only_import_comment
dtype: bool
- name: measured_runtime_v0
dtype: float64
- name: measured_runtime_v1
dtype: float64
- name: runtime_lift
dtype: float64
- name: key
sequence: string
splits:
- name: train
num_bytes: 110329743
num_examples: 36857
- name: val
num_bytes: 5942994
num_examples: 1940
- name: test
num_bytes: 2714513
num_examples: 1000
- name: codegen_1shot_test
num_bytes: 3003513
num_examples: 1000
download_size: 56295756
dataset_size: 121990763
---
# Dataset Card for "pie-perf"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
rootacess
原始信息汇总
数据集概述
数据集名称
"pie-perf"
数据特征
- user_id: 字符串类型
- problem_id: 字符串类型
- language: 字符串类型
- submission_id_v0: 字符串类型
- submission_id_v1: 字符串类型
- cpu_time_v0: 整数类型
- cpu_time_v1: 整数类型
- memory_v0: 整数类型
- memory_v1: 整数类型
- status_v0: 字符串类型
- status_v1: 字符串类型
- improvement_frac: 浮点数类型
- input: 字符串类型
- target: 字符串类型
- code_v0_loc: 整数类型
- code_v1_loc: 整数类型
- code_v0_num_chars: 整数类型
- code_v1_num_chars: 整数类型
- code_v0_no_empty_lines: 字符串类型
- code_v1_no_empty_lines: 字符串类型
- code_same: 布尔类型
- relative_loc_diff_percent: 浮点数类型
- diff: 字符串序列类型
- diff_only_import_comment: 布尔类型
- measured_runtime_v0: 浮点数类型
- measured_runtime_v1: 浮点数类型
- runtime_lift: 浮点数类型
- key: 字符串序列类型
数据集分割
- train: 36857个样本,大小为110329743字节
- val: 1940个样本,大小为5942994字节
- test: 1000个样本,大小为2714513字节
- codegen_1shot_test: 1000个样本,大小为3003513字节
数据集大小
- 下载大小: 56295756字节
- 数据集总大小: 121990763字节



