five

SkAndMl/CPTDS-3

收藏
Hugging Face2023-10-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/SkAndMl/CPTDS-3
下载链接
链接失效反馈
官方服务:
资源简介:
--- configs: - config_name: default data_files: - split: train path: data/train-* dataset_info: features: - name: question dtype: string - name: label dtype: class_label: names: '0': array '1': graph '2': string splits: - name: train num_bytes: 1836512 num_examples: 3012 download_size: 874048 dataset_size: 1836512 task_categories: - text-classification language: - en pretty_name: cptds-3 --- # Dataset Card for "CPTDS-3" 1. CPTDS-3 dataset is made up of coding problem questions from multiple coding websites. 2. The 'DS' in the name stands for data structures and the '3' indicates that the questions belong to 3 mutually exclusive categories 3. The dataset was prepared for the research work names [Stacking of Hyperparameter Tuned Models for Tagging Coding Problems](https://arxiv.org/abs/2306.10077#:~:text=In%20this%20work%2C%20we%20propose,models%20developed%20for%20this%20work.) ## Languages The dataset consists of questions only in English ## Dataset Structure ### Data Instances For each instance, there is a string for the question, a string for the class label. ``` {'question': 'Andrew love sea that s height summer season decide beach take sunbe sunbatheThe beach rectangular field n row m column some cell beach free road stone shop nonmovable object some adjacent cell sunbed locate horizontally verticallyAndrew hope sunbe that s bad luck long free place that s Andrew ask help find free place sunbe Andrews sunbe place adjacent cell if adjacent free cell order free place sunbe disturb tourist you follow action come sunbe cause p unit discomfort owner lift sunbe side rotate 90 degree one half sunbe remain cell half sunbe free cell at time way sunbe rotation Rotation sunbe 90 degree cell 1 2 come sunbe cause q unit discomfort owner shift sunbe long cell one half sunbe place free cell Shift sunbe cell right in moment sunbe occupie adjacent free cell you sunbe timehelp Andrew free space sunbe cause minimum possible number unit discomfort tourist detect impossible', 'label': 1} ``` The average token count for the articles and the highlights are provided below: | Feature | Mean Token Count | | ---------- | ---------------- | | Question | 94.02 | ### Data Fields - `question`: a string containing the question of the coding problem - `label` : a string containing the tag of the question ### Data Splits The CPTDS-3 dataset has just 1 split: _train_. Below is the statistics for the dataset. | Dataset Split | Number of Instances in Split | | ------------- | -------------------------------- | | Train | 3012 | [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
SkAndMl
原始信息汇总

数据集概述

数据集名称

CPTDS-3

数据集描述

CPTDS-3 数据集由来自多个编程网站的编程问题组成。名称中的 DS 代表数据结构,3 表示问题属于三个互斥类别。该数据集是为研究工作 Stacking of Hyperparameter Tuned Models for Tagging Coding Problems 准备的。

语言

数据集中的问题仅使用英语。

数据结构

数据实例

每个实例包含一个问题的字符串和一个类别标签的字符串。

示例: json { "question": "Andrew love sea that s height summer season decide beach take sunbe sunbatheThe beach rectangular field n row m column some cell beach free road stone shop nonmovable object some adjacent cell sunbed locate horizontally verticallyAndrew hope sunbe that s bad luck long free place that s Andrew ask help find free place sunbe Andrews sunbe place adjacent cell if adjacent free cell order free place sunbe disturb tourist you follow action come sunbe cause p unit discomfort owner lift sunbe side rotate 90 degree one half sunbe remain cell half sunbe free cell at time way sunbe rotation Rotation sunbe 90 degree cell 1 2 come sunbe cause q unit discomfort owner shift sunbe long cell one half sunbe place free cell Shift sunbe cell right in moment sunbe occupie adjacent free cell you sunbe timehelp Andrew free space sunbe cause minimum possible number unit discomfort tourist detect impossible", "label": 1 }

数据字段

  • question: 包含编程问题的字符串
  • label: 包含问题标签的字符串

数据分割

CPTDS-3 数据集只有一个分割:train。以下是数据集的统计信息:

数据集分割 实例数量
Train 3012

平均词元计数

特征 平均词元计数
Question 94.02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作