five

nimaster/autonlp-data-devign_raw_test

收藏
Hugging Face2022-03-17 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/nimaster/autonlp-data-devign_raw_test
下载链接
链接失效反馈
官方服务:
资源简介:
--- languages: - en task_categories: - text-classification --- # AutoNLP Dataset for project: devign_raw_test ## Dataset Descritpion This dataset has been automatically processed by AutoNLP for project devign_raw_test. ### Languages The BCP-47 code for the dataset's language is en. ## Dataset Structure ### Data Instances A sample from this dataset looks as follows: ```json [ { "text": "void ff_avg_h264_qpel16_mc32_msa ( uint8_t * dst , const uint8_t * src , ptrdiff_t stride ) { avc_lu[...]", "target": 0 }, { "text": "static void sd_cardchange ( void * opaque , bool load ) { SDState * sd = opaque ; qemu_set_irq ( sd [...]", "target": 0 } ] ``` ### Dataset Fields The dataset has the following fields (also called "features"): ```json { "text": "Value(dtype='string', id=None)", "target": "ClassLabel(num_classes=2, names=['0', '1'], id=None)" } ``` ### Dataset Splits This dataset is split into a train and validation split. The split sizes are as follow: | Split name | Num samples | | ------------ | ------------------- | | train | 21188 | | valid | 5298 |
提供机构:
nimaster
原始信息汇总

AutoNLP Dataset Summary for devign_raw_test

Dataset Description

The dataset devign_raw_test has been automatically processed by AutoNLP. It is categorized under text-classification tasks and is primarily in English (BCP-47 code: en).

Dataset Structure

Data Instances

Each instance in the dataset contains:

  • text: A string field containing the main text content.
  • target: A classification label with two possible values, 0 and 1.

Dataset Fields

The dataset includes two fields:

  • text: String data type.
  • target: A classification label with num_classes=2 and class names [0, 1].

Dataset Splits

The dataset is divided into training and validation sets with the following sizes:

  • train: 21188 samples.
  • valid: 5298 samples.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作