nimaster/autonlp-data-devign_raw_test
收藏Hugging Face2022-03-17 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/nimaster/autonlp-data-devign_raw_test
下载链接
链接失效反馈官方服务:
资源简介:
---
languages:
- en
task_categories:
- text-classification
---
# AutoNLP Dataset for project: devign_raw_test
## Dataset Descritpion
This dataset has been automatically processed by AutoNLP for project devign_raw_test.
### Languages
The BCP-47 code for the dataset's language is en.
## Dataset Structure
### Data Instances
A sample from this dataset looks as follows:
```json
[
{
"text": "void ff_avg_h264_qpel16_mc32_msa ( uint8_t * dst , const uint8_t * src , ptrdiff_t stride ) { avc_lu[...]",
"target": 0
},
{
"text": "static void sd_cardchange ( void * opaque , bool load ) { SDState * sd = opaque ; qemu_set_irq ( sd [...]",
"target": 0
}
]
```
### Dataset Fields
The dataset has the following fields (also called "features"):
```json
{
"text": "Value(dtype='string', id=None)",
"target": "ClassLabel(num_classes=2, names=['0', '1'], id=None)"
}
```
### Dataset Splits
This dataset is split into a train and validation split. The split sizes are as follow:
| Split name | Num samples |
| ------------ | ------------------- |
| train | 21188 |
| valid | 5298 |
提供机构:
nimaster
原始信息汇总
AutoNLP Dataset Summary for devign_raw_test
Dataset Description
The dataset devign_raw_test has been automatically processed by AutoNLP. It is categorized under text-classification tasks and is primarily in English (BCP-47 code: en).
Dataset Structure
Data Instances
Each instance in the dataset contains:
text: A string field containing the main text content.target: A classification label with two possible values, 0 and 1.
Dataset Fields
The dataset includes two fields:
text: String data type.target: A classification label withnum_classes=2and class names[0, 1].
Dataset Splits
The dataset is divided into training and validation sets with the following sizes:
train: 21188 samples.valid: 5298 samples.



