CesarLeblanc/plantbert_text_classification_dataset
Source: Hugging Face; updated 2024-02-19; indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/CesarLeblanc/plantbert_text_classification_dataset
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: fold_0
    path: data/fold_0-*
  - split: fold_1
    path: data/fold_1-*
  - split: fold_2
    path: data/fold_2-*
  - split: fold_3
    path: data/fold_3-*
  - split: fold_4
    path: data/fold_4-*
  - split: fold_5
    path: data/fold_5-*
  - split: fold_6
    path: data/fold_6-*
  - split: fold_7
    path: data/fold_7-*
  - split: fold_8
    path: data/fold_8-*
  - split: fold_9
    path: data/fold_9-*
dataset_info:
  features:
  - name: label
    dtype:
      class_label:
        names:
          '0': MA211
          '1': MA221
          '2': MA222
          '3': MA223
          '4': MA224
          '5': MA225
          '6': MA232
          '7': MA241
          '8': MA251
          '9': MA252
          '10': MA253
          '11': N11
          '12': N12
          '13': N13
          '14': N14
          '15': N15
          '16': N16
          '17': N17
          '18': N18
          '19': N19
          '20': N1A
          '21': N1B
          '22': N1C
          '23': N1D
          '24': N1E
          '25': N1F
          '26': N1G
          '27': N1H
          '28': N1J
          '29': N21
          '30': N22
          '31': N31
          '32': N32
          '33': N33
          '34': N34
          '35': N35
          '36': Q11
          '37': Q12
          '38': Q21
          '39': Q22
          '40': Q23
          '41': Q24
          '42': Q25
          '43': Q41
          '44': Q42
          '45': Q43
          '46': Q44
          '47': Q45
          '48': Q46
          '49': Q51
          '50': Q52
          '51': Q53
          '52': Q54
          '53': R11
          '54': R12
          '55': R13
          '56': R14
          '57': R15
          '58': R16
          '59': R17
          '60': R18
          '61': R19
          '62': R1A
          '63': R1B
          '64': R1C
          '65': R1D
          '66': R1E
          '67': R1F
          '68': R1G
          '69': R1H
          '70': R1J
          '71': R1K
          '72': R1M
          '73': R1P
          '74': R1Q
          '75': R1R
          '76': R1S
          '77': R21
          '78': R22
          '79': R23
          '80': R24
          '81': R31
          '82': R32
          '83': R33
          '84': R34
          '85': R35
          '86': R36
          '87': R37
          '88': R41
          '89': R42
          '90': R43
          '91': R44
          '92': R45
          '93': R51
          '94': R52
          '95': R53
          '96': R54
          '97': R55
          '98': R56
          '99': R57
          '100': R61
          '101': R62
          '102': R63
          '103': R64
          '104': R65
          '105': S11
          '106': S12
          '107': S21
          '108': S22
          '109': S23
          '110': S24
          '111': S25
          '112': S26
          '113': S31
          '114': S32
          '115': S33
          '116': S34
          '117': S35
          '118': S36
          '119': S37
          '120': S38
          '121': S41
          '122': S42
          '123': S51
          '124': S52
          '125': S53
          '126': S54
          '127': S61
          '128': S62
          '129': S63
          '130': S64
          '131': S65
          '132': S66
          '133': S67
          '134': S68
          '135': S71
          '136': S72
          '137': S73
          '138': S74
          '139': S75
          '140': S76
          '141': S81
          '142': S82
          '143': S91
          '144': S92
          '145': S93
          '146': S94
          '147': T11
          '148': T12
          '149': T13
          '150': T14
          '151': T15
          '152': T16
          '153': T17
          '154': T18
          '155': T19
          '156': T1A
          '157': T1B
          '158': T1C
          '159': T1D
          '160': T1E
          '161': T1F
          '162': T1G
          '163': T1H
          '164': T21
          '165': T22
          '166': T23
          '167': T24
          '168': T25
          '169': T27
          '170': T28
          '171': T29
          '172': T31
          '173': T32
          '174': T33
          '175': T34
          '176': T35
          '177': T36
          '178': T37
          '179': T38
          '180': T39
          '181': T3A
          '182': T3B
          '183': T3C
          '184': T3D
          '185': T3E
          '186': T3F
          '187': T3G
          '188': T3H
          '189': T3J
          '190': T3K
          '191': T3M
          '192': U21
          '193': U22
          '194': U23
          '195': U24
          '196': U25
          '197': U26
          '198': U27
          '199': U28
          '200': U29
          '201': U2A
          '202': U32
          '203': U33
          '204': U34
          '205': U35
          '206': U36
          '207': U37
          '208': U38
          '209': U3A
          '210': U3B
          '211': U3C
          '212': U3D
          '213': U61
          '214': U62
          '215': V11
          '216': V12
          '217': V13
          '218': V14
          '219': V15
          '220': V32
          '221': V33
          '222': V34
          '223': V35
          '224': V37
          '225': V38
          '226': V39
  - name: text
    dtype: string
  splits:
  - name: fold_0
    num_bytes: 37135896
    num_examples: 85087
  - name: fold_1
    num_bytes: 36025033
    num_examples: 85076
  - name: fold_2
    num_bytes: 35613576
    num_examples: 85115
  - name: fold_3
    num_bytes: 36680348
    num_examples: 85067
  - name: fold_4
    num_bytes: 36877319
    num_examples: 85065
  - name: fold_5
    num_bytes: 36029591
    num_examples: 85081
  - name: fold_6
    num_bytes: 36277596
    num_examples: 85148
  - name: fold_7
    num_bytes: 36033390
    num_examples: 85082
  - name: fold_8
    num_bytes: 36234393
    num_examples: 85053
  - name: fold_9
    num_bytes: 35973208
    num_examples: 85159
  download_size: 98159513
  dataset_size: 362880350
---
# Dataset Card for "plantbert_text_classification_dataset"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
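
The configuration above exposes ten splits, `fold_0` through `fold_9`. The card does not say how they are meant to be used, but the naming suggests 10-fold cross-validation. The following is a minimal sketch under that assumption: each round holds out one fold and trains on the other nine. `cross_validation_plan` and `load_round` are hypothetical helper names; `load_dataset` and `concatenate_datasets` come from the Hugging Face `datasets` library, and the repository id is taken from the download link.

```python
from typing import Iterator, List, Sequence, Tuple

REPO_ID = "CesarLeblanc/plantbert_text_classification_dataset"
# Split names as declared in the card's default config.
FOLDS = tuple(f"fold_{i}" for i in range(10))


def cross_validation_plan(
    folds: Sequence[str] = FOLDS,
) -> Iterator[Tuple[List[str], str]]:
    """Yield (training folds, held-out fold) pairs, one per round."""
    for held_out in folds:
        yield [name for name in folds if name != held_out], held_out


def load_round(train_folds: Sequence[str], test_fold: str):
    """Load one round from the Hub (needs network access and `datasets`)."""
    # Imported lazily so the planning helper works without the library.
    from datasets import concatenate_datasets, load_dataset

    train = concatenate_datasets(
        [load_dataset(REPO_ID, split=name) for name in train_folds]
    )
    test = load_dataset(REPO_ID, split=test_fold)
    return train, test


if __name__ == "__main__":
    for train_folds, test_fold in cross_validation_plan():
        print(test_fold, "held out;", len(train_folds), "folds for training")
```

Calling `load_round(*next(cross_validation_plan()))` would fetch the first round, with `fold_0` held out. Whether the original authors evaluated this way is not stated on the card.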
Provider:
CesarLeblanc
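Summary of original information
Dataset overview
Dataset configuration
- Default configuration:
- Data file paths:
  - fold_0: data/fold_0-*
  - fold_1: data/fold_1-*
  - fold_2: data/fold_2-*
  - fold_3: data/fold_3-*
  - fold_4: data/fold_4-*
  - fold_5: data/fold_5-*
  - fold_6: data/fold_6-*
  - fold_7: data/fold_7-*
  - fold_8: data/fold_8-*
  - fold_9: data/fold_9-*
Dataset information
- Features:
label:
- Data type: class label
- Class names: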
- 0: MA211
- 1: MA221
- 2: MA222
- 3: MA223
- 4: MA224
- 5: MA225
- 6: MA232
- 7: MA241
- 8: MA251
- 9: MA252
- 10: MA253
- 11: N11
- 12: N12
- 13: N13
- 14: N14
- 15: N15
- 16: N16
- 17: N17
- 18: N18
- 19: N19
- 20: N1A
- 21: N1B
- 22: N1C
- 23: N1D
- 24: N1E
- 25: N1F
- 26: N1G
- 27: N1H
- 28: N1J
- 29: N21
- 30: N22
- 31: N31
- 32: N32
- 33: N33
- 34: N34
- 35: N35
- 36: Q11
- 37: Q12
- 38: Q21
- 39: Q22
- 40: Q23
- 41: Q24
- 42: Q25
- 43: Q41
- 44: Q42
- 45: Q43
- 46: Q44
- 47: Q45
- 48: Q46
- 49: Q51
- 50: Q52
- 51: Q53
- 52: Q54
- 53: R11
- 54: R12
- 55: R13
- 56: R14
- 57: R15
- 58: R16
- 59: R17
- 60: R18
- 61: R19
- 62: R1A
- 63: R1B
- 64: R1C
- 65: R1D
- 66: R1E
- 67: R1F
- 68: R1G
- 69: R1H
- 70: R1J
- 71: R1K
- 72: R1M
- 73: R1P
- 74: R1Q
- 75: R1R
- 76: R1S
- 77: R21
- 78: R22
- 79: R23
- 80: R24
- 81: R31
- 82: R32
- 83: R33
- 84: R34
- 85: R35
- 86: R36
- 87: R37
- 88: R41
- 89: R42
- 90: R43
- 91: R44
- 92: R45
- 93: R51
- 94: R52
- 95: R53
- 96: R54
- 97: R55
- 98: R56
- 99: R57
- 100: R61
- 101: R62
- 102: R63
- 103: R64
- 104: R65
- 105: S11
- 106: S12
- 107: S21
- 108: S22
- 109: S23
- 110: S24
- 111: S25
- 112: S26
- 113: S31
- 114: S32
- 115: S33
- 116: S34
- 117: S35
- 118: S36
- 119: S37
- 120: S38
- 121: S41
- 122: S42
- 123: S51
- 124: S52
- 125: S53
- 126: S54
- 127: S61
- 128: S62
- 129: S63
- 130: S64
- 131: S65
- 132: S66
- 133: S67
- 134: S68
- 135: S71
- 136: S72
- 137: S73
- 138: S74
- 139: S75
- 140: S76
- 141: S81
- 142: S82
- 143: S91
- 144: S92
- 145: S93
- 146: S94
- 147: T11
- 148: T12
- 149: T13
- 150: T14
- 151: T15
- 152: T16
- 153: T17
- 154: T18
- 155: T19
- 156: T1A
- 157: T1B
- 158: T1C
- 159: T1D
- 160: T1E
- 161: T1F
- 162: T1G
- 163: T1H
- 164: T21
- 165: T22
- 166: T23
- 167: T24
- 168: T25
- 169: T27
- 170: T28
- 171: T29
- 172: T31
- 173: T32
- 174: T33
- 175: T34
- 176: T35
- 177: T36
- 178: T37
- 179: T38
- 180: T39
- 181: T3A
- 182: T3B
- 183: T3C
- 184: T3D
- 185: T3E
- 186: T3F
- 187: T3G
- 188: T3H
- 189: T3J
- 190: T3K
- 191: T3M
- 192: U21
- 193: U22
- 194: U23
- 195: U24
- 196: U25
- 197: U26
- 198: U27
- 199: U28
- 200: U29
- 201: U2A
- 202: U32
- 203: U33
- 204: U34
- 205: U35
- 206: U36
- 207: U37
- 208: U38
- 209: U3A
- 210: U3B
- 211: U3C
- 212: U3D
- 213: U61
- 214: U62
- 215: V11
- 216: V12
- 217: V13
- 218: V14
- 219: V15
- 220: V32
- 221: V33
- 222: V34
- 223: V35
- 224: V37
- 225: V38
- 226: V39
text:
- Data type: string
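
The 227 class names listed above are sorted, so each leading prefix (`MA`, `N`, `Q`, `R`, `S`, `T`, `U`, `V`) occupies one contiguous range of label ids. As a small illustration, the sketch below recovers the prefix from an integer label using only the first id of each range, read off the list. The card does not document what the codes mean, so no interpretation is attached to them here, and `prefix_of` and `classes_per_prefix` are hypothetical helpers.

```python
from bisect import bisect_right

NUM_CLASSES = 227  # label ids run from 0 to 226

# (first label id, prefix) for each group, read off the class list above.
PREFIX_STARTS = [
    (0, "MA"), (11, "N"), (36, "Q"), (53, "R"),
    (105, "S"), (147, "T"), (192, "U"), (215, "V"),
]
_STARTS = [start for start, _ in PREFIX_STARTS]


def prefix_of(label_id: int) -> str:
    """Return the leading prefix of the class name behind a label id."""
    if not 0 <= label_id < NUM_CLASSES:
        raise ValueError(f"label id out of range: {label_id}")
    # bisect_right finds the last range whose first id is <= label_id.
    return PREFIX_STARTS[bisect_right(_STARTS, label_id) - 1][1]


def classes_per_prefix() -> dict:
    """Count how many classes fall under each prefix."""
    ends = _STARTS[1:] + [NUM_CLASSES]
    return {
        prefix: end - start
        for (start, prefix), end in zip(PREFIX_STARTS, ends)
    }


if __name__ == "__main__":
    print(prefix_of(10), prefix_of(11))
    print(classes_per_prefix())
```

For real work, the `ClassLabel` feature in the loaded dataset already carries the full id-to-name mapping; the lookup above only serves to show how the label space is laid out.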
Dataset splits
- Split information:
fold_0:
- Bytes: 37135896
- Examples: 85087
fold_1:
- Bytes: 36025033
- Examples: 85076
fold_2:
- Bytes: 35613576
- Examples: 85115
fold_3:
- Bytes: 36680348
- Examples: 85067
fold_4:
- Bytes: 36877319
- Examples: 85065
fold_5:
- Bytes: 36029591
- Examples: 85081
fold_6:
- Bytes: 36277596
- Examples: 85148
fold_7:
- Bytes: 36033390
- Examples: 85082
fold_8:
- Bytes: 36234393
- Examples: 85053
fold_9:
- Bytes: 35973208
- Examples: 85159
Dataset size
- Download size: 98159513 bytes
- Dataset size: 362880350 bytes
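
The per-fold figures above can be cross-checked against the reported dataset size with a few lines of arithmetic. The sketch below copies the byte and example counts from the split information; the variable names are illustrative only.

```python
# (num_bytes, num_examples) per fold, copied from the split information above.
SPLITS = {
    "fold_0": (37135896, 85087),
    "fold_1": (36025033, 85076),
    "fold_2": (35613576, 85115),
    "fold_3": (36680348, 85067),
    "fold_4": (36877319, 85065),
    "fold_5": (36029591, 85081),
    "fold_6": (36277596, 85148),
    "fold_7": (36033390, 85082),
    "fold_8": (36234393, 85053),
    "fold_9": (35973208, 85159),
}
REPORTED_DATASET_SIZE = 362880350  # bytes, as listed on the card

total_bytes = sum(num_bytes for num_bytes, _ in SPLITS.values())
total_examples = sum(num_examples for _, num_examples in SPLITS.values())
mean_bytes_per_example = total_bytes / total_examples

print("sizes consistent:", total_bytes == REPORTED_DATASET_SIZE)
print("total examples:", total_examples)
print("mean bytes per example:", round(mean_bytes_per_example, 1))
```

The fold byte counts sum to 362880350, which matches the reported dataset size, and the folds together hold 850933 examples. The folds are close to equal in size, each with between 85053 and 85159 examples.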



