Sirawipa/hosxp_usage_code2
收藏Hugging Face2024-07-02 更新2024-07-06 收录
下载链接:
https://hf-mirror.com/datasets/Sirawipa/hosxp_usage_code2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本和标签两个特征,标签是一个分类标签,具有多个类别名称。数据集被分为训练集、测试集和验证集,训练集包含3114个示例,测试集包含1038个示例,验证集也包含1038个示例。整个数据集的下载大小为44750字节,数据集总大小为257490字节。
This dataset is primarily used for text classification tasks, containing text data and corresponding classification labels. The text data is stored as strings, and the labels are multi-class classifications with multiple predefined categories. The dataset is divided into training, test, and validation sets, each with specified sizes and number of examples.
提供机构:
Sirawipa
原始信息汇总
数据集概述
特征
- text: 数据类型为字符串。
- label: 数据类型为类别标签,包含以下类别:
- 171: X9
- 34: AAA
- 4: 810
- 63: EE
- 65: EN
- 3: 20
- 100: MI
- 51: DI
- 147: SI
- 87: IMW
- 89: ISV
- 1: 12
- 156: SWC
- 154: SUB
- 2: 15
- 157: TAK
- 158: TIA
- 98: KP
- 167: X5
- 166: X4
- 97: KM
- 54: DS
- 71: EY
- 130: RN
- 55: DS1
- 159: TN
- 43: BP
- 44: BP3
- 45: BP4
- 46: BP5
- 47: BPS
- 124: PK
- 125: PKB
- 126: PKL
- 123: PEE
- 53: DP
- 128: PS
- 127: PM
- 149: SL
- 0: 0
- 148: SK
- 152: SM
- 146: SF
- 129: PTL
- 144: SB
- 153: ST
- 155: SW
- 79: GP1
- 80: GP2
- 78: GP
- 75: GB
- 77: GL
- 76: GK
- 70: ER
- 106: OR
- 122: OS
- 104: OB
- 105: OB3
- 101: MM
- 99: MF
- 172: Z
- 164: X2
- 168: X6
- 165: X3
- 160: X1
- 170: X8
- 85: IM
- 81: IA
- 83: IK
- 84: IL
- 88: IP
- 90: IT
- 95: IVT
- 91: IV
- 92: IVD
- 93: IVF
- 94: IVS
- 108: OR10
- 121: ORC
- 119: OR8
- 118: OR7
- 113: OR2
- 17: A20
- 18: A21
- 52: DL
- 82: ID
- 145: SC
- 96: IW
- 163: X12
- 42: AV6
- 9: A13
- 23: A26
- 103: NC1
- 38: AV2
- 15: A19
- 117: OR6
- 116: OR5
- 5: A1
- 29: A5
- 21: A24
- 27: A3
- 14: A18
- 22: A25
- 143: S92
- 16: A2
- 10: A14
- 8: A12
- 33: A9
- 32: A8
- 6: A10
- 31: A7
- 11: A15
- 7: A11
- 28: A4
- 30: A6
- 161: X10
- 69: EOR
- 68: EOP
- 67: EOL
- 66: EOB
- 13: A17
- 12: A16
- 36: AT2
- 35: AT1
- 41: AV5
- 86: IMP
- 132: S10
- 141: S8
- 136: S3
- 135: S2
- 142: S9
- 133: S11
- 138: S5
- 137: S4
- 131: S1
- 134: S12
- 19: A22
- 26: A29
- 25: A28
- 24: A27
- 107: OR1
- 120: OR9
- 169: X7
- 102: NC
- 62: ECR
- 61: ECL
- 60: ECB
- 20: A23
- 37: AV1
- 40: AV4
- 39: AV3
- 139: S6
- 140: S7
- 162: X11
- 48: D1
- 151: SLD
- 49: D2
- 50: D3
- 74: EYR
- 64: EEP
- 73: EYL
- 72: EYB
- 59: EAR
- 58: EAP
- 57: EAL
- 56: EAB
- 115: OR4
- 112: OR14
- 109: OR11
- 110: OR12
- 150: SL1
- 114: OR3
- 111: OR13
数据集划分
- train: 包含3114个样本,占用154796字节。
- test: 包含1038个样本,占用52166字节。
- validation: 包含1038个样本,占用50528字节。
数据集大小
- 下载大小: 44750字节
- 数据集总大小: 257490字节
配置
- config_name: default
- data_files:
- train: data/train-*
- test: data/test-*
- validation: data/validation-*
- data_files:



