vic35get/nhtsa_complaints_dataset
收藏Hugging Face2025-02-02 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/vic35get/nhtsa_complaints_dataset
下载链接
链接失效反馈官方服务:
资源简介:
NHTSA投诉数据集包含美国国家公路交通安全管理局(NHTSA)在2014至2024年间收集的车辆投诉信息。该数据集旨在用于文本分类任务,特别是对车辆组件投诉的分类。数据集由训练集、验证集和测试集组成,每个集合都包含不同数量的样本。数据集的列包括投诉编号、投诉日期、相关组件类别、投诉摘要和标签。数据集已经过清洗和预处理,包括数据过滤、日期转换、去除重复和缺失数据、类平衡以及文本清洗。
The NHTSA Complaints Dataset contains vehicle complaint information collected by the National Highway Traffic Safety Administration (NHTSA) between 2014 and 2024. The dataset is intended for text classification tasks, specifically for classifying vehicle component complaints. The dataset consists of a training set, validation set, and test set, each containing a different number of samples. The columns of the dataset include complaint number, date of complaint filing, related component category, summary of complaint, and label. The dataset has undergone cleaning and preprocessing, including data filtering, date conversion, removal of duplicates and missing data, class balancing, and text cleaning.
提供机构:
vic35get



