five

Result dataset for our experimental analysis on multi-cepstral projection representation strategies for dysphonia detection

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7897602
下载链接
链接失效反馈
官方服务:
资源简介:
Database containing the results of the analyzed versions of the framework proposed in our paper submitted to the journal Sensors (Basel) under the title "An experimental analysis on multi-cepstral projection representation strategies for dysphonia detection". In this database, we have the following information: "gender": Gender of individuals referring to the selected voice database. For this field, we have the following possible values: “male” for a selection of male individuals, “female” for a selection of female individuals, and “both” for a selection considering both genders. “Techniques”: Concerns about the techniques for extracting cepstral coefficients that we are analyzing. The identifier “nonceps” refers to the use of non-cepstral features. “vowel”: Vowel considered in the database selection. The following values are possible: “a”, “i” and “u”. “intonation”: Tone used by individuals when pronouncing the analyzed vowel. Possible values are: “h” for high; “l” for low; “n” is normal; and “lhl” for low-high-low. “coordinates”: Number of coordinates that make up the feature vector that represents the voice signal after the dimensionality reduction routines. “scale”: Normalization function used on the feature vector. The possible values of this field are the following: “MinMax” for the min-max scale; “Robust” for the robust scale; “Standard” for the standard scale; and “Unscaled” for the unscaled vector. “ACC”: Accuracy obtained by the analyzed version on the considered voice database clipping. “AUC”: Area under the ROC curve obtained by the analyzed version on the considered voice database clipping. “EER”: Equal Error Rate obtained by the analyzed version on the considered voice database clipping. “F1”: F1-score obtained by the analyzed version on the considered voice database clipping. “EH”: Rate of healthy voice signals classified as pathological on the considered voice database clipping. “EP”: Rate of pathological voice signals classified as healthy on the considered voice database clipping. “KFCV”: Average accuracy score of a 5-fold Cross Validation over the training dataset on the considered voice database clipping. “Balancing”: Indication of the use of balancing technique (SMOTE) by the considered framework version. “Classifier”: Classifier used, being possible the use of Random Forest (RF), Logistic Regression (LR), and Support Vector Machine (SVM). “Multi-Projection”: Multi-projection strategies employed by the evaluated technique. “Features”: Type of feature that defines the feature vector. In this case, the following values are possible in this field: “NonCeps” for non-cepstral features; “Ceps” for cepstral features only; and “Ceps and NonCeps” for features of cepstral and non-cepstral types. . It is worth noting that the symbol “-”, present in some fields, represents the “non-use” of any technique of the type indicated by the field. For example, in the case of the “Balancing” field, the value “-” means that no data balancing technique was used in the evaluated version of the framework.
创建时间:
2023-05-05
二维码
社区交流群
二维码
科研交流群
商业服务