A platform for research: civil engineering, architecture and urbanism
Customized Instance Random Undersampling to Increase Knowledge Management for Multiclass Imbalanced Data Classification
Imbalanced data constitutes a challenge for knowledge management. This problem is even more complex in the presence of hybrid (numeric and categorical data) having missing values and multiple decision classes. Unfortunately, health-related information is often multiclass, hybrid, and imbalanced. This paper introduces a novel undersampling procedure that deals with multiclass hybrid data. We explore its impact on the performance of the recently proposed customized naïve associative classifier (CNAC). The experiments made, and the statistical analysis, show that the proposed method surpasses existing classifiers, with the advantage of being able to deal with multiclass, hybrid, and incomplete data with a low computational cost. In addition, our experiments showed that the CNAC benefits from data sampling; therefore, we recommend using the proposed undersampling procedure to balance data for CNAC.
Customized Instance Random Undersampling to Increase Knowledge Management for Multiclass Imbalanced Data Classification
Imbalanced data constitutes a challenge for knowledge management. This problem is even more complex in the presence of hybrid (numeric and categorical data) having missing values and multiple decision classes. Unfortunately, health-related information is often multiclass, hybrid, and imbalanced. This paper introduces a novel undersampling procedure that deals with multiclass hybrid data. We explore its impact on the performance of the recently proposed customized naïve associative classifier (CNAC). The experiments made, and the statistical analysis, show that the proposed method surpasses existing classifiers, with the advantage of being able to deal with multiclass, hybrid, and incomplete data with a low computational cost. In addition, our experiments showed that the CNAC benefits from data sampling; therefore, we recommend using the proposed undersampling procedure to balance data for CNAC.
Customized Instance Random Undersampling to Increase Knowledge Management for Multiclass Imbalanced Data Classification
Claudia C. Tusell-Rey (author) / Oscar Camacho-Nieto (author) / Cornelio Yáñez-Márquez (author) / Yenny Villuendas-Rey (author)
2022
Article (Journal)
Electronic Resource
Unknown
Metadata by DOAJ is licensed under CC BY-SA 1.0
DOAJ | 2024
|Keep it simple: random oversampling for imbalanced data
IEEE | 2023
|Improving activated sludge classification based on imbalanced data
British Library Online Contents | 2014
|