COMPSTAT 2024: Start Registration
View Submission - COMPSTAT2024
A0191
Title: Impact of rarity level and resampling techniques on machine learning classification performance Authors:  Olcay Alpay - Sinop University (Turkey) [presenting]
Abstract: The classification of binary events is frequently discussed in the literature. However, in cases where the distribution of the event of interest is unbalanced, such as with rare events, machine learning algorithms may produce biased results. The classification performance of several machine learning algorithms is investigated, including Logistic Regression, Support Vector Machine, Random Forest, Gradient Boosting, and Artificial Neural Network, in different rarity scenarios and how they are affected by unbalanced data. The impact of resampling techniques on dataset balancing and classification is also examined.