View Submission - EcoSta2019
A0756
Title: Classification strategies for time-constrained cost-sensitive decision tree induction with missing values
Authors: Yong-Shiuan Lee - National Chengchi University (Taiwan) [presenting]
Tsung-Chi Cheng - National Chengchi University (Taiwan)
Abstract: The induction of cost-sensitive decision trees is one of the extensively investigated issues in the study of classification. Among these studies, a recently developed algorithm generates a time-constrained minimal-cost tree and is the first to build a cost-sensitive tree within a time limit. Experiments reported for that algorithm show highly satisfactory performance. However, real data often contain missing values. We therefore extend time-constrained cost-sensitive tree induction to handle missing values as well, employing two methods to deal with incomplete data. The first is the active feature acquisition (AFA) approach, and the second is model-based imputation. Through AFA, we acquire the true feature values for the missing data at a cost that must be taken into account in the tree-induction process. Imputing the missing values from the available data is a more statistical strategy that may require little cost and time, but it can lead to misclassification. The proposed strategies incorporate AFA and imputation into time-constrained cost-sensitive tree induction for different scenarios. We conduct a simulation study and a real-world data analysis to examine the performance of the proposed algorithm.
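The trade-off described in the abstract, between paying to acquire a true value and accepting the misclassification risk of an imputed one, can be illustrated with a minimal sketch. This is not the authors' algorithm: the function name, all parameters, and the simple expected-cost rule are hypothetical, and imputation is assumed here to cost no money and no time.

```python
def choose_strategy(acq_cost, acq_time, time_left, p_impute_wrong, misclass_cost):
    """Pick "acquire" (AFA) or "impute" for one missing feature value.

    All inputs are hypothetical illustration values:
      acq_cost        cost of acquiring the true value
      acq_time        time needed to acquire it
      time_left       remaining time budget
      p_impute_wrong  assumed probability that imputing causes a misclassification
      misclass_cost   cost of one misclassification
    """
    # Acquisition is infeasible if it would exceed the time limit.
    if acq_time > time_left:
        return "impute"
    # Expected cost of imputing: misclassification risk times its cost
    # (imputation itself is assumed free in this sketch).
    expected_impute_cost = p_impute_wrong * misclass_cost
    return "acquire" if acq_cost < expected_impute_cost else "impute"


if __name__ == "__main__":
    print(choose_strategy(5.0, 1.0, 10.0, 0.2, 100.0))
    print(choose_strategy(5.0, 20.0, 10.0, 0.2, 100.0))
```

In this sketch the time limit acts as a hard constraint and the costs are compared only among the feasible options; a real induction procedure would make such choices while growing the tree rather than one value at a time.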