CMStatistics 2019: Start Registration
View Submission - CMStatistics
B1964
Title: Semi-orthogonal non-negative matrix factorization with an application in text mining Authors:  Yutong Li - University of Illinois at Urbana-Champaign (United States) [presenting]
Annie Qu - University of Illinois at Urbana-Champaign (United States)
Ruoqing Zhu - University of Illinois at Urbana-Champaign (United States)
Abstract: Emergency Department (ED) crowding is a worldwide issue that affects the efficiency of hospital management and the quality of patient care. This occurs when the request for an admission ward-bed to receive a patient is delayed until an admission decision is made by a doctor. To reduce the overcrowding and waiting time of ED, we build a classifier to predict the disposition of patients using manually-typed nurse notes collected during triage. However, these triage notes involve high dimensional, noisy, and also sparse text data which makes model fitting and interpretation difficult. To address this issue, we propose the semi-orthogonal non-negative matrix factorization (SONMF) for both continuous and binary design matrices to first bi-cluster the patients and words into a reduced number of topics. The subjects can then be interpreted as a non-subtractive linear combination of orthogonal basis topic vectors. These generated topic vectors provide the hospital with a direct understanding of the cause of admission. We show that by using a transformation of basis, the classification accuracy can be further increased compared to the conventional bag-of-words model and alternative matrix factorization approaches.