EcoSta 2024: Start Registration
View Submission - EcoSta2024
A0661
Title: Accounting for network noise in graph-guided Bayesian modeling of structured high-dimensional data Authors:  Wenrui Li - University of Pennsylvania (United States) [presenting]
Changgee Chang - Indiana University (United States)
Suprateek Kundu - The University of Texas at MD Anderson Cancer Center (United States)
Qi Long - University of Pennsylvania (United States)
Abstract: There is a growing body of literature on knowledge-guided statistical learning methods for analysis of structured high-dimensional data (such as genomic and transcriptomic data) that can incorporate knowledge of underlying networks derived from functional genomics and functional proteomics. These methods have been shown to improve variable selection and prediction accuracy and yield more interpretable results. However, these methods typically use graphs extracted from existing databases or rely on subject matter expertise, which are known to be incomplete and may contain false edges. To address this gap, a graph-guided Bayesian modeling framework is proposed to account for network noise in regression models involving structured high-dimensional predictors. Specifically, two sources of network information are used, including the noisy graph extracted from existing databases and the estimated graph from observed predictors in the dataset at hand, to inform the model for the true underlying network via a latent scale modeling framework. This model is coupled with the Bayesian regression model with structured high-dimensional predictors involving an adaptive structured shrinkage prior. An efficient Markov chain Monte Carlo algorithm is developed for posterior sampling. The advantages of the method are demonstrated over existing methods in simulations and through analyses of a genomics dataset and another proteomics dataset for Alzheimer's disease.