A0339
Title: Zero-inflated quantile rank-score based test with application to scRNA-seq differential gene expression analysis
Authors: Wodan Ling - Fred Hutchinson Cancer Research Center (United States) [presenting]
Wenfei Zhang - Sarepta Therapeutics (United States)
Bin Cheng - Columbia University (United States)
Ying Wei - Columbia University (United States)
Abstract: Differential gene expression analysis based on scRNA-seq data is challenging due to two unique characteristics of scRNA-seq data. First, multimodality and other heterogeneity of the gene expression among different cell conditions lead to divergences in the tail events or crossings of the expression distributions. Second, scRNA-seq data generally have a considerable fraction of dropout events, causing zero inflation in the expression. To account for the first characteristic, existing parametric approaches targeting the mean difference in gene expression are limited, while quantile regression that examines various locations in the distribution will improve the power. However, the second characteristic, zero inflation, makes the traditional quantile regression invalid and underpowered. We propose a quantile-based test that handles multimodality and zero inflation simultaneously. The proposed quantile rank-score based test for differential distribution detection (ZIQRank) is derived under a two-part quantile regression model. It comprises a test in logistic modeling for the zero counts and a collection of rank-score tests adjusting for zero inflation at multiple prespecified quantiles of the positive part. The testing decision is based on the combined p-value of the marginal tests. ZIQRank is asymptotically justified and shown to improve and complement the existing approaches through extensive simulation and real data studies.