TY - JOUR
T1 - Biological relevance of computationally predicted pathogenicity of noncoding variants
AU - Liu, Li
AU - Sanderford, Maxwell D.
AU - Patel, Ravi
AU - Chandrashekar, Pramod
AU - Gibson, Greg
N1 - Publisher Copyright: © 2019, The Author(s).
PY - 2019/12/1
Y1 - 2019/12/1
AB - Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists.
UR - http://www.scopus.com/inward/record.url?scp=85060173750&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85060173750&partnerID=8YFLogxK
U2 - 10.1038/s41467-018-08270-y
DO - 10.1038/s41467-018-08270-y
M3 - Article
C2 - 30659175
SN - 2041-1723
VL - 10
JO - Nature Communications
JF - Nature Communications
IS - 1
M1 - 330
ER -