Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Haploinsufficiency predictions without study bias

Steinberg, Julia, Honti, Frantisek, Meader, Stephen and Webber, Caleb ORCID: 2015. Haploinsufficiency predictions without study bias. Nucleic Acids Research 43 (15) , e101. 10.1093/nar/gkv474

[thumbnail of Licensed under CC BY 4,0] PDF (Licensed under CC BY 4,0) - Published Version
Download (347kB)


Any given human individual carries multiple genetic variants that disrupt protein-coding genes, through structural variation, as well as nucleotide variants and indels. Predicting the phenotypic consequences of a gene disruption remains a significant challenge. Current approaches employ information from a range of biological networks to predict which human genes are haploinsufficient (meaning two copies are required for normal function) or essential (meaning at least one copy is required for viability). Using recently available study gene sets, we show that these approaches are strongly biased towards providing accurate predictions for well-studied genes. By contrast, we derive a haploinsufficiency score from a combination of unbiased large-scale high-throughput datasets, including gene co-expression and genetic variation in over 6000 human exomes. Our approach provides a haploinsufficiency prediction for over twice as many genes currently unassociated with papers listed in Pubmed as three commonly-used approaches, and outperforms these approaches for predicting haploinsufficiency for less-studied genes. We also show that fine-tuning the predictor on a set of well-studied ‘gold standard’ haploinsufficient genes does not improve the prediction for less-studied genes. This new score can readily be used to prioritize gene disruptions resulting from any genetic variant, including copy number variants, indels and single-nucleotide variants.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Medicine
Publisher: Oxford University Press
ISSN: 0305-1048
Date of First Compliant Deposit: 22 October 2020
Date of Acceptance: 29 April 2015
Last Modified: 04 May 2023 23:55

Citation Data

Cited 34 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item


Downloads per month over past year

View more statistics