Zhang, Xinjun, Lin, Hai, Zhao, Huiying, Hao, Yangyang, Mort, Matthew, Cooper, David ORCID: https://orcid.org/0000-0002-8943-8484, Zhou, Yaoqi and Liu, Yunlong 2014. Impact of human pathogenic micro-insertions and micro-deletions on post-transcriptional regulation. Human Molecular Genetics 23 (11) , pp. 3024-3034. 10.1093/hmg/ddu019 |
Abstract
Small insertions/deletions (INDELs) of ≤21 bp comprise 18% of all recorded mutations causing human inherited disease and are evident in 24% of documented Mendelian diseases. INDELs affect gene function in multiple ways: for example, by introducing premature stop codons that either lead to the production of truncated proteins or affect transcriptional efficiency. However, the means by which they impact post-transcriptional regulation, including alternative splicing, have not been fully evaluated. In this study, we collate disease-causing INDELs from the Human Gene Mutation Database (HGMD) and neutral INDELs from the 1000 Genomes Project. The potential of these two types of INDELs to affect binding-site affinity of RNA-binding proteins (RBPs) was then evaluated. We identified several sequence features that can distinguish disease-causing INDELs from neutral INDELs. Moreover, we built a machine-learning predictor called PinPor (predicting pathogenic small insertions and deletions affecting post-transcriptional regulation, http://watson.compbio.iupui.edu/pinpor/) to ascertain which newly observed INDELs are likely to be pathogenic. Our results show that disease-causing INDELs are more likely to ablate RBP-binding sites and tend to affect more RBP-binding sites than neutral INDELs. Additionally, disease-causing INDELs give rise to greater deviations in binding affinity than neutral INDELs. We also demonstrated that disease-causing INDELs may be distinguished from neutral INDELs by several sequence features, such as their proximity to splice sites and their potential effects on RNA secondary structure. This predictor showed satisfactory performance in identifying numerous pathogenic INDELs, with a Matthews correlation coefficient (MCC) value of 0.51 and an accuracy of 0.75.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Medicine |
Subjects: | C Auxiliary Sciences of History > CS Genealogy R Medicine > R Medicine (General) |
Publisher: | Oxford University Press |
ISSN: | 0964-6906 |
Date of Acceptance: | 13 January 2014 |
Last Modified: | 17 Nov 2022 10:51 |
URI: | https://orca.cardiff.ac.uk/id/eprint/79454 |
Citation Data
Cited 24 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |