Qi, Mengling, Stenson, Peter D., Ball, Edward V., Tainer, John A., Bacolla, Albino, Kehrer-Sawatzki, Hildegard, Cooper, David N. ORCID: https://orcid.org/0000-0002-8943-8484 and Zhao, Huiying 2022. Distinct sequence features underlie microdeletions and gross deletions in the human genome. Human Mutation 43 (3) , pp. 328-346. 10.1002/humu.24314 |
Preview |
PDF
- Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives. Download (3MB) | Preview |
Abstract
Microdeletions and gross deletions are important causes (~20%) of human inherited disease and their genomic locations are strongly influenced by the local DNA sequence environment. This notwithstanding, no study has systematically examined their underlying generative mechanisms. Here, we obtained 42,098 pathogenic microdeletions and gross deletions from the Human Gene Mutation Database (HGMD) that together form a continuum of germline deletions ranging in size from 1bp to 28,394,429bp. We analyzed the DNA sequence within 1-kb of the breakpoint junctions and found that the frequencies of non-B DNA-forming repeats, GC-content, and the presence of seven of 78 specific sequence motifs in the vicinity of pathogenic deletions correlated with deletion length for deletions of length ≤30 bp. Further, we found that the presence of DR, GQ and STR repeats is important for the formation of longer deletions (>30 bp) but not for the formation of shorter deletions (≤30 bp) whilst significantly (Chi-square test P-value<2E-16) more microhomologies were identified flanking short deletions than long deletions (length >30 bp). We provide evidence to support a functional distinction between microdeletions and gross deletions. Finally, we propose that a deletion length cut-off of 25-30bp may serve as an objective means to functionally distinguish microdeletions from gross deletions.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Medicine |
Additional Information: | This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial‐NoDerivs License |
Publisher: | Wiley |
ISSN: | 1059-7794 |
Date of First Compliant Deposit: | 25 January 2022 |
Date of Acceptance: | 17 December 2021 |
Last Modified: | 28 May 2024 18:38 |
URI: | https://orca.cardiff.ac.uk/id/eprint/146895 |
Citation Data
Cited 2 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |