Bacolla, A., Larson, J. E., Collins, J. R., Li, J., Milosavljevic, A., Stenson, Peter Daniel, Cooper, David Neil ORCID: https://orcid.org/0000-0002-8943-8484 and Wells, R. D. 2008. Abundance and length of simple repeats in vertebrate genomes are determined by their structural properties. Genome Research 18 (10) , pp. 1545-1553. 10.1101/gr.078303.108 |
Abstract
Microsatellites are abundant in vertebrate genomes, but their sequence representation and length distributions vary greatly within each family of repeats (e.g., tetranucleotides). Biophysical studies of 82 synthetic single-stranded oligonucleotides comprising all tetra- and trinucleotide repeats revealed an inverse correlation between the stability of folded-back hairpin and quadruplex structures and the sequence representation for repeats ≥30 bp in length in nine vertebrate genomes. Alternatively, the predicted energies of base-stacking interactions correlated directly with the longest length distributions in vertebrate genomes. Genome-wide analyses indicated that unstable sequences, such as CAG:CTG and CCG:CGG, were over-represented in coding regions and that micro/minisatellites were recruited in genes involved in transcription and signaling pathways, particularly in the nervous system. Microsatellite instability (MSI) is a hallmark of cancer, and length polymorphism within genes can confer susceptibility to inherited disease. Sequences that manifest the highest MSI values also displayed the strongest base-stacking interactions; analyses of 62 tri- and tetranucleotide repeat-containing genes associated with human genetic disease revealed enrichments similar to those noted for micro/minisatellite-containing genes. We conclude that DNA structure and base-stacking determined the number and length distributions of microsatellite repeats in vertebrate genomes over evolutionary time and that micro/minisatellites have been recruited to participate in both gene and protein function.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Medicine |
Subjects: | Q Science > QH Natural history > QH426 Genetics R Medicine > R Medicine (General) |
Publisher: | Cold Spring Harbor Laboratory Press |
ISSN: | 1088-9051 |
Last Modified: | 20 Oct 2022 08:40 |
URI: | https://orca.cardiff.ac.uk/id/eprint/29200 |
Citation Data
Cited 81 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |