Bacolla, A., Larson, J. E., Collins, J. R., Li, J., Milosavljevic, A., Stenson, Peter Daniel, Cooper, David Neil ORCID: https://orcid.org/0000-0002-8943-8484 and Wells, R. D.
2008.
Abundance and length of simple repeats in vertebrate genomes are determined by their structural properties.
Genome Research
18
(10)
, pp. 1545-1553.
10.1101/gr.078303.108
|
Abstract
Microsatellites are abundant in vertebrate genomes, but their sequence representation and length distributions vary greatly within each family of repeats (e.g., tetranucleotides). Biophysical studies of 82 synthetic single-stranded oligonucleotides comprising all tetra- and trinucleotide repeats revealed an inverse correlation between the stability of folded-back hairpin and quadruplex structures and the sequence representation for repeats ≥30 bp in length in nine vertebrate genomes. Alternatively, the predicted energies of base-stacking interactions correlated directly with the longest length distributions in vertebrate genomes. Genome-wide analyses indicated that unstable sequences, such as CAG:CTG and CCG:CGG, were over-represented in coding regions and that micro/minisatellites were recruited in genes involved in transcription and signaling pathways, particularly in the nervous system. Microsatellite instability (MSI) is a hallmark of cancer, and length polymorphism within genes can confer susceptibility to inherited disease. Sequences that manifest the highest MSI values also displayed the strongest base-stacking interactions; analyses of 62 tri- and tetranucleotide repeat-containing genes associated with human genetic disease revealed enrichments similar to those noted for micro/minisatellite-containing genes. We conclude that DNA structure and base-stacking determined the number and length distributions of microsatellite repeats in vertebrate genomes over evolutionary time and that micro/minisatellites have been recruited to participate in both gene and protein function.
| Item Type: | Article |
|---|---|
| Date Type: | Publication |
| Status: | Published |
| Schools: | Schools > Medicine |
| Subjects: | Q Science > QH Natural history > QH426 Genetics R Medicine > R Medicine (General) |
| Publisher: | Cold Spring Harbor Laboratory Press |
| ISSN: | 1088-9051 |
| Last Modified: | 20 Oct 2022 08:40 |
| URI: | https://orca.cardiff.ac.uk/id/eprint/29200 |
Citation Data
Cited 81 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
![]() |
Edit Item |





Altmetric
Altmetric