Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 

Reduced purifying selection prevails over positive selection in human copy number variant evolution

Nguyen, Duc-Quang, Webber, Caleb, Hehir-Kwa, Jayne, Pfundt, Rolph, Veltman, Joris and Ponting, Chris P. 2008. Reduced purifying selection prevails over positive selection in human copy number variant evolution. Genome Research 18 (11), pp. 1711-1723. 10.1101/gr.077289.108

Copy number variation is a dominant contributor to genomic variation and may frequently underlie an individual’s variable susceptibilities to disease. Here we question our previous proposition that copy number variants (CNVs) are often retained in the human population because of their adaptive benefit. We show that genic biases of CNVs are best explained, not by positive selection, but by reduced efficiency of selection in eliminating deleterious changes from the human population. Of four CNV data sets examined, three exhibit significant increases in protein evolutionary rates. These increases appear to be attributable to the frequent coincidence of CNVs with segmental duplications (SDs) that recombine infrequently. Furthermore, human orthologs of mouse genes, which, when disrupted, result in pre- or postnatal lethality, are unusually depleted in CNVs. Together, these findings support a model of reduced purifying selection (Hill–Robertson interference) within copy number variable regions that are enriched in nonessential genes, allowing both the fixation of slightly deleterious substitutions and increased drift of CNV alleles. Additionally, all four CNV sets exhibited increased rates of interspecies chromosomal rearrangement and nucleotide substitution and an increased gene density. We observe that sequences with high G+C contents are most prone to copy number variation. In particular, frequently duplicated human SD sequence, or CNVs that are large and/or observed frequently, tend to be elevated in G+C content. In contrast, SD sequences that appear fixed in the human population lie more frequently within low G+C sequence. These findings provide an overarching view of how CNVs arise and segregate in the human population.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Medicine
Publisher: Cold Spring Harbor Laboratory Press
ISSN: 1088-9051
Date of First Compliant Deposit: 22 October 2020
Date of Acceptance: 23 July 2008
Last Modified: 05 May 2023 02:40

Citation Data

Cited 67 times in Scopus.
