Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Insights into the architecture of earthworm metallothionein genes, powered by long-read genomics and transcriptomics

Karpov, Maxim A., Short, Stephen, Kille, Peter ORCID: https://orcid.org/0000-0001-6023-5221, Hobbs, Carl, Etxabe, Amaia Green, Spurgeon, David J. and Stürzenbaum, Stephen R. 2026. Insights into the architecture of earthworm metallothionein genes, powered by long-read genomics and transcriptomics. NAR Genomics and Bioinformatics 8 (1) , lqaf195. 10.1093/nargab/lqaf195

[thumbnail of lqaf195.pdf] PDF - Published Version
Download (7MB)

Abstract

The metal-handling metallothionein gene family is subjected to a strong selection pressure in earthworms inhabiting metalliferous soils. The resultant single nucleotide polymorphisms and structural variants within metallothionein’s characteristically short exons challenge standard gene prediction and genome assembly tools and complicate phylogenetic analysis of earthworm metallothioneins (wMTs). Here, the genomic origin of wMTs was defined by sequencing four wMT-containing Bacterial Artificial Chromosomes (BACs). This yielded sequence contigs of lengths from 97 to 123 kb and enabled further genetic analyses, supporting the creation of a BLAST-based software for visualizing long-read RNA-seq alignments—lrRNAseq TAST (Total Alignment Search Tool). Phylogenetic analysis of wMT proteins from long-read RNA-seq data grouped wMTs into three distinct clades. Tissue-specific lrRNAseq and metallomic mapping, conducted in context of lead toxicity, revealed enrichment of wMT-1 and wMT-2 in tissues of heavy metal detoxification, and colocalization of wMT with lead in the chloragog. The generation of rare wMT isoforms was attributed to errors in expression, but the erroneous transcripts may possess a degree of order and purpose. A conserved region in the 3′ UTR of wMT transcripts was found completely distinct between wMT homologues, allowing classification without inferring phylogenetic trees.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Schools > Biosciences
Additional Information: License information from Publisher: LICENSE 1: URL: https://creativecommons.org/licenses/by/4.0/, Type: cc-by
Publisher: Oxford University Press
Date of First Compliant Deposit: 13 January 2026
Date of Acceptance: 11 November 2025
Last Modified: 13 Jan 2026 12:01
URI: https://orca.cardiff.ac.uk/id/eprint/183840

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics