Southgate, J.A., Bull, M.J., Brown, C.M., Watkins, J., Corden, S., Southgate, B., Moore, C. and Connor, T.R. ORCID: https://orcid.org/0000-0003-2394-6504 2020. Influenza classification from short reads with VAPOR facilitates robust mapping pipelines and zoonotic strain detection for routine surveillance applications. Bioinformatics 36 (6) , pp. 1681-1688. 10.1093/bioinformatics/btz814 |
Preview |
PDF
- Published Version
Available under License Creative Commons Attribution. Download (935kB) | Preview |
Abstract
Motivation: Influenza viruses represent a global public health burden due to annual epidemics and pandemic potential. Due to a rapidly evolving RNA genome, inter-species transmission, intra-host variation, and noise in short-read data, reads can be lost during mapping, and de novo assembly can be time consuming and result in misassembly. We assessed read loss during mapping, and designed a graph-based classifier, VAPOR, for selecting mapping references, assembly validation, and detection of strains of non-human origin. Results: Standard human reference viruses were insufficient for mapping diverse influenza samples in simulation. VAPOR retrieved references for 257 real whole genome sequencing (WGS) samples with a mean of >99.8% identity to assemblies, and increased the proportion of mapped reads by up to 13.3% compared to standard references. VAPOR has the potential to improve the robustness of bioinformatics pipelines for surveillance and could be adapted to other RNA viruses.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Biosciences |
Publisher: | Oxford University Press |
ISSN: | 1367-4803 |
Funders: | MRC, BBSRC |
Date of First Compliant Deposit: | 30 October 2019 |
Date of Acceptance: | 27 October 2019 |
Last Modified: | 08 May 2023 04:23 |
URI: | https://orca.cardiff.ac.uk/id/eprint/126410 |
Citation Data
Cited 2 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |