Knecht, Carolin, Mort, Matthew, Junge, Olaf, Cooper, David Neil ![]() ![]() |
Preview |
PDF
- Published Version
Available under License Creative Commons Attribution. Download (315kB) | Preview |
Abstract
The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Medicine |
Subjects: | R Medicine > R Medicine (General) |
Uncontrolled Keywords: | mutation , decision trees , models, statistical , nucleotides , summation , human gene mutation database , heuristics , imputation , datasets , bioinformatics |
Additional Information: | This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), |
Publisher: | Oxford University Press |
ISSN: | 0305-1048 |
Date of First Compliant Deposit: | 10 February 2017 |
Date of Acceptance: | 26 September 2016 |
Last Modified: | 08 May 2023 00:13 |
URI: | https://orca.cardiff.ac.uk/id/eprint/98233 |
Citation Data
Cited 12 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
![]() |
Edit Item |