Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

IMHOTEP A composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants

Knecht, Carolin, Mort, Matthew, Junge, Olaf, Cooper, David Neil ORCID: https://orcid.org/0000-0002-8943-8484, Krawczak, Michael and Caliebe, Amke 2017. IMHOTEP A composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants. Nucleic Acids Research 45 (3) , e13. 10.1093/nar/gkw886

[thumbnail of IMHOTEP.pdf]
Preview
PDF - Published Version
Available under License Creative Commons Attribution.

Download (315kB) | Preview

Abstract

The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Medicine
Subjects: R Medicine > R Medicine (General)
Uncontrolled Keywords: mutation , decision trees , models, statistical , nucleotides , summation , human gene mutation database , heuristics , imputation , datasets , bioinformatics
Additional Information: This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/),
Publisher: Oxford University Press
ISSN: 0305-1048
Date of First Compliant Deposit: 10 February 2017
Date of Acceptance: 26 September 2016
Last Modified: 08 May 2023 00:13
URI: https://orca.cardiff.ac.uk/id/eprint/98233

Citation Data

Cited 12 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics