Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Bayesian networks elucidate complex genomic landscapes in cancer

Angelopoulos, Nicos ORCID:, Chatzipli, Aikaterini, Nangalia, Jyoti, Maura, Francesco and Campbell, Peter J. 2022. Bayesian networks elucidate complex genomic landscapes in cancer. Communications Biology 5 (1) , 306. 10.1038/s42003-022-03243-w

[thumbnail of 1Bayesian networks elucidate complex.pdf]
PDF - Published Version
Available under License Creative Commons Attribution.

Download (2MB) | Preview


Bayesian networks (BNs) are disciplined, explainable Artificial Intelligence models that can describe structured joint probability spaces. In the context of understanding complex relations between a number of variables in biological settings, they can be constructed from observed data and can provide a guiding, graphical tool in exploring such relations. Here we propose BNs for elucidating the relations between driver events in large cancer genomic datasets. We present a methodology that is specifically tailored to biologists and clinicians as they are the main producers of such datasets. We achieve this by using an optimal BN learning algorithm based on well established likelihood functions and by utilising just two tuning parameters, both of which are easy to set and have intuitive readings. To enhance value to clinicians, we introduce (a) the use of heatmaps for families in each network, and (b) visualising pairwise co-occurrence statistics on the network. For binary data, an optional step of fitting logic gates can be employed. We show how our methodology enhances pairwise testing and how biologists and clinicians can use BNs for discussing the main relations among driver events in large genomic cohorts. We demonstrate the utility of our methodology by applying it to 5 cancer datasets revealing complex genomic landscapes. Our networks identify central patterns in all datasets including a central 4-way mutual exclusivity between HDR, t(4,14), t(11,14) and t(14,16) in myeloma, and a 3-way mutual exclusivity of three major players: CALR, JAK2 and MPL, in myeloproliferative neoplasms. These analyses demonstrate that our methodology can play a central role in the study of large genomic cancer datasets.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Medicine
Additional Information: This article is licensed under a Creative Commons Attribution 4.0 International License
Publisher: Nature Research
ISSN: 2399-3642
Date of First Compliant Deposit: 14 April 2022
Date of Acceptance: 9 March 2022
Last Modified: 04 May 2023 17:23

Actions (repository staff only)

Edit Item Edit Item


Downloads per month over past year

View more statistics