Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 

DRAGON-Data: A platform and protocol for integrating genomic and phenotypic data across large psychiatric cohorts

Lynham, Amy J., Knott, Sarah, Underwood, Jack F. G., Hubbard, Leon, Agha, Sharifah S., Bisson, Jonathan I., van den Bree, Marianne B. M., Chawner, Samuel J. R. A., Craddock, Nicholas, O'Donovan, Michael, Jones, Ian R., Kirov, George, Langley, Kate, Martin, Joanna, Rice, Frances, Roberts, Neil P., Thapar, Anita, Anney, Richard, Owen, Michael J., Hall, Jeremy, Pardinas, Antonio F. and Walters, James T. R. 2023. DRAGON-Data: A platform and protocol for integrating genomic and phenotypic data across large psychiatric cohorts. BJPsych Open 9 (2), e32. 10.1192/bjo.2022.636

PDF - Published Version
Available under License Creative Commons Attribution.

PDF - Supplemental Material
Available under License Creative Commons Attribution.

License Start date: 8 February 2023


Background
Current psychiatric diagnoses, although heritable, have not been clearly mapped onto distinct underlying pathogenic processes. The same symptoms often occur in multiple disorders, and a substantial proportion of both genetic and environmental risk factors are shared across disorders. However, the relationship between shared symptoms and shared genetic liability is still poorly understood.

Aims
Well-characterised, cross-disorder samples are needed to investigate this matter, but few currently exist. Our aim is to develop procedures to purposely curate and aggregate genotypic and phenotypic data in psychiatric research.

Method
As part of the Cardiff MRC Mental Health Data Pathfinder initiative, we have curated and harmonised phenotypic and genetic information from 15 studies to create a new data repository, DRAGON-Data. To date, DRAGON-Data includes over 45 000 individuals: adults and children with neurodevelopmental or psychiatric diagnoses, affected probands within collected families and individuals who carry a known neurodevelopmental risk copy number variant.

Results
We have processed the available phenotype information to derive core variables that can be reliably analysed across groups. In addition, all data-sets with genotype information have undergone rigorous quality control, imputation, copy number variant calling and polygenic score generation.

Conclusions
DRAGON-Data combines genetic and non-genetic information, and is available as a resource for research across traditional psychiatric diagnostic categories. Algorithms and pipelines used for data harmonisation are currently publicly available for the scientific community, and an appropriate data-sharing protocol will be developed as part of ongoing projects (DATAMIND) in partnership with Health Data Research UK.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Advanced Research Computing @ Cardiff (ARCCA)
MRC Centre for Neuropsychiatric Genetics and Genomics (CNGG)
Neuroscience and Mental Health Research Institute (NMHRI)
Publisher: Cambridge University Press
ISSN: 2056-4724
Funders: MRC, Wellcome Trust
Date of First Compliant Deposit: 10 January 2023
Date of Acceptance: 21 December 2022
Last Modified: 11 Jun 2024 09:57
