Menon, Suraj 2009. Integration and biological interpretation of microarry gene expression profiling data. PhD Thesis, Cardiff University. |
![]() |
PDF
- Accepted Post-Print Version
Download (33MB) |
Abstract
Many different strategies have been developed for the analysis of microarray data and these have a significant influence on the level and quality of knowledge that may be achieved from a microarray-based experiment. Two such strategies are explored in this thesis. Part A of this thesis describes explorations of a resource-efficient strategy that could allow for large-scale integration of microarray data in an unsupervised fashion. For this purpose, comparisons were carried out between a series of genelists manually extracted from the literature, representing a disparate set of microarray experiments. Initial results were highly unexpected, and are likely to have been caused by violations of the assumptions of the hypergeometric test used for assessing comparisons. Statistical modelling was found to successfully simulate these results however the estimated net effect of these violations was found to be considerable. These findings strongly caution against the comparison of microarray experiments using their genelists. Part B then describes the development of Gene Set Discovery (GSD), a novel methodology to perform threshold-free gene set analysis of microarray datasets without requiring sample class information. This was achieved by deriving a novel metric that allows for the selection of those gene sets that exhibit significant discrimination between samples. GSD was implemented on four microarray datasets and the results were found to be biologically plausible and/or in agreement with prior analyses of these datasets. These findings suggested that GSD could be a potentially useful tool for biological theme discovery in microarray datasets, particularly in studies of cancer where sample classification is problematic. Also described is a related methodology for extraction of informative genes from within selected gene sets, and a scheme for visualization of results in an integrated format.
Item Type: | Thesis (PhD) |
---|---|
Status: | Unpublished |
Schools: | Medicine |
ISBN: | 9781303189562 |
Funders: | MRC-BP Dorothy Hodgkin Postgraduate Award, Lord Merthyr Scholarship |
Date of First Compliant Deposit: | 30 March 2016 |
Last Modified: | 16 Oct 2014 12:55 |
URI: | https://orca.cardiff.ac.uk/id/eprint/55858 |
Actions (repository staff only)
![]() |
Edit Item |