Mitchell, Lawrence, Sloan, Terence M., Mewissen, Muriel, Ghazal, Peter ORCID: https://orcid.org/0000-0003-0035-2228, Forster, Thorsten, Piotrowski, Michal and Trew, Arthur S. 2011. A parallel random forest classifier for R. p. 1. 10.1145/1996023.1996024 |
Abstract
The statistical language R is favoured by many biostaticians for processing microarray data. In recent times, the quantity of data that can be obtained in experiments has risen significantly, making previously fast analyses time consuming, or even not possible at all with the existing software infrastructure. High Performance Computing (HPC) systems offer a solution to these problems, but at the expense of increased complexity for the end user. The Simple Parallel R Interface (SPRINT) is a library for R that aims to reduce the complexity of using HPC systems by providing biostatisticians with drop-in parallelized replacements of existing R functions. In this paper we describe the implementation of a parallel version of the Random Forest classifier in the SPRINT library.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Medicine |
Last Modified: | 23 Oct 2022 14:02 |
URI: | https://orca.cardiff.ac.uk/id/eprint/112592 |
Citation Data
Cited 15 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |