Lewis, P.D. and Menzies, Georgina ORCID: https://orcid.org/0000-0002-6600-6507 2015. Vibrational spectra, principal components analysis and the horseshoe effect. Vibrational Spectroscopy 81 , pp. 62-67. 10.1016/j.vibspec.2015.10.002 |
Abstract
Vibrational spectroscopy studies often generate datasets containing multiple spectra that are categorized into distinct groups according to similarity. Principal components analysis (PCA) is one of the most frequently used multivariate analysis methods for data reduction of vibrational spectra and visualization of potential groupings between subjects. Vibrational spectra usually display unimodal or multimodal distribution patterns of absorbance or transmittance across wavenumbers. PCA, requires that a linear relationship exists between data distributions of the objects under analysis otherwise the method is prone to a serious artifact known as the ‘horseshoe effect’. This artifact, well known in other fields of science, manifests as a serious distortion of the pattern of how objects group according to the most important principal components leading to misinterpretation of the relationships between the samples from which they are derived. In this paper, using a simulated mid-infrared spectral dataset, we investigate for the first time the potential for the PCA horseshoe effect on vibrational spectra and the why this artifact occurs. We show that when comparing large regions of contiguous wavenumbers between multiple spectra there can be a non-linear relationship between distributions of different spectra. Such non-linearity causes the horseshoe effect and we demonstrate that the degree of distortion of how spectra map on the first two components is related to the region size. We further show that reducing the size of spectra analyzed by PCA can minimize the horseshoe effect. We conclude that PCA should be used with caution in the analysis and interpretation of vibrational spectra and the application of more robust methods should be explored.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Medicine |
Subjects: | R Medicine > R Medicine (General) |
Uncontrolled Keywords: | Vibrational spectroscopy; Principal components analysis; Horseshoe effect |
Publisher: | Elsevier |
ISSN: | 0924-2031 |
Date of Acceptance: | 8 October 2015 |
Last Modified: | 21 Oct 2022 07:08 |
URI: | https://orca.cardiff.ac.uk/id/eprint/99224 |
Citation Data
Cited 7 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |