Artemiou, Andreas ORCID: https://orcid.org/0000-0002-7501-4090 and Li, Bing 2009. On principal components and regression: a statistical explanation of a natural phenomenon. Statistica Sinica 19 , pp. 1557-1565. |
Abstract
In this note we give a probabilistic explanation of a phenomenon that is frequently observed but whose reason is not well understood. That is, in a regression setting, the response (Y ) is often highly correlated with the leading principal components of the predictor (X) even though there seems no logical reason for this connection. This phenomenon has long been noticed and discussed in the literature, and has received renewed interest recently because of the need for regressing Y on X of very high dimension, often with comparatively few sampling units, in which case it seems natural to regress on the first few principal components of X. This work stems from a discussion of a recent paper by Cook (2007) which, along with other developments, described a historical debate surrounding, and current interest in, this phenomenon.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Mathematics |
Subjects: | Q Science > QA Mathematics |
Publisher: | Academia Sinica, Institute of Statistical Science |
ISSN: | 1017-0405 |
Last Modified: | 28 Oct 2022 09:54 |
URI: | https://orca.cardiff.ac.uk/id/eprint/76046 |
Citation Data
Cited 22 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |