Multiple regressions: the meaning of multiple regression and the non-problem of collinearity

Research output: Contribution to journalArticlepeer-review

Abstract

Simple regression (regression analysis with a single explanatory variable), and multiple regression (regression models with multiple explanatory variables), typically correspond to very different biological questions. The former use regression lines to describe univariate associations. The latter describe the partial, or direct, effects of multiple variables, conditioned on one another. We suspect that the superficial similarity of simple and multiple regression leads to confusion in their interpretation. A clear understanding of these methods is essential, as they underlie a large range of procedures in common use in biology. Beyond simple and multiple regression in their most basic forms, understanding the key principles of these procedures is critical to understanding, and properly applying, many methods, such as mixed models, generalised models, and causal inference using graphs (including path analysis and its extensions). A simple, but careful, look at the distinction between these two analyses is valuable in its own right, and can also be used to clarify widely-held misconceptions about collinearity (correlations among explanatory variables). There is no general sense in which collinearity is a problem. We suspect that the perception of collinearity as a hindrance to analysis stems from misconceptions about interpretation of multiple regression models, and so we pursue discussions about these misconceptions in this light. In particular, collinearity causes multiple regression coefficients to be less precisely estimated than corresponding simple regression coefficients. This should not be interpreted as a problem, as it is perfectly natural that direct effects should be harder to characterise than univariate associations. Purported solutions to the perceived problems of collinearity are detrimental to most biological analyses.
Original languageEnglish
Article number3
Number of pages24
JournalPhilosophy, Theory and Practice in Biology
Volume10
DOIs
Publication statusPublished - 2018

Keywords

  • Regression
  • Multiple regression
  • Collinearity
  • Ordinary least squares
  • Linear model
  • Causal effect
  • Correlation

Fingerprint

Dive into the research topics of 'Multiple regressions: the meaning of multiple regression and the non-problem of collinearity'. Together they form a unique fingerprint.

Cite this