Variable selection for causal inference, prediction, and descriptive research: a narrative review of recommendations
File version
Accepted Manuscript (AM)
Editor(s)
Szummer, Karolina
Abstract
There is a growing appreciation that the methods and analyses of medical studies should be tailored towards the type of research question. However, frequent conflation exists with respect to the reasons for statistically adjusting for variables in analyses and the methods that should be used for variable selection in regression models. Non-randomised causal studies require statistical adjustment for confounders that may bias the causal effect estimate. Predictor/prognostic factor studies may present unadjusted associations and/or present associations statistically adjusted for existing predictors to establish the added predictive value of the candidate predictor over and above known predictors. Prediction models aim to identify a set of variables that are clinically useable and are collectively the best at predicting the outcome. Descriptive studies may want to characterise the outcome distribution with respect to an additional variable or standardise with respect to a nuisance variable for which the study sample differs from the target population. This narrative review summarises background theory and existing advice on how variable selection should differ for causal research, prediction modelling, predictor/prognostic factor research, and descriptive research. Examples of variable selection approaches from published cardiovascular research are also provided.
Journal Title
European Heart Journal Open
Rights Statement
© The Author(s) 2025. Published by Oxford University Press on behalf of the European Society of Cardiology. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Note
This publication has been entered in Griffith Research Online as an advance online version.
Subject
Cardiology (incl. cardiovascular diseases)
Citation
Dyer, BP, Variable selection for causal inference, prediction, and descriptive research: a narrative review of recommendations, European Heart Journal Open, 2025, pp. oeaf070