Identifying patterns of item missing survey data using latent groups: an observational study

View/ Open
File version
Version of Record (VoR)
Author(s)
Barnett, Adrian G
McElwee, Paul
Nathan, Andrea
Burton, Nicola W
Turrell, Gavin
Griffith University Author(s)
Year published
2017
Metadata
Show full item recordAbstract
Objectives: To examine whether respondents to a survey of health and physical activity and potential determinants could be grouped according to the questions they missed, known as 'item missing'.
Design: Observational study of longitudinal data.
Setting: Residents of Brisbane, Australia.
Participants: 6901 people aged 40-65 years in 2007.
Materials and methods: We used a latent class model with a mixture of multinomial distributions and chose the number of classes using the Bayesian information criterion. We used logistic regression to examine if participants' characteristics were associated with their modal latent class. ...
View more >Objectives: To examine whether respondents to a survey of health and physical activity and potential determinants could be grouped according to the questions they missed, known as 'item missing'. Design: Observational study of longitudinal data. Setting: Residents of Brisbane, Australia. Participants: 6901 people aged 40-65 years in 2007. Materials and methods: We used a latent class model with a mixture of multinomial distributions and chose the number of classes using the Bayesian information criterion. We used logistic regression to examine if participants' characteristics were associated with their modal latent class. We used logistic regression to examine whether the amount of item missing in a survey predicted wave missing in the following survey. Results: Four per cent of participants missed almost one-fifth of the questions, and this group missed more questions in the middle of the survey. Eighty-three per cent of participants completed almost every question, but had a relatively high missing probability for a question on sleep time, a question which had an inconsistent presentation compared with the rest of the survey. Participants who completed almost every question were generally younger and more educated. Participants who completed more questions were less likely to miss the next longitudinal wave. Conclusions: Examining patterns in item missing data has improved our understanding of how missing data were generated and has informed future survey design to help reduce missing data.
View less >
View more >Objectives: To examine whether respondents to a survey of health and physical activity and potential determinants could be grouped according to the questions they missed, known as 'item missing'. Design: Observational study of longitudinal data. Setting: Residents of Brisbane, Australia. Participants: 6901 people aged 40-65 years in 2007. Materials and methods: We used a latent class model with a mixture of multinomial distributions and chose the number of classes using the Bayesian information criterion. We used logistic regression to examine if participants' characteristics were associated with their modal latent class. We used logistic regression to examine whether the amount of item missing in a survey predicted wave missing in the following survey. Results: Four per cent of participants missed almost one-fifth of the questions, and this group missed more questions in the middle of the survey. Eighty-three per cent of participants completed almost every question, but had a relatively high missing probability for a question on sleep time, a question which had an inconsistent presentation compared with the rest of the survey. Participants who completed almost every question were generally younger and more educated. Participants who completed more questions were less likely to miss the next longitudinal wave. Conclusions: Examining patterns in item missing data has improved our understanding of how missing data were generated and has informed future survey design to help reduce missing data.
View less >
Journal Title
BMJ Open
Volume
7
Issue
10
Copyright Statement
© The Author(s) 2017. This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial.
Subject
Clinical sciences
Health services and systems
Public health
Other health sciences
Science & Technology
Life Sciences & Biomedicine
Medicine, General & Internal
General & Internal Medicine
MULTIPLE IMPUTATION