Article Version of Record

Clustering longitudinal data using R: A Monte Carlo Study

Author(s) / Creator(s)

Verboon, Peter
Pat-El, Ron

Abstract / Description

The analysis of change within subjects over time is an ever more important research topic. Besides modelling the individual trajectories, a related aim is to identify clusters of subjects within these trajectories. Various methods for analyzing these longitudinal trajectories have been proposed. In this paper we investigate the performance of three different methods under various conditions in a Monte Carlo study. The first method is based on the non-parametric k-means algorithm. The second is a latent class mixture model, and the third a method based on the analysis of change indices. All methods are available in R. Results show that the k-means method performs consistently well in recovering the known clustering structure. The mixture model method performs reasonably well, but the change indices method has problems with smaller data sets.

Keyword(s)

longitudinal clustering Monte Carlo change trajectories k-means latent class mixture model R

Persistent Identifier

Date of first publication

2022-06-30

Journal title

Methodology

Volume

18

Issue

2

Page numbers

144–163

Publisher

PsychOpen GOLD

Publication status

publishedVersion

Review status

peerReviewed

Is version of

Citation

Verboon, P., & Pat-El, R. (2022). Clustering longitudinal data using R: A Monte Carlo Study. Methodology, 18(2), 144-163. https://doi.org/10.5964/meth.7143
  • Author(s) / Creator(s)
    Verboon, Peter
  • Author(s) / Creator(s)
    Pat-El, Ron
  • PsychArchives acquisition timestamp
    2022-10-28T10:30:16Z
  • Made available on
    2022-10-28T10:30:16Z
  • Date of first publication
    2022-06-30
  • Abstract / Description
    The analysis of change within subjects over time is an ever more important research topic. Besides modelling the individual trajectories, a related aim is to identify clusters of subjects within these trajectories. Various methods for analyzing these longitudinal trajectories have been proposed. In this paper we investigate the performance of three different methods under various conditions in a Monte Carlo study. The first method is based on the non-parametric k-means algorithm. The second is a latent class mixture model, and the third a method based on the analysis of change indices. All methods are available in R. Results show that the k-means method performs consistently well in recovering the known clustering structure. The mixture model method performs reasonably well, but the change indices method has problems with smaller data sets.
    en_US
  • Publication status
    publishedVersion
  • Review status
    peerReviewed
  • Citation
    Verboon, P., & Pat-El, R. (2022). Clustering longitudinal data using R: A Monte Carlo Study. Methodology, 18(2), 144-163. https://doi.org/10.5964/meth.7143
    en_US
  • ISSN
    1614-2241
  • Persistent Identifier
    https://hdl.handle.net/20.500.12034/7653
  • Persistent Identifier
    https://doi.org/10.23668/psycharchives.8370
  • Language of content
    eng
  • Publisher
    PsychOpen GOLD
  • Is version of
    https://doi.org/10.5964/meth.7143
  • Is related to
    https://doi.org/10.23668/psycharchives.7052
  • Keyword(s)
    longitudinal
    en_US
  • Keyword(s)
    clustering
    en_US
  • Keyword(s)
    Monte Carlo
    en_US
  • Keyword(s)
    change
    en_US
  • Keyword(s)
    trajectories
    en_US
  • Keyword(s)
    k-means
    en_US
  • Keyword(s)
    latent class
    en_US
  • Keyword(s)
    mixture model
    en_US
  • Keyword(s)
    R
    en_US
  • Dewey Decimal Classification number(s)
    150
  • Title
    Clustering longitudinal data using R: A Monte Carlo Study
    en_US
  • DRO type
    article
  • Issue
    2
  • Journal title
    Methodology
  • Page numbers
    144–163
  • Volume
    18
  • Visible tag(s)
    Version of Record
    en_US