Article Version of Record

Bootstrap confidence intervals for 11 robust correlations in the presence of outliers and leverage observations

Author(s) / Creator(s)

Li, Johnson Ching-Hong

Abstract / Description

Researchers often examine whether two continuous variables (X and Y) are linearly related. Pearson’s correlation (r) is a widely-employed statistic for assessing bivariate linearity. However, the accuracy of r is known to decrease when data contain outliers and/or leverage observations, a circumstance common in behavioral and social sciences research. This study compares 11 robust correlations with r and evaluates the associated bootstrap confidence intervals [bootstrap standard interval (BSI), bootstrap percentile interval (BPI), and bootstrap bias-corrected-and-accelerated interval (BCaI)] across conditions with and without outliers and/or leverage observations. The simulation results showed that the median-absolute-deviation correlation (r-MAD), median-based correlation (r-MED), and trimmed correlation (r-TRIM) consistently outperformed the other estimates, including r, when data contain outliers and/or leverage observations. This study provides an easy-to-use R code for computing robust correlations and their associated confidence intervals, offers recommendations for their reporting, and discusses implications of the findings for future research.

Keyword(s)

robust correlation bootstrap confidence intervals outliers Monte Carlo simulation

Persistent Identifier

Date of first publication

2022-06-30

Journal title

Methodology

Volume

18

Issue

2

Page numbers

99–125

Publisher

PsychOpen GOLD

Publication status

publishedVersion

Review status

peerReviewed

Is version of

Citation

Li, J. C.-H. (2022). Bootstrap confidence intervals for 11 robust correlations in the presence of outliers and leverage observations. Methodology, 18(2), 99-125. https://doi.org/10.5964/meth.8467
  • Author(s) / Creator(s)
    Li, Johnson Ching-Hong
  • PsychArchives acquisition timestamp
    2022-10-28T10:30:16Z
  • Made available on
    2022-10-28T10:30:16Z
  • Date of first publication
    2022-06-30
  • Abstract / Description
    Researchers often examine whether two continuous variables (X and Y) are linearly related. Pearson’s correlation (r) is a widely-employed statistic for assessing bivariate linearity. However, the accuracy of r is known to decrease when data contain outliers and/or leverage observations, a circumstance common in behavioral and social sciences research. This study compares 11 robust correlations with r and evaluates the associated bootstrap confidence intervals [bootstrap standard interval (BSI), bootstrap percentile interval (BPI), and bootstrap bias-corrected-and-accelerated interval (BCaI)] across conditions with and without outliers and/or leverage observations. The simulation results showed that the median-absolute-deviation correlation (r-MAD), median-based correlation (r-MED), and trimmed correlation (r-TRIM) consistently outperformed the other estimates, including r, when data contain outliers and/or leverage observations. This study provides an easy-to-use R code for computing robust correlations and their associated confidence intervals, offers recommendations for their reporting, and discusses implications of the findings for future research.
    en_US
  • Publication status
    publishedVersion
  • Review status
    peerReviewed
  • Citation
    Li, J. C.-H. (2022). Bootstrap confidence intervals for 11 robust correlations in the presence of outliers and leverage observations. Methodology, 18(2), 99-125. https://doi.org/10.5964/meth.8467
    en_US
  • ISSN
    1614-2241
  • Persistent Identifier
    https://hdl.handle.net/20.500.12034/7655
  • Persistent Identifier
    https://doi.org/10.23668/psycharchives.8372
  • Language of content
    eng
  • Publisher
    PsychOpen GOLD
  • Is version of
    https://doi.org/10.5964/meth.8467
  • Is related to
    https://doi.org/10.23668/psycharchives.7051
  • Keyword(s)
    robust correlation
    en_US
  • Keyword(s)
    bootstrap confidence intervals
    en_US
  • Keyword(s)
    outliers
    en_US
  • Keyword(s)
    Monte Carlo simulation
    en_US
  • Dewey Decimal Classification number(s)
    150
  • Title
    Bootstrap confidence intervals for 11 robust correlations in the presence of outliers and leverage observations
    en_US
  • DRO type
    article
  • Issue
    2
  • Journal title
    Methodology
  • Page numbers
    99–125
  • Volume
    18
  • Visible tag(s)
    Version of Record
    en_US