Article Version of Record

Using pointwise mutual information for breast cancer health disparities research with SEER-Medicare claims

Author(s) / Creator(s)

Egleston, Brian L.
Chanda, Ashis Kumar
Bai, Tian
Fang, Carolyn Y.
Bleicher, Richard J.
Vucetic, Slobodan

Abstract / Description

Identification of procedures using International Classification of Diseases or Healthcare Common Procedure Coding System codes is challenging when conducting medical claims research. We demonstrate how Pointwise Mutual Information can be used to find associated codes. We apply the method to an investigation of racial differences in breast cancer outcomes. We used Surveillance, Epidemiology, and End Results (SEER) data linked to Medicare claims. We identified treatment using two methods. First, we used previously published definitions. Second, we augmented definitions using codes empirically identified by the Pointwise Mutual Information statistic. Similar to previous findings, we found that presentation differences between Black and White women closed much of the estimated survival curve gap. However, we found that survival disparities were completely eliminated with the augmented treatment definitions. We were able to control for a wider range of treatment patterns that might affect survival differences between Black and White women with breast cancer.
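
The abstract does not spell out how the Pointwise Mutual Information statistic is computed, so the following is only a minimal sketch of the standard definition, PMI(a, b) = log[p(a, b) / (p(a) p(b))], applied to code co-occurrence. It is not the authors' implementation. The function names `pmi_table` and `associated_codes`, the placeholder codes (`CODE_A` and so on), the use of one code list per record, and the natural logarithm are all assumptions made for illustration.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_table(records):
    """Return {(code_a, code_b): PMI} for every code pair seen together.

    `records` is a list of code lists (hypothetical: one list per record).
    Probabilities are estimated as the share of records containing a code
    or a pair of codes.
    """
    n = len(records)
    single = Counter()
    pair = Counter()
    for codes in records:
        unique = sorted(set(codes))          # count each code once per record
        single.update(unique)
        pair.update(combinations(unique, 2))  # keys are alphabetically ordered
    return {
        (a, b): math.log((count / n) / ((single[a] / n) * (single[b] / n)))
        for (a, b), count in pair.items()
    }


def associated_codes(table, seed, top=5):
    """Rank the codes that co-occur with `seed` by descending PMI."""
    scored = []
    for (a, b), score in table.items():
        if a == seed:
            scored.append((b, score))
        elif b == seed:
            scored.append((a, score))
    return sorted(scored, key=lambda item: item[1], reverse=True)[:top]


# Toy data with made-up placeholder codes, not real ICD or HCPCS codes.
records = [
    ["CODE_A", "CODE_B"],
    ["CODE_A", "CODE_B", "CODE_C"],
    ["CODE_C", "CODE_D"],
    ["CODE_D"],
]

table = pmi_table(records)
ranked = associated_codes(table, "CODE_A")
print(ranked)
```

In this toy data, `CODE_A` and `CODE_B` always appear together, so their PMI is positive, while `CODE_A` and `CODE_C` co-occur exactly as often as independence would predict, so their PMI is zero. Candidate codes with high PMI against a known seed code are the ones a researcher might review for inclusion in an augmented definition.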

Keyword(s)

SEER-Medicare claims; machine learning; pointwise mutual information; breast cancer; health disparities

Persistent Identifier

https://hdl.handle.net/20.500.12034/8353
https://doi.org/10.23668/psycharchives.12830

Date of first publication

2023-03-31

Journal title

Methodology

Volume

19

Issue

1

Page numbers

43–59

Publisher

PsychOpen GOLD

Publication status

publishedVersion

Review status

peerReviewed

Is version of

https://doi.org/10.5964/meth.8535

Citation

Egleston, B. L., Chanda, A. K., Bai, T., Fang, C. Y., Bleicher, R. J., & Vucetic, S. (2023). Using pointwise mutual information for breast cancer health disparities research with SEER-Medicare claims. Methodology, 19(1), 43–59. https://doi.org/10.5964/meth.8535
  • Author(s) / Creator(s)
    Egleston, Brian L.
    Chanda, Ashis Kumar
    Bai, Tian
    Fang, Carolyn Y.
    Bleicher, Richard J.
    Vucetic, Slobodan
  • PsychArchives acquisition timestamp
    2023-04-28T10:04:26Z
  • Made available on
    2023-04-28T10:04:26Z
  • Date of first publication
    2023-03-31
  • Publication status
    publishedVersion
  • Review status
    peerReviewed
  • ISSN
    1614-2241
  • Persistent Identifier
    https://hdl.handle.net/20.500.12034/8353
  • Persistent Identifier
    https://doi.org/10.23668/psycharchives.12830
  • Language of content
    eng
  • Publisher
    PsychOpen GOLD
  • Is version of
    https://doi.org/10.5964/meth.8535
  • Is related to
    https://doi.org/10.23668/psycharchives.12591
  • Is related to
    https://github.com/ashischanda/MedCS/
  • Keyword(s)
    SEER-Medicare claims
    machine learning
    pointwise mutual information
    breast cancer
    health disparities
  • Dewey Decimal Classification number(s)
    150
  • Title
    Using pointwise mutual information for breast cancer health disparities research with SEER-Medicare claims
  • DRO type
    article
  • Issue
    1
  • Journal title
    Methodology
  • Page numbers
    43–59
  • Volume
    19
  • Visible tag(s)
    Version of Record