Precision phenotyping for curating research cohorts of patients with unexplained post-acute sequelae of COVID-19, 2024, Azhir

Discussion in 'Long Covid research' started by Dolphin, Nov 8, 2024.

  1. Dolphin

    Dolphin Senior Member (Voting Rights)

    Messages:
    5,884
    https://www.cell.com/med/fulltext/S2666-6340(24)00407-0

    November 08, 2024
    Open access
    Precision phenotyping for curating research cohorts of patients with unexplained post-acute sequelae of COVID-19

    Context and significance
    Identifying cohorts of patients with post-acute sequelae of COVID-19 (PASC), or long COVID, using real-world data is complex. The absence of precise definitions for PASC poses significant challenges in clinical research and patient care. Utilizing electronic health records from a large integrated healthcare system, Azhir et al. developed a precision phenotyping algorithm incorporating a novel attention mechanism that accounts for both infection-related chronic conditions and differential diagnoses. This approach demonstrated superior accuracy in identifying PASC cases compared to the existing ICD-10-CM code U09.9 while also mitigating demographic biases in diagnosis. The implications are profound, offering a refined tool for constructing research cohorts to explore the genetics and metabolomics of long COVID, thereby enhancing the health systems’ capacity to manage it.

    Highlights

    • Precision PASC phenotyping algorithm identifies long COVID with attention mechanism

    • Incorporates both infection-association chronic condition and diagnosis of exclusion

    • Outperforms U09.9 in precision and reduces bias in long COVID identification

    • Captures rare long COVID symptoms, including vision loss and diabetic complications

    Summary

    Background


    Scalable identification of patients with post-acute sequelae of COVID-19 (PASC) is challenging due to a lack of reproducible precision phenotyping algorithms, which has led to suboptimal accuracy, demographic biases, and underestimation of the PASC.

    Methods


    In a retrospective case-control study, we developed a precision phenotyping algorithm for identifying cohorts of patients with PASC. We used longitudinal electronic health records data from over 295,000 patients from 14 hospitals and 20 community health centers in Massachusetts. The algorithm employs an attention mechanism to simultaneously exclude sequelae that prior conditions can explain and include infection-associated chronic conditions. We performed independent chart reviews to tune and validate the algorithm.

    Findings


    The PASC phenotyping algorithm improves precision and prevalence estimation and reduces bias in identifying PASC cohorts compared to the ICD-10-CM code U09.9. The algorithm identified a cohort of over 24,000 patients with 79.9% precision. Our estimated prevalence of PASC was 22.8%, which is close to the national estimates for the region. We also provide in-depth analyses, encompassing identified lingering effects by organ, comorbidity profiles, and temporal differences in the risk of PASC.

    Conclusions


    PASC precision phenotyping boasts superior precision and prevalence estimation while exhibiting less bias in identifying patients with PASC. The cohort derived from this algorithm will serve as a springboard for delving into the genetic, metabolomic, and clinical intricacies of PASC, surmounting the constraints of prior PASC cohort studies.

    Funding

    This research was funded by the US National Institute of Allergy and Infectious Diseases (NIAID).

    Azhir A et al. “Precision Phenotyping for Curating Research Cohorts of Patients with Unexplained Post-Acute Sequelae of COVID-19” Med DOI: 10.1016/j.medj.2024.10.009

     
    Last edited: Nov 8, 2024
    Kitty likes this.
  2. Dolphin

    Dolphin Senior Member (Voting Rights)

    Messages:
    5,884
    https://www.massgeneralbrigham.org/...ses-of-long-covid-from-patient-health-records

    Press release:

    Mass General Brigham researchers developed an AI algorithm to unveil the elusive traces of long COVID in patients’ health records using ‘precision phenotyping’

    KEY TAKEAWAYS

    • Researchers from Mass General Brigham are leveraging artificial intelligence to help identify the signs of long COVID, track how different symptoms manifest over time, and eliminate alternative explanations for patients’ symptoms.
    • The new approach suggests that 22.8% of the population experience the symptoms of long COVID, a figure that may paint a more realistic picture of the pandemic’s long-term toll.
    • Through analyzing a patient’s history over time, this new AI tool may offer a personalized approach to care and help reduce inequities and biases seen in current diagnostics for long COVID.
    Investigators at Mass General Brigham have developed an AI-based tool to sift through electronic health records to help clinicians identify cases of long COVID, an often mysterious condition that can encompass a litany of enduring symptoms, including fatigue, chronic cough, and brain fog after infection from SARS-CoV-2. The results, which are published in the journal Med, could identify more people who should be receiving care for this potentially debilitating condition. The number of cases they identified also suggests that the prevalence of long COVID could be greatly underrecognized.

    “Our AI tool could turn a foggy diagnostic process into something sharp and focused, giving clinicians the power to make sense of a challenging condition,” said senior author Hossein Estiri, PhD, head of AI Research at the Center for AI and Biomedical Informatics of the Learning Healthcare System (CAIBILS) at Mass General Brigham and an associate professor of Medicine at Harvard Medical School. “With this work, we may finally be able to see long COVID for what it truly is—and more importantly, how to treat it.”

    Long COVID, also known as Post-Acute Sequelae of SARS-CoV-2 infection (PASC), includes a wide range of symptoms. For the purposes of their study, Estiri and colleagues defined it as a diagnosis of exclusion that is also infection associated. That means the diagnosis could not be explained in the patient’s unique medical record and it also had to associate with a COVID infection. In addition, the diagnosis needed to have persisted for 2 months or longer in a 12-month follow up window.

    The algorithm used in the AI tool was developed by drawing de-identified patient data from the clinical records of nearly 300,000 patients across 14 hospitals and 20 community health centers in the Mass General Brigham system. Rather than having to rely on a single diagnosis code, the AI utilizes a novel method developed by Estiri and colleagues called “precision phenotyping” that sifts through individual records to identify symptoms and conditions linked to COVID-19 and to track symptoms over time in order to differentiate them from other illnesses. For example, the algorithm can detect if shortness of breath may be the result of pre-existing conditions like heart failure or asthma rather than a long COVID. Only when every other possibility was exhausted would the tool flag the patient as having long COVID.

    “Physicians are often faced with having to wade through a tangled web of symptoms and medical histories, unsure of which threads to pull, while balancing busy caseloads. Having a tool powered by AI that can methodically do it for them could be a game-changer,” said Alaleh Azhir, MD, the co-lead author who is an internal medicine resident at Brigham Women’s Hospital, a founding member of the Mass General Brigham healthcare system.

    The patient-centered diagnoses provided by this new method may also help alleviate biases built into current diagnostics for long COVID, according to the researchers, who note that patients diagnosed with the official ICD-10 diagnostic code for long COVID trend towards those with easier access to healthcare. While other diagnostic studies have suggested that approximately 7% of the population suffers from long COVID, this new approach reveals a much higher estimate—22.8%. The authors stated that this figure aligns more closely with national trends and paints a more realistic picture of the pandemic’s long-term toll.

    The researchers determined their tool was about 3 percent more accurate than what ICD-10 codes capture, while being less biased. Specifically, their study demonstrated that the individuals they identified as having long COVID mirror the broader demographic makeup of Massachusetts, unlike long COVID algorithms that rely on a single diagnostic code or individual clinical encounters, skewing results toward certain populations such as those with more access to care. “This broader scope ensures that marginalized communities, often sidelined in clinical studies, are no longer invisible,” said Estiri.

    Limitations of the study and AI tool include that health record data used in the algorithm to account for long COVID symptoms may be less complete than what is captured by physicians in post-visit clinical notes. Another limitation was the algorithm did not capture possible worsening of a prior condition, which may have been a long COVID symptom. For example, if a patient had COPD and prior episodes of it worsened before they developed COVID-19, the algorithm might have removed them even if their persisting symptoms were a long COVID indicator. Declines in the amount of COVID-19 testing in recent years also makes it difficult to identify when a patient may have first gotten COVID-19. The study was also limited to patients in Massachusetts.

    Future studies may explore the algorithm in cohorts of patients with specific conditions, like COPD or diabetes. The researchers also plan to release this algorithm publicly on open access where physicians and healthcare systems globally can use it in their patient populations.

    In addition to opening the door to better clinical care, this work may lay the foundation for future research into the genetic and biochemical factors behind long COVID’s various subtypes. “Questions about the true burden of long COVID—questions that have thus far remained elusive—now seem more within reach,” said Estiri.

    Authorship: In addition to Estiri, Mass General Brigham authors include Alaleh Azhir, Jonas Hügel, Jiazi Tian, Jingya Cheng, Ingrid V. Bassett, Emily S. Lau, Yevgeniy R. Semenov, Virginia A. Triant, Zachary H. Strasser, Jeffrey G. Klann, and Shawn N. Murphy. Additional authors include, Douglas S. Bell, Elmer V. Bernstam, Maha R. Farhat, Darren W. Henderson, Michele Morris, and Shyam Visweswaran.

    Disclosures: None.

    Funding: Support from the National Institutes of Health, National Institute of Allergy and Infectious Diseases (NIAID) R01AI165535, National Heart, Lung, and Blood Institute (NHLBI) OT2HL161847, and National Center for Advancing Translational Sciences (NCATS) UL1 TR003167, UL1 TR001881, and U24TR004111. J.Hügel's work was partially funded by a fellowship within the IFI programme of the German Academic Exchange Service (DAAD) and by the Federal Ministry of Education and Research (BMBF) as well by the German Research Foundation (426671079).

    Paper cited: Azhir A et al. “Precision Phenotyping for Curating Research Cohorts of Patients with Unexplained Post-Acute Sequelae of COVID-19” Med DOI: 10.1016/j.medj.2024.10.009

     
    Kitty and Yann04 like this.
  3. InitialConditions

    InitialConditions Senior Member (Voting Rights)

    Messages:
    1,714
    Location:
    North-West England
    At the risk of sounding like a stuck record, I believe those with life-changing disability and illness due to long covid are at risk of having their illness watered down with prevalence rates such as these, which are based on the very lax WHO definiton of long covid (any lingering (>2 months) symptoms after SARS-Cov-2).
     
    alktipping, Dolphin and Kitty like this.

Share This Page