AI in Healthcare Archives

AI in Healthcare Blog

Dandelion Health is a provider of multimodal, longitudinal clinical data for healthcare innovators. This session shows how it built a de-identification process for free-text clinical notes, with John Snow Labs’ Healthcare NLP & LLM at its core. This process maintains patient privacy, minimizes risks for hospital systems, and preserves the bulk of free-text notes to provide researchers with high fidelity clinical data.Dandelion Health partners with hospital systems, deidentifies their clinical data in their environment, and then copies this data to the Dandelion data lake so that customers can perform research and validation within the secure Dandelion platform. To ensure HIPAA compliance, deidentification requires an expert determination to confirm that minimal protected health information (PHI) remains after the process.Tabular data is straight-forward to handle by removing or masking data fields with PHI related values – such as patient names, birth dates, addresses, or contact details. Free text patient notes are much more difficult to automatically deidentify, as this requires PHI words and phrases to be redacted or masked, after which the whole of the patient note must be verified.Key topics of the presentation include:1. Breaking down different note types (e.g. radiology reports, pathology reports, echo narratives, progress notes) according to level of risk, and adapting the de-id process accordingly.2. Assessing note subtypes (e.g. radiology reports for DEXA scans, or fetal radiology reports) in order to carve out exceptions to our standard process (e.g. unique note structure, or age formats such as “27w” that need to be redacted).3. Determining the importance of recall, precision, and PHI frequency for quasi-identifiers.4. Applying pre-processing or enhancements such as HIPS (hiding in plain sight) to reduce risk based on the recall, precision, and frequency of PHI in free-text notes. This presentation features real-world case-studies and examples, demonstrating the power of: validating clinician data-quality hypotheses with language models, using different NLP & LLM strategies for different datasets, and letting QA/QC statistics tell the story – so we know that we’re doing right by the patient.

Blog

Deidentifying Free-Text Patient Notes: No Need for Tradeoffs

Applying Healthcare-Specific LLMs to Build Oncology Patient Timelines and Recommend Clinical Guidelines

The emergence of precision oncology necessitates a comprehensive understanding of how genetic, epigenetic, and other factors influence tumor behavior and response to treatment regimens. This understanding is crucial for translating...

Maximizing Patient Care through AI-Enhanced HCC Code Discovery

Hierarchical Condition Category (HCC) coding plays a pivotal role in federally regulated risk adjustment payment models, ensuring accurate reimbursement for health insurance plans and better care for managed populations. Providers...

Measuring the Benefits of Healthcare Specific Large Language Models

There is overwhelming evidence from academic research and industry benchmarks that domain-specific and task-specific large language models outperform general-purpose LLMs across multiple dimensions: Accuracy, veracity, human preference, and cost. This...

Reasoning in Natural Language: Assessing Large Language Model capabilities in Sentiment Analysis

Ida Lucente

A report dedicated to the most current research aimed at using Large Language Models (LLMs) in the field of Sentiment Analysis. This task involves extracting the author’s opinion from the...