John Snow Labs is the De-facto Industry Leader for Medical Large Language Models.
State of the Art Medical Large Language Models
Clinical Note Summarization
Preferred 88% more often on factuality, 92% more often on relevance, 68% more often on conciseness compared to GPT-4o.
Sample Questions:
- Summarize the final pathological diagnosis of the lesion and the patient’s follow-up and recovery after surgery.
- Summarize the patient’s medical history and initial presentation.
- Summarize the background and objectives of the study from the given text.
Clinical Information Extraction
Preferred 46% more often on factuality, 50% more often on relevance and 45% more often on conciseness compared to GPT-4o.
Sample Questions:
- Can the TyG index be used to predict gestational diabetes mellitus (GDM) according to the following text?
- Given the note, what procedures did the patient undergo?
- Given the medical text, did Anlotinib benefit the patient?
BioMedical Question Answering
Preferred 175% more often on factuality, 200% more often on relevance, 256% more often on conciseness compared to GPT-4o.
Sample Questions:
- Given the report, what biomarkers are commonly negative in APL cases?
- Given the note, why is the chemotherapy the mainly used treatment in TNBC patients?
- Given the article, what is sNFL used for?
Private and Compliance Deployment Within Your Network
Runs Privately
Deploy the Medical LLMs within your secure infrastructure, ensuring data sovereignty and full control over sensitive information.
No Data Sharing
Medical LLMs process data locally. No need for external data sharing or internet dependencies.
Built for Compliance
In line with privacy standards like HIPAA or GDPR, ensuring seamless integration into highly regulated environments.
Putting Healthcare LLMs to Production Use
Using Healthcare-Specific LLM’s for Data Discovery from Patient Notes & Stories
The US Department of Veterans Affairs, a health system which serves over 9 million veterans and their families. This collaboration with VA National Artificial Intelligence Institute (NAII), VA Innovations Unit (VAIU) and Office of Information Technology (OI&T) show that while out-of-the-box accuracy of current LLM’s on clinical notes is unacceptable, it can be significantly improved with pre-processing, for example by using John Snow Labs’ clinical text summarization models prior to feeding that as content to the LLM generative AI output.
Text-Prompted Patient Cohort Retrieval: Leveraging Healthcare LLM Models for Precision Population Health Management
Using John Snow Lab’s Healthcare LLM models, the ClosedLoop platform enables users to retrieve cohorts using free-text prompts. Examples include: “Which patients are in the top 5% of risk for an unplanned admission and have chronic kidney disease of stage 3 or higher?” or “Which patients are in the top 5% risk for an admission, older than 72, and have not undergone an annual wellness checkup?”
Lessons Learned Applying Large Language Models
This session shares currently deployed software, lessons learned, and best practices that John Snow Labs has learned while enabling academic medical centers, pharmaceuticals, and health IT companies to build LLM-based solutions. It covers what generative AI use cases are healthcare organizations deploying today, which solution architectures are used to deliver these use cases, and general-purpose LLMs perform versus healthcare-specific LLMs in these use cases.
Lots of companies make claims about healthcare-specific LLM’s. John Snow Labs are the only ones who publish reproducible accuracy benchmarks and have Medical LLM systems in production.