Med-Gemini : A New AI Model Reaching 91.1% Accuracy in Medical Diagnostics

Nitika Sharma 01 May, 2024 • 3 min read

Researchers from Google and DeepMind have introduced Med-Gemini, a new generation of AI models specifically tailored for medical applications. Building on the strengths of the 2023 Gemini models renowned for language processing, multimodal understanding, and long-context reasoning, Med-Gemini significantly enhances these capabilities for healthcare applications.

Med-Gemini’s superiority is demonstrated through evaluation on 14 medical benchmarks, where it achieves new state-of-the-art performance on 10 benchmarks, often surpassing GPT-4 models significantly. Notably, on MedQA (USMLE), Med-Gemini achieved 91.1% accuracy, outperforming prior models by 4.6%.

Results from our first ✨Med-Gemini✨models, variants of the base Gemini multimodal models that have been tuned for the medical domain.

The multimodal and long context capabilities of the Gemini models really shine in some of the capabilities.

Joint work with @vivnat… https://t.co/KkKdLXjBx7
— Jeff Dean (@🏡) (@JeffDean) April 30, 2024

The Making of Med-Gemini

Med-Gemini opens exciting doors for AI in medicine. It can assist doctors in tackling complex diagnoses, engage in informative medical dialogue, and efficiently analyze vast amounts of data within electronic health records. The researchers achieved this specialization through innovative techniques:

Self-training with Web Search Integration: Med-Gemini can access and integrate up-to-date medical information from the web, ensuring its knowledge stays current.
Multimodal Fine-Tuning: The model can adapt to incorporate new medical data formats, making it future-proof.
Customized Encoders: Med-Gemini can process various data types, including text, images, videos, and even sensor readings from medical equipment.

Self-training with Web Search Integration

Capabilities of Med-Gemini

Med-Gemini is introduced as a family of highly capable, multimodal medical models built upon Gemini. The models’ clinical reasoning capabilities are enhanced through self-training and web search integration, while multimodal performance is improved via fine-tuning and customized encoders.

Med-Gemini models achieve state-of-the-art (SoTA) performance on 10 out of 14 medical benchmarks spanning text, multimodal, and long-context applications, surpassing the GPT-4 model family on every benchmark where a direct comparison could be made.

The bar chart below demonstrates the relative percentage gains from the models over prior SoTA across the benchmarks. Particularly on the MedQA (USMLE) benchmark, a new SoTA is achieved, surpassing the prior best (Med-PaLM 2) by a significant margin of 4.6%.

Additionally, re-annotation of the dataset with expert clinicians reveals that 7.4% of questions are deemed unfit for evaluation due to lacking key information, having incorrect answers, or supporting multiple plausible interpretations. These data quality issues are accounted for to characterize the performance of the model more precisely.

Med-Gemini models excel in multimodal and long-context capabilities, evidenced by their SoTA performance on several benchmarks including needle-in-a-haystack retrieval from long, de-identified health records, and medical video question answering benchmarks.

Beyond benchmarks, the real-world potential of Med-Gemini is demonstrated through quantitative evaluation on medical summarization, referral letter generation, and medical simplification tasks where the models outperform human experts, in addition to qualitative examples of multimodal medical dialogue.

Safety and Accuracy Remain Paramount

The paper emphasizes the importance of safety and accuracy in medical applications. The researchers acknowledge the need for specialized techniques like prompting and fine-tuning to ensure responsible AI development in this critical domain.

One such technique is the “uncertainty-guided search strategy.” This allows Med-Gemini to access and integrate relevant web search results during complex clinical reasoning tasks, leading to more nuanced and reliable outcomes.

Also Read: Top 7 AI Healthcare Solution Providers

Dialogue Example

You can find the research paper here.

Our Say

Med-Gemini’s multimodal capabilities open doors for more natural and comprehensive interactions between healthcare providers and patients. Doctors can leverage the model’s ability to analyze various data types, while the model itself can interact more conversationally, requesting additional information for a more complete picture.

This development adds to Google’s growing portfolio of healthcare-focused AI models, including Med-PaLM 2, AlphaFold, and Flan-PaLM. Med-Gemini represents a significant step forward in AI-powered healthcare, paving the way for a future with enhanced diagnostics, personalized medicine, and improved patient-provider communication.

Follow us on Google News to stay updated with the latest innovations in the world of AI, Data Science, & GenAI.