Advances in Multimodal Clinical Diagnosis and Analysis

The field of clinical diagnosis and analysis is rapidly advancing with the integration of multimodal large language models (MLLMs) and traditional deep learning techniques. Recent developments have focused on improving the accuracy and efficiency of disease diagnosis, clinical risk prediction, and medical image analysis. Notably, the use of MLLMs has shown promise in handling complex clinical data, such as unstructured text and images, and in generating informative demonstrations for in-context learning. Furthermore, the application of explainable AI techniques has enhanced the transparency and trustworthiness of diagnostic models, allowing for better understanding of diagnostic uncertainties and improved clinical decision-making. The combination of these advances has the potential to revolutionize clinical practice, enabling more accurate and efficient diagnosis, and ultimately improving patient outcomes. Noteworthy papers in this area include the proposal of Retrieval-Augmented In-Context Learning (RAICL) framework, which dynamically retrieves informative demonstrations to enhance in-context learning in MLLMs, and the development of ConfiDx, an uncertainty-aware large language model that excels in identifying diagnostic uncertainties and generating trustworthy explanations for diagnoses.

Sources

Entropy Heat-Mapping: Localizing GPT-Based OCR Errors with Sliding-Window Shannon Analysis

MIMIC-\RNum{4}-Ext-22MCTS: A 22 Millions-Event Temporal Clinical Time-Series Dataset with Relative Timestamp for Risk Prediction

Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs

Explainable AI-Driven Detection of Human Monkeypox Using Deep Learning and Vision Transformers: A Comprehensive Analysis

High-Fidelity Pseudo-label Generation by Large Language Models for Training Robust Radiology Report Classifiers

Retrieval-augmented in-context learning for multimodal large language models in disease classification

Enhancing LLMs' Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry

A Vision-Language Model for Focal Liver Lesion Classification

Uncertainty-Aware Large Language Models for Explainable Disease Diagnosis

PointExplainer: Towards Transparent Parkinson's Disease Diagnosis

Hesitation is defeat? Connecting Linguistic and Predictive Uncertainty

Integration of Large Language Models and Traditional Deep Learning for Social Determinants of Health Prediction

Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models