-
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models
Paper β’ 2410.13085 β’ Published β’ 24 -
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Paper β’ 2408.02900 β’ Published β’ 31 -
MediAug: Exploring Visual Augmentation in Medical Imaging
Paper β’ 2504.18983 β’ Published β’ 7 -
Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks
Paper β’ 2410.18387 β’ Published
Collections
Discover the best community collections!
Collections including paper arxiv:2410.13085
-
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
Paper β’ 2411.18672 β’ Published -
CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation
Paper β’ 2406.11451 β’ Published -
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper β’ 2506.07044 β’ Published β’ 114 -
ΞΌ^2Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation
Paper β’ 2507.00316 β’ Published β’ 15
-
PDFTriage: Question Answering over Long, Structured Documents
Paper β’ 2309.08872 β’ Published β’ 55 -
Adapting Large Language Models via Reading Comprehension
Paper β’ 2309.09530 β’ Published β’ 82 -
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper β’ 2310.09263 β’ Published β’ 40 -
Context-Aware Meta-Learning
Paper β’ 2310.10971 β’ Published β’ 17
-
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models
Paper β’ 2410.13085 β’ Published β’ 24 -
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Paper β’ 2408.02900 β’ Published β’ 31 -
MediAug: Exploring Visual Augmentation in Medical Imaging
Paper β’ 2504.18983 β’ Published β’ 7 -
Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks
Paper β’ 2410.18387 β’ Published
-
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
Paper β’ 2411.18672 β’ Published -
CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation
Paper β’ 2406.11451 β’ Published -
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper β’ 2506.07044 β’ Published β’ 114 -
ΞΌ^2Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation
Paper β’ 2507.00316 β’ Published β’ 15
-
PDFTriage: Question Answering over Long, Structured Documents
Paper β’ 2309.08872 β’ Published β’ 55 -
Adapting Large Language Models via Reading Comprehension
Paper β’ 2309.09530 β’ Published β’ 82 -
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper β’ 2310.09263 β’ Published β’ 40 -
Context-Aware Meta-Learning
Paper β’ 2310.10971 β’ Published β’ 17