lingshu-medical-mllm/Lingshu-32B · Medical report scores

by deathknight0 - opened Jun 22, 2025

Thanks for the models. Your smaller model seems to outperform your larger model on most medical report benchmarks. Can you share any insights into why this might be the case?

xww033 (Lingshu: MLLMs for Unified Multimodal Medical Understanding and Reasoning, org member) - Jun 23, 2025

> Thanks for the models. Your smaller model seems to outperform your larger model on most medical report benchmarks. Can you share any insights into why this might be the case?

This is also very surprising to us. Part of the training split of the report generation datasets is included in our training data, and the report generation data is substantially different from the VQA data that makes up the majority. We therefore hypothesize that the smaller model fits the training-set patterns of report generation more closely. Another reason, which we point out in our paper, is that current automatic evaluation metrics cannot fully reflect a model's capability in medical report generation.
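
To make the point about automatic metrics concrete, here is a toy sketch. It is not the evaluation code used for Lingshu or in the paper: `unigram_f1` is a hypothetical helper standing in for word-overlap metrics in general, and the three sentences are invented for illustration. It shows how a report that copies the reference phrasing but states the wrong finding can outscore a clinically correct paraphrase.

```python
from collections import Counter


def unigram_f1(candidate: str, reference: str) -> float:
    """Token-overlap F1 between two texts (toy stand-in for n-gram metrics)."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Invented sentences for illustration only; not taken from any benchmark.
reference = "no pleural effusion or pneumothorax is seen"
template_like = "large pleural effusion or pneumothorax is seen"  # wrong finding
paraphrase = "the lungs show neither effusion nor pneumothorax"   # right finding

score_template = unigram_f1(template_like, reference)
score_paraphrase = unigram_f1(paraphrase, reference)

print(f"template-like, clinically wrong: {score_template:.3f}")
print(f"paraphrase, clinically right:    {score_paraphrase:.3f}")
```

In this toy case the overlap score rewards matching the surface pattern of the reference rather than the correctness of the finding, which is consistent with the hypothesis that a model fitting the training-set report style more closely can score higher without being the stronger report writer.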
