Medical report scores

#4
by deathknight0 - opened

Thanks for the models. Your smaller model seems to outperform your larger model on most medical report benchmarks. Can you share any insight into why this might be the case?

Lingshu: MLLMs for Unified Multimodal Medical Understanding and Reasoning org

This was also very surprising to us. Part of the benchmarks' training split is included in the training data, and the report generation data is substantially different from the VQA data that makes up the majority, so we hypothesize that the smaller model fits the training-set patterns of report generation more closely. Another reason, noted in our paper, is that current automatic evaluation metrics cannot fully reflect a model's capability for medical report generation.
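To make the point about metrics concrete, here is a toy sketch. It is not the evaluation code or the metrics used for these benchmarks: the `unigram_f1` helper and the example sentences are made up purely for illustration. It shows how a simple word-overlap score can rank a clinically contradictory report above a correct paraphrase.

```python
from collections import Counter


def unigram_f1(reference: str, candidate: str) -> float:
    """Toy word-overlap F1 (hypothetical helper, not a benchmark metric)."""
    ref = Counter(reference.lower().split())
    cand = Counter(candidate.lower().split())
    # Multiset intersection counts the words shared by both strings.
    overlap = sum((ref & cand).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Invented example sentences, not taken from any dataset.
reference = "no evidence of pneumothorax"
contradicting = "evidence of pneumothorax"    # opposite clinical meaning
paraphrase = "lungs remain fully expanded"    # same meaning, different words

score_contradicting = unigram_f1(reference, contradicting)
score_paraphrase = unigram_f1(reference, paraphrase)
print(f"contradicting: {score_contradicting:.3f}, paraphrase: {score_paraphrase:.3f}")
```

In this toy case the contradictory report scores higher simply because it shares more words with the reference. A model that closely reproduces training-set phrasing can benefit from overlap-based scoring in the same way, without necessarily being clinically more accurate.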
