Could you please clarify whether the parameters of this model were trained using the SigLIP2 method or fine-tuned through multiple stages of VideoLLaMA3?
· Sign up or log in to comment