SGLANG uses image and video fields, and the current configuration cause the model to not recognize images and videos. Therefore, make this compatible
image
video
· Sign up or log in to comment