Image-Text-to-Text
Transformers
Safetensors
English
Chinese
text-generation
GUI
GUI-Grounding
Vision-language
multimodal
conversational
custom_code
# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("yappertar4/Sergup-SPT-9B0on-Tencent", trust_remote_code=True, dtype="auto")
This model is a fine-tune of a vision-capable model from Tencent!
- Downloads last month: 17
Model tree for yappertar4/Sergup-SPT-9B0on-Tencent
Base model
Qwen/Qwen3-8B-Base
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="yappertar4/Sergup-SPT-9B0on-Tencent", trust_remote_code=True)
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)
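The pipeline call takes a chat-style `messages` list with one image entry and one text entry. As a minimal sketch, the helper below builds that same structure for any image URL and question. The name `build_messages` is hypothetical and is not part of transformers or of this model's custom code; it only reproduces the dictionary layout shown in the snippet.

```python
from typing import Any


def build_messages(image_url: str, question: str) -> list[dict[str, Any]]:
    """Build a single-turn user message with one image and one text part.

    Hypothetical helper: it mirrors the message layout used in the
    pipeline snippet and does not call the model or any library.
    """
    return [
        {
            "role": "user",
            "content": [
                {"type": "image", "url": image_url},
                {"type": "text", "text": question},
            ],
        },
    ]


# Same image and question as in the pipeline snippet.
messages = build_messages(
    "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG",
    "What animal is on the candy?",
)
```

The resulting list can be passed to the pipeline in the same way as the hand-written one, that is, as `pipe(text=messages)`.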