Update README.md
---
license: mit
language:
- en
base_model:
- Qwen/Qwen3-VL-4B-Instruct
tags:
- geometry diagram parsing
- reasoning
- formalization
---

## 1. Install dependencies

We follow the official environment setup of Qwen3-VL. Please refer to:
👉 https://huggingface.co/Qwen/Qwen3-VL-4B-Instruct
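For a typical setup, something like the following is usually enough (a sketch, not the authors' pinned versions; Qwen3-VL support requires a reasonably recent `transformers` release, so check the page above for the current requirements):

```bash
pip install torch accelerate
pip install -U transformers
```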

## 🚀 Inference

We provide a minimal example for running inference with the released Geoparsing model.

```python
import torch
from transformers import Qwen3VLForConditionalGeneration, AutoProcessor

model_path = "YOUR_MODEL_PATH"  # local path or Hugging Face repo id

# Load the weights in the checkpoint's native precision on the first GPU.
model = Qwen3VLForConditionalGeneration.from_pretrained(
    model_path,
    torch_dtype="auto",
    device_map="cuda:0"
)
processor = AutoProcessor.from_pretrained(model_path)

# One user turn: the diagram image followed by the parsing instruction.
messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "image",
                "image": "examples/3_17.jpg",
            },
            {
                "type": "text",
                "text": "Please parse the geometric diagram and provide its formal description.",
            },
        ],
    }
]

# Render the chat template, tokenize, and preprocess the image in one call.
inputs = processor.apply_chat_template(
    messages,
    tokenize=True,
    add_generation_prompt=True,
    return_dict=True,
    return_tensors="pt"
)
inputs = inputs.to(model.device)

generated_ids = model.generate(
    **inputs,
    max_new_tokens=1280
)

# Drop the prompt tokens so only the newly generated text is decoded.
generated_ids_trimmed = [
    out_ids[len(in_ids):] for in_ids, out_ids in zip(inputs.input_ids, generated_ids)
]

output_text = processor.batch_decode(
    generated_ids_trimmed,
    skip_special_tokens=True,
    clean_up_tokenization_spaces=False
)

print(output_text[0])
```
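Greedy decoding (the `generate` default above) is a reasonable choice here, since the formal description of a diagram should be deterministic. To parse several diagrams in one run, the steps above can be wrapped in a small helper; `parse_diagram` below is a hypothetical convenience function that reuses the `model` and `processor` already loaded, not part of the release:

```python
from pathlib import Path

def parse_diagram(image_path: str) -> str:
    # Hypothetical helper: reuses `model` and `processor` from the example above.
    messages = [{
        "role": "user",
        "content": [
            {"type": "image", "image": image_path},
            {"type": "text", "text": "Please parse the geometric diagram and provide its formal description."},
        ],
    }]
    inputs = processor.apply_chat_template(
        messages,
        tokenize=True,
        add_generation_prompt=True,
        return_dict=True,
        return_tensors="pt",
    ).to(model.device)
    generated = model.generate(**inputs, max_new_tokens=1280)
    # Strip the prompt tokens before decoding, as in the example above.
    trimmed = [out[len(inp):] for inp, out in zip(inputs.input_ids, generated)]
    return processor.batch_decode(trimmed, skip_special_tokens=True)[0]

for path in sorted(Path("examples").glob("*.jpg")):
    print(path.name, "->", parse_diagram(str(path)))
```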