KiyotakaWang committed
Commit 2b41305 · verified · 1 Parent(s): a5b93d6

Update README.md

Files changed (1)
  1. README.md +10 -0
README.md CHANGED
@@ -34,3 +34,13 @@ Project Links
  🧩 SAgoge Dataset: https://huggingface.co/datasets/InternSVG/SAgoge
 
  🤖 InternSVG-8B Model: https://huggingface.co/InternSVG/InternSVG-8B
+
+ ## Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning
+
+ In this work, we present CTRL-S (Chain-of-Thought Reinforcement Learning for SVG), a unified framework that introduces a chain-of-thought mechanism to explicitly expose the model’s reasoning process during SVG generation. To support this structured reasoning, we construct SVG-Sophia, a high-quality dataset of 145K samples across SVG code refinement, Text-to-SVG, and Image-to-SVG tasks. Furthermore, we design a robust multi-reward reinforcement learning scheme powered by the GRPO algorithm. By jointly optimizing across DINO, image-text similarity, format, and code efficiency rewards in a multi-task setting, our approach systematically boosts structural coherence and generation capabilities. Extensive experiments show that CTRL-S outperforms existing methods, achieving higher task success rates, superior code quality, and exceptional visual fidelity.
+
+ 📄 ArXiv Paper: https://arxiv.org/abs/2603.16189
+
+ 💻 GitHub Repository: https://github.com/hmwang2002/CTRL-S
+
+ 🧩 SVG-Sophia Dataset: https://huggingface.co/datasets/InternSVG/SVG-Sophia
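
Reviewer note: the added README paragraph mentions combining DINO, image-text similarity, format, and code efficiency rewards under GRPO, but does not say how they are combined. The following is only a minimal sketch of the general idea, not the CTRL-S implementation. The weighted-sum aggregation, the weight values, the score values, and all function names are assumptions made for illustration. The only part taken from GRPO itself is standardizing rewards within a group of samples drawn for the same prompt.

```python
from statistics import mean, pstdev

# Hypothetical weights: the README does not state how the rewards are combined.
WEIGHTS = {"dino": 1.0, "image_text": 1.0, "format": 0.5, "code_efficiency": 0.5}


def combined_reward(scores, weights=WEIGHTS):
    """Weighted sum of the reward components (an assumed aggregation rule)."""
    return sum(weights[name] * scores[name] for name in weights)


def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantages: standardize each reward within its group."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up scores for three SVG candidates sampled for the same prompt.
group = [
    {"dino": 0.9, "image_text": 0.8, "format": 1.0, "code_efficiency": 0.6},
    {"dino": 0.5, "image_text": 0.6, "format": 1.0, "code_efficiency": 0.9},
    {"dino": 0.7, "image_text": 0.7, "format": 0.0, "code_efficiency": 0.5},
]

rewards = [combined_reward(scores) for scores in group]
advantages = group_relative_advantages(rewards)

for reward, advantage in zip(rewards, advantages):
    print(f"reward={reward:.2f} advantage={advantage:+.2f}")
```

Candidates that score above their group's mean receive a positive advantage and the rest a negative one, so no separate value model is needed to form a baseline.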