Chtholly17 committed on
Commit acf8f8f · verified · 1 Parent(s): 5ddccd3

Update README: add author list, real GitHub/Collection/project-page URLs

Files changed (1)
  1. README.md +6 -6
README.md CHANGED
@@ -28,9 +28,9 @@ This release is the **Stage-3 step-279** checkpoint.
  | Resource | Link |
  |---|---|
  | 📄 Paper | <TODO_PAPER_URL> |
- | 💻 Code | https://github.com/<TODO>/VLM-CapCurriculum |
- | 🌐 Project page | <TODO_WEBSITE> |
- | 🤗 Collection (model + data + eval) | https://huggingface.co/collections/UCSC-VLAA/<TODO> |
+ | 💻 Code | https://github.com/UCSC-VLAA/VLM-CapCurriculum |
+ | 🌐 Project page | https://ucsc-vlaa.github.io/VLM-CapCurriculum |
+ | 🤗 Collection (model + data + eval) | https://huggingface.co/collections/UCSC-VLAA/vlm-capcurriculum-from-seeing-to-thinking-icml-2026-6a07691f944148ccb2b183b8 |
 
  ## Headline numbers (extended benchmark suite, AVG over 10 benchmarks)
 
@@ -52,7 +52,7 @@ Three RLVR stages with GRPO (on top of [EasyR1](https://github.com/hiyouga/EasyR
 
  All three stages share **one** system / format prompt — see [Inference](#inference) below.
 
- Detailed launch scripts: [`training/examples/internvl3_5_8b/`](https://github.com/<TODO>/VLM-CapCurriculum/tree/main/training/examples/internvl3_5_8b) in the code repo.
+ Detailed launch scripts: [`training/examples/internvl3_5_8b/`](https://github.com/UCSC-VLAA/VLM-CapCurriculum/tree/main/training/examples/internvl3_5_8b) in the code repo.
 
  ## Inference
 
@@ -72,7 +72,7 @@ lmdeploy serve api_server UCSC-VLAA/VLM-CapCurriculum-InternVL3.5-8B-Staged \
  --server-port 23343 --tp 4
  ```
 
- For VLMEvalKit-style benchmark eval, plug it in via the `InternVL3_5_8B_Staged` alias defined in [`evaluation/configs/models.py`](https://github.com/<TODO>/VLM-CapCurriculum/blob/main/evaluation/configs/models.py).
+ For VLMEvalKit-style benchmark eval, plug it in via the `InternVL3_5_8B_Staged` alias defined in [`evaluation/configs/models.py`](https://github.com/UCSC-VLAA/VLM-CapCurriculum/blob/main/evaluation/configs/models.py).
 
  ## Intended use & limitations
 
@@ -87,7 +87,7 @@ Released under **Apache-2.0**, matching the upstream backbone. If you use this m
  ```bibtex
  @inproceedings{vlmcapcurriculum2026,
  title = {From Seeing to Thinking: Decoupling Perception and Reasoning Improves Post-Training of Vision-Language Models},
- author = {TODO},
+ author = {Juncheng Wu and Hardy Chen and Haoqin Tu and Xianfeng Tang and Freda Shi and Hui Liu and Hanqing Lu and Cihang Xie and Yuyin Zhou},
  booktitle = {Proceedings of the International Conference on Machine Learning (ICML)},
  year = {2026}
  }