jankin123/4DThinker-3B

Video-Text-to-Text

dynamic-spatial-reasoning

vision-language-model

latent-reasoning

jankin123 committed 15 days ago

Commit 1c9ae24 · verified · 1 parent: 13cf9a7

Update README.md

Files changed (1)

README.md +12 -0

@@ -62,6 +62,18 @@ model = Qwen2_5_VLForConditionalGeneration.from_pretrained(
 processor = AutoProcessor.from_pretrained("./model/4drl")
 ```
+## Bibtex
+If you find 4DThinker helpful for your work, please cite
+```
+@article{chen20264dthinker,
+  title={4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding},
+  author={Chen, Zhangquan and Zhang, Manyuan and Yu, Xinlei and An, Xiang and Li, Bo and Xie, Xin and Wang, ZiDong and Sun, Mingze and Chen, Shuang and Li, Hongyu and others},
+  journal={arXiv preprint arXiv:2605.05997},
+  year={2026}
+}
+```
 ## License
 Apache License 2.0