shawlyu nielsr HF Staff commited on
Commit
af5eb5e
·
1 Parent(s): 3bd7644

Improve model card (#1)

Browse files

- Improve model card (965d4ba5c97956b94e33dbfe4f71241e4c6bb0e9)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +37 -4
README.md CHANGED
@@ -1,26 +1,38 @@
1
  ---
2
  license: mit
 
3
  tags:
4
- - depth-estimation
5
  - video-depth
6
  - monocular-geometry
7
  - streaming
8
  ---
9
 
10
- # DyFN
11
 
12
- Pretrained checkpoint for **Stabilizing Streaming Video Geometry via Dynamic Feature Normalization**.
 
 
 
 
 
13
 
14
  - **File:** `DyFN.pt`
15
  - **Parameters:** ~320M
16
  - **Base:** MoGe-ViT-L with ConvGRU temporal stabilizer
17
- - **Code:** [shawLyu/Streaming_DyFN](https://github.com/shawLyu/Streaming_DyFN)
18
 
19
  ## Usage
20
 
 
 
 
 
 
 
 
21
  ```python
22
  from moge.model.v1 import MoGeModel
23
 
 
24
  model = MoGeModel.from_pretrained("shawlyu/DyFN")
25
  ```
26
 
@@ -29,3 +41,24 @@ Or pass a local path:
29
  ```python
30
  model = MoGeModel.from_pretrained("./pretrained/DyFN.pt")
31
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
+ pipeline_tag: depth-estimation
4
  tags:
 
5
  - video-depth
6
  - monocular-geometry
7
  - streaming
8
  ---
9
 
10
+ # DyFN: Stabilizing Streaming Video Geometry via Dynamic Feature Normalization
11
 
12
+ This repository contains the pretrained checkpoint for **DyFN**, a model designed for consistent 3D geometry estimation from streaming RGB input.
13
+
14
+ [**Paper**](https://huggingface.co/papers/2605.25308) | [**Project Page**](https://shawlyu.github.io/DyFN) | [**Code**](https://github.com/shawLyu/Streaming_DyFN)
15
+
16
+ ## Description
17
+ Dynamic Feature Normalization (DyFN) is a lightweight, causal recurrent module that dynamically and robustly modulates feature statistics to maintain stable geometry over time. By finetuning only DyFN (a mere 2% additional parameters) on pretrained monocular geometry models, it effectively eliminates temporal artifacts such as disjointed layering and positional jitter without compromising single-image accuracy.
18
 
19
  - **File:** `DyFN.pt`
20
  - **Parameters:** ~320M
21
  - **Base:** MoGe-ViT-L with ConvGRU temporal stabilizer
 
22
 
23
  ## Usage
24
 
25
+ To use this model, you can install the package via:
26
+ ```bash
27
+ pip install git+https://github.com/shawLyu/Streaming_DyFN.git
28
+ ```
29
+
30
+ Then, load the model with the following snippet:
31
+
32
  ```python
33
  from moge.model.v1 import MoGeModel
34
 
35
+ # Load from Hugging Face Hub
36
  model = MoGeModel.from_pretrained("shawlyu/DyFN")
37
  ```
38
 
 
41
  ```python
42
  model = MoGeModel.from_pretrained("./pretrained/DyFN.pt")
43
  ```
44
+
45
+ ## Citation
46
+
47
+ If you find this project useful in your research, please cite:
48
+
49
+ ```bibtex
50
+ @inproceedings{lyu2026streamingdepth,
51
+ title={Stabilizing Streaming Video Geometry via Dynamic Feature Normalization},
52
+ author={Lyu, Xiaoyang and Liu, Muxin and Wu, Xiaoshan and Wang, Ruicheng and Huang, Yi-Hua and Sun, Yang-Tian and Shi, Shaoshuai and Qi, Xiaojuan},
53
+ booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
54
+ year={2026}
55
+ }
56
+
57
+ @inproceedings{wang2025moge,
58
+ title={Moge: Unlocking accurate monocular geometry estimation for open-domain images with optimal training supervision},
59
+ author={Wang, Ruicheng and Xu, Sicheng and Dai, Cassie and Xiang, Jianfeng and Deng, Yu and Tong, Xin and Yang, Jiaolong},
60
+ booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
61
+ pages={5261--5271},
62
+ year={2025}
63
+ }
64
+ ```