nielsr HF Staff commited on
Commit
4588667
·
verified ·
1 Parent(s): d41e78d

Improve model card and add metadata

Browse files

Hi, I'm Niels from the community science team at Hugging Face.

This PR improves the model card for DecQ. Key changes include:
- Added the `image-to-image` pipeline tag to the metadata.
- Included links to the research paper and the official GitHub repository.
- Added a brief description of the model and its architecture.
- Added a sample usage section with instructions for image reconstruction.
- Added a BibTeX citation.

Files changed (1) hide show
  1. README.md +42 -0
README.md CHANGED
@@ -1,3 +1,45 @@
1
  ---
2
  license: apache-2.0
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ pipeline_tag: image-to-image
4
  ---
5
+
6
+ # DecQ: Detail-Condensing Queries for Enhanced Reconstruction and Generation in Representation Autoencoders
7
+
8
+ This repository contains the weights for **DecQ**, a framework that introduces lightweight detail-condensing queries into Representation Autoencoders (RAEs). DecQ improves spatial reconstruction capacity while preserving the pretrained semantic space of vision foundation models (VFMs), facilitating high-quality image reconstruction and faster convergence in latent diffusion models.
9
+
10
+ - **Paper:** [DecQ: Detail-Condensing Queries for Enhanced Reconstruction and Generation in Representation Autoencoders](https://huggingface.co/papers/2605.22777)
11
+ - **Repository:** [GitHub - Tianhang-Wang/DecQ](https://github.com/Tianhang-Wang/DecQ)
12
+
13
+ ## Overview
14
+
15
+ DecQ addresses the reconstruction–generation trade-off in RAEs by using detail-condensing queries to extract fine-grained information from intermediate VFM features through condenser modules. These queries are incorporated into the decoder to support reconstruction and are jointly generated with patch tokens during generative modeling.
16
+
17
+ Key features:
18
+ - **Lightweight:** Only 8 additional queries and 3.9% extra computation.
19
+ - **Improved Reconstruction:** Significant PSNR improvement over frozen DINOv2-based RAEs.
20
+ - **Faster Convergence:** Achieves 3.3× faster convergence in generative modeling.
21
+
22
+ This repository currently contains the **Stage 1 tokenizer weights**.
23
+
24
+ ## Sample Usage
25
+
26
+ To perform image reconstruction using the Stage 1 autoencoder, you can use the sampling script provided in the [official repository](https://github.com/Tianhang-Wang/DecQ):
27
+
28
+ ```bash
29
+ python src/stage1_sample.py \
30
+ --config <config_path> \
31
+ --image <input_image_path>
32
+ ```
33
+
34
+ Refer to the GitHub repository for environment setup and configuration files.
35
+
36
+ ## Citation
37
+
38
+ ```bibtex
39
+ @article{wang2026decq,
40
+ title={DecQ: Detail-Condensing Queries for Enhanced Reconstruction and Generation in Representation Autoencoders},
41
+ author={Wang, Tianhang and Chen, Yitong and Song, Wei and Wu, Zuxuan and Li, Min and Wang, Jiaqi},
42
+ journal={arXiv preprint arXiv:2605.22777},
43
+ year={2026}
44
+ }
45
+ ```