Improve model card: Add `library_name` and structure content (#1)
Browse files- Improve model card: Add `library_name` and structure content (ecb2bd440cd8aadeb3fc9b02ab35bcb068b35c55)
Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>
README.md
CHANGED
|
@@ -1,12 +1,28 @@
|
|
| 1 |
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
base_model:
|
| 4 |
- stabilityai/stable-diffusion-2-1
|
|
|
|
| 5 |
pipeline_tag: image-to-image
|
|
|
|
| 6 |
---
|
| 7 |
|
| 8 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
|
| 10 |
-
|
|
|
|
| 11 |
|
| 12 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
|
|
|
| 2 |
base_model:
|
| 3 |
- stabilityai/stable-diffusion-2-1
|
| 4 |
+
license: apache-2.0
|
| 5 |
pipeline_tag: image-to-image
|
| 6 |
+
library_name: diffusers
|
| 7 |
---
|
| 8 |
|
| 9 |
+
# OSCAR: One-Step Diffusion Codec Across Multiple Bit-rates
|
| 10 |
+
|
| 11 |
+
## Abstract
|
| 12 |
+
Pretrained latent diffusion models have shown strong potential for lossy image compression, owing to their powerful generative priors. Most existing diffusion-based methods reconstruct images by iteratively denoising from random noise, guided by compressed latent representations. While these approaches have achieved high reconstruction quality, their multi-step sampling process incurs substantial computational overhead. Moreover, they typically require training separate models for different compression bit-rates, leading to significant training and storage costs. To address these challenges, we propose a one-step diffusion codec across multiple bit-rates. termed OSCAR. Specifically, our method views compressed latents as noisy variants of the original latents, where the level of distortion depends on the bit-rate. This perspective allows them to be modeled as intermediate states along a diffusion trajectory. By establishing a mapping from the compression bit-rate to a pseudo diffusion timestep, we condition a single generative model to support reconstructions at multiple bit-rates. Meanwhile, we argue that the compressed latents retain rich structural information, thereby making one-step denoising feasible. Thus, OSCAR replaces iterative sampling with a single denoising pass, significantly improving inference efficiency. Extensive experiments demonstrate that OSCAR achieves superior performance in both quantitative and visual quality metrics.
|
| 13 |
+
|
| 14 |
+
## Paper
|
| 15 |
+
[OSCAR: One-Step Diffusion Codec Across Multiple Bit-rates](https://arxiv.org/pdf/2505.16091)
|
| 16 |
|
| 17 |
+
## Code
|
| 18 |
+
The code is available on GitHub: [https://github.com/jp-guo/OSCAR](https://github.com/jp-guo/OSCAR)
|
| 19 |
|
| 20 |
+
## Citation
|
| 21 |
+
```bibtex
|
| 22 |
+
@article{guo2025oscar,
|
| 23 |
+
title={OSCAR: One-Step Diffusion Codec Across Multiple Bit-rates},
|
| 24 |
+
author={Guo, Jinpei and Ji, Yifei and Chen, Zheng and Liu, Kai and Liu, Min and Rao, Wang and Li, Wenbo and Guo, Yong and Zhang, Yulun},
|
| 25 |
+
journal={arXiv preprint arXiv:2505.16091},
|
| 26 |
+
year={2025}
|
| 27 |
+
}
|
| 28 |
+
```
|