nielsr HF Staff committed on
Commit fb4a8ec · verified · 1 Parent(s): b74ebe5

Add pipeline tag, sample usage and paper/code links

Hi! I'm Niels from the Hugging Face community science team.

This pull request improves the model card by:
- Adding the `text-generation` pipeline tag to the metadata.
- Including a link to the research paper: "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data".
- Providing a sample usage code snippet from the official repository to help users load the model.
- Adding the BibTeX citation from the paper.
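
The metadata change in the first bullet is a YAML front-matter block at the top of `README.md`. As a minimal sketch for checking that the tag is present, the snippet below reads flat `key: value` pairs from such a block using only the standard library. `read_front_matter` is a hypothetical helper written for this illustration, not part of the RADD repository or any Hugging Face library, and it assumes simple one-line values rather than full YAML.

```python
def read_front_matter(text: str) -> dict:
    """Return flat `key: value` pairs from a leading '---' delimited block.

    Illustrative helper only: handles one-line scalar values, not full YAML.
    """
    lines = text.splitlines()
    # Front matter must start on the very first line of the file.
    if not lines or lines[0].strip() != "---":
        return {}
    meta = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return meta  # closing delimiter reached
        key, sep, value = line.partition(":")
        if sep:
            meta[key.strip()] = value.strip()
    return {}  # no closing delimiter, so treat the block as absent


# README header as added by this commit.
readme = """---
pipeline_tag: text-generation
---

# RADD-Medium (lambda-dce)
"""

metadata = read_front_matter(readme)
print(metadata.get("pipeline_tag"))
```

For anything beyond flat keys (lists, nested mappings), a real YAML parser would be the safer choice.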

Files changed (1)
  1. README.md +34 -3
README.md CHANGED
@@ -1,7 +1,38 @@
- Reparameterized Absorbing Discrete Diffusion (RADD) medium model with lambda-dce loss trained for 400k iterations.

- Code: https://github.com/ML-GSAI/RADD.

- Paper: https://arxiv.org/abs/2406.03736.

+ ---
+ pipeline_tag: text-generation
+ ---

+ # RADD-Medium (lambda-dce)

+ Reparameterized Absorbing Discrete Diffusion (RADD) medium model with lambda-dce loss trained for 400k iterations.

+ This model was introduced in the paper [Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data](https://huggingface.co/papers/2406.03736).

+ ## Model Description
+ RADD is a dedicated diffusion model without time-condition that characterizes time-independent conditional probabilities. This architecture unifies training objectives for absorbing discrete diffusion and any-order autoregressive models (AO-ARMs). The removal of the time condition allows for caching strategies that significantly improve sampling speed. This specific checkpoint is the **medium** version (approx. 405M parameters) trained using the **$\lambda$-DCE** loss function.
+
+ ## Links
+ - **Paper:** [Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data](https://arxiv.org/abs/2406.03736)
+ - **Code:** [Official GitHub Repository](https://github.com/ML-GSAI/RADD)
+
+ ## Sample Usage
+ To load the model and noise schedule, you can use the following code (requires the `load_model.py` script from the official [GitHub repository](https://github.com/ML-GSAI/RADD)):
+
+ ```python
+ from load_model import load_model
+
+ # Load model and noise schedule
+ model, noise = load_model('JingyangOu/radd-lambda-dce-medium', device='cuda')
+ ```
+
+ ## Citation
+ ```bibtex
+ @misc{ou2024absorbingdiscretediffusionsecretly,
+ title={Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data},
+ author={Jingyang Ou and Shen Nie and Kaiwen Xue and Fengqi Zhu and Jiacheng Sun and Zhenguo Li and Chongxuan Li},
+ year={2024},
+ eprint={2406.03736},
+ archivePrefix={arXiv},
+ primaryClass={cs.LG},
+ }
+ ```