MDM-Prime-v2-C4

MDM-Prime-v2 is an enhanced version of the MDM-Prime framework. MDM-Prime is a discrete diffusion model enhanced with the Partial masking scheme (Prime). It enables fine-grained denoising and improves generation quality across both image and text domains.

This repository contains the models presented in the paper:

Links:


Model Details

  • Dataset: C4 (English)
  • Model Size: 14M - 3.4B
  • Context Length: 2,048

How to Use

To download the weights, one can download the huggingface_hub library via pip install -U huggingface_hub and perform the following python code:

from huggingface_hub import hf_hub_download

path = hf_hub_download(
    repo_id="chen-hao-chao/mdm-prime-v2-c4",
    filename="${checkpoint_name}"
)

Replace ${checkpoint_name} with ${model}/${setup} (e.g., prime/prime_param_3426M_iter_14000). This repository is organized as follows:

mdm-prime-v2-c4/
β”œβ”€β”€ README.md
β”œβ”€β”€ arm/
β”œβ”€β”€ mdm/
└── prime/
    β”œβ”€β”€ prime_param_14M_iter_70000/
    β”œβ”€β”€ prime_param_25M_iter_80000/
    β”œβ”€β”€ .../
    └── prime_param_3426M_iter_14000/
         β”œβ”€β”€ latest_checkpointed_iteration.txt
         └── iter_0014000

For more details regarding the training and inference processes, please refer to our github repository: chen-hao-chao/mdm-prime-v2.


Citing MDM-Prime and MDM-Prime-v2

If you find this repository useful, please consider citing our paper.

@article{chao2026mdmprimev2,
      title = {{MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Compute-optimal Scaling of Diffusion Language Models}}, 
      author = {Chen-Hao Chao, Wei-Fang Sun, Junwei Quan, Chun-Yi Lee, Rahul G. Krishnan},
      year = {2026},
}
@inproceedings{chao2025mdmprime,
      title = {{Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking}}, 
      author = {Chen-Hao Chao, Wei-Fang Sun, Hanwen Liang, Chun-Yi Lee, Rahul G. Krishnan},
      booktitle = {Proceedings of the Conference on Neural Information Processing Systems (NeurIPS)},
      year = {2025},
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Dataset used to train chen-hao-chao/mdm-prime-v2-c4

Collection including chen-hao-chao/mdm-prime-v2-c4

Paper for chen-hao-chao/mdm-prime-v2-c4