I-DLM-8B-LoRA-r128

LoRA adapter (rank=128) for I-DLM-8B, enabling lossless Relaxed Introspective Strided Decoding (R-ISD).

This model was presented in the paper Introspective Diffusion Language Models.

[Project Page] [Paper] [Code]

Overview

This LoRA adapter is used in the gated LoRA setup described in the I-DLM paper:

  • LoRA is active only on MASK positions during ISD decoding
  • Clean/verify positions use base model weights only
  • This produces output bit-for-bit identical to the base AR model (Qwen3-8B)
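The gating described above can be sketched in a few lines. This is a minimal illustration of the idea (not the actual I-DLM implementation): the LoRA delta is computed as usual, then zeroed everywhere except MASK positions, so clean/verify positions see exactly the base model's activations. The function name, shapes, and `scaling` factor are illustrative assumptions.

```python
import torch

def gated_lora_delta(x, lora_A, lora_B, mask_positions, scaling):
    """Hypothetical sketch of a gated LoRA update.

    x: (seq, d_in) activations
    lora_A: (r, d_in), lora_B: (d_out, r) low-rank factors
    mask_positions: (seq,) bool tensor, True where the token is a MASK
    """
    # Standard low-rank LoRA path: B @ (A @ x), scaled.
    delta = (x @ lora_A.T) @ lora_B.T * scaling
    # Gate: keep the update only on MASK positions; clean/verify
    # positions receive a zero delta, i.e. base weights only.
    return delta * mask_positions.unsqueeze(-1).to(delta.dtype)
```

Because the delta is exactly zero off the MASK positions, the clean/verify tokens are processed identically to the base model, which is what makes the decoded output bit-for-bit identical to the base AR model.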

Usage

Note: This LoRA adapter is designed to be used with the I-DLM-8B base model via our SGLang-based serving pipeline. Loading it directly with transformers will not reproduce the paper results; please use the SGLang-based ISD pipeline for inference.
