Drop residual 'Ministral' references in DLM config docstring
#4
by mkhadkevich - opened
Follow-up to the Ministral DLM → Nemotron Labs Diffusion rebrand (PR #2).
The class-level docstring and the vocab_size arg description in
configuration_nemotron_labs_diffusion.py still referred to a
Ministral3Model / 'the Ministral model'. Re-point them at
NemotronLabsDiffusionEncoderModel / 'the model' so the public docstring
matches the renamed config class.
Base-class references (Ministral3Model, Ministral3Attention, ...) in
modeling_nemotron_labs_diffusion.py and modeling_ministral.py are kept on
purpose: the diffusion model wraps the base Ministral3 LLM architecture.
The base TP-plan comment on line 120 of the config also stays for the same
reason.
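
For anyone who wants to re-check for leftover mentions, here is a minimal sketch of a scan that flags 'Ministral' in a file while skipping identifiers kept on purpose. The helper name find_residual_mentions, the allow-list, and the sample text (including the NemotronLabsDiffusionConfig class name) are hypothetical and only illustrate the idea; they are not copied from the repo.

```python
import re


def find_residual_mentions(source, needle="Ministral", allowed=()):
    """Return (line_number, text) pairs that mention `needle`
    outside of the explicitly allowed identifiers."""
    pattern = re.compile(re.escape(needle), flags=re.IGNORECASE)
    hits = []
    for number, line in enumerate(source.splitlines(), start=1):
        # Blank out identifiers that are kept intentionally before searching.
        scrubbed = line
        for name in allowed:
            scrubbed = scrubbed.replace(name, "")
        if pattern.search(scrubbed):
            hits.append((number, line.strip()))
    return hits


# Hypothetical sample resembling a config file before the fix.
sample = '''class NemotronLabsDiffusionConfig:
    """Configuration for a Ministral3Model.

    vocab_size: Vocabulary size of the Ministral model.
    """
    # base TP plan inherited from Ministral3Attention
'''

# Base-class names that are meant to stay (assumed allow-list).
hits = find_residual_mentions(sample, allowed=("Ministral3Attention",))
for number, text in hits:
    print(f"line {number}: {text}")
```

With this sample, only the two docstring lines are reported; the base-class comment is skipped because its identifier is on the allow-list.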
MMaghoumi changed pull request status to merged