Lemrd – Gemma 4 31B Dense – MLX 4-bit

Lemrd is the largest dense member of the Lemma model family by Lethean: an EUPL-1.2 fork of Gemma 4 31B with the Lethean Ethical Kernel (LEK) merged into the weights.

This repo hosts the MLX 4-bit build for native Apple Silicon inference via mlx-lm and mlx-vlm. For the GGUF playground (Ollama, llama.cpp) see lthn/lemrd. For the unmodified Google base see LetheanNetwork/lemrd.
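
A minimal sketch of running this build locally, assuming the standard `mlx_lm` Python API (`load` / `generate`) and this repo id; the prompt text is illustrative only:

```python
# Minimal sketch: text generation with the MLX 4-bit build via mlx-lm.
# Assumes Apple Silicon and `pip install mlx-lm`; repo id taken from this card.
from mlx_lm import load, generate

model, tokenizer = load("lthn/lemrd-mlx")

prompt = "Explain the Lethean Ethical Kernel in two sentences."

# Wrap the prompt in the chat template if the tokenizer ships one
# (Gemma-style instruct formatting).
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

response = generate(model, tokenizer, prompt=prompt, max_tokens=256, verbose=True)
```

The equivalent from the shell is `mlx_lm.generate --model lthn/lemrd-mlx --prompt "..."`, assuming a recent mlx-lm release that installs the console scripts.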

Family

| Repo | Format | Bits |
|---|---|---|
| lthn/lemrd | GGUF (multi-quant) | Q4_K_M → BF16 |
| lthn/lemrd-mlx | MLX | 4-bit |
| lthn/lemrd-mlx-8bit | MLX | 8-bit |
| lthn/lemrd-mlx-bf16 | MLX | bf16 |

License

EUPL-1.2. The upstream base model remains subject to the Gemma Terms of Use.
