Model name:
MN-12B-Lyra-v4

Brief description:
A finetune of Mistral Nemo by Sao10K.
Uses the ChatML prompt format.

Presets:
You can use the built in ChatML presets within SillyTavern and adjust from there.
Alternatively, check out Virt-io's ChatML v1.9 presets here, make sure you read the repository page for how to use them properly.

Request page:
https://huggingface.co/Lewdiculous/Model-Requests/discussions/75

Model link:
https://huggingface.co/Sao10K/MN-12B-Lyra-v4

Quantized with llama.cpp:
b3707

Downloads last month: 8,030

GGUF

Model size

12B params

Architecture

llama

Hardware compatibility

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Lewdiculous/MN-12B-Lyra-v4-GGUF-IQ-Imatrix

Base model

Sao10K/MN-12B-Lyra-v4

Quantized

(21)

this model

Collection including Lewdiculous/MN-12B-Lyra-v4-GGUF-IQ-Imatrix

Quantized Models (GGUF, IQ, Imatrix)

Collection

Various GGUF quantizations of small models. Models with a "checkmark" are personal favorites. An "orange arrow" means it's being uploaded. • 97 items • Updated Mar 2 • 71