Llama_ChatSFT

Mini-Llama 8B Chat - 0129 - GGUF

My instruct model has undergone DoRA SFT on my custom synthetic chat dataset, containing single and multi-round chats containing SFW, NSFW, and Toxic single and multi-round chats. This reinforces the model's uncensored compliance with all prompts and teaches it how to better fill roles assigned to it in the system prompt.

This model has yet to go through DPO preference training and may still have rough edges.

** Be aware that this adapter, when used without a system prompt to assign it a role may make up its own role. Meaning if you just say 'Hello' it could resond with 'Hello, how may I assist you?' or it might respond with something like "Hi, my name is Carol and I'm a librarian here to assist you with finding the book you're looking for."

For the base pretrain, see: Nabbers1999/Mini-Llama-8B-Base-0124

For the instruct, see: Nabbers1999/Mini-Llama-8B-Instruct-0124

Downloads last month
15
GGUF
Model size
8B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Nabbers1999/Mini-Llama-8B-Chat-SFT-0129-GGUF

Collection including Nabbers1999/Mini-Llama-8B-Chat-SFT-0129-GGUF