LLaMA 3.2-3B English Therapy GGUF v2

This is a corrected GGUF quantized version of a LLaMA 3.2-3B model fine-tuned for empathetic therapy conversations.

Model Details

Base Model: meta-llama/Llama-3.2-3B-Instruct
Quantization: Q4_K_M
Size: ~1.8GB
Format: GGUF
Use Case: Mental health support chatbot

Usage

from llama_cpp import Llama

llm = Llama(
    model_path="model.gguf",
    n_ctx=2048,
    n_threads=4
)

response = llm(
    "User: I'm feeling anxious today.\nTherapist:",
    max_tokens=200,
    temperature=0.7
)
print(response["choices"][0]["text"])

Disclaimer

This is an AI assistant, not a replacement for professional mental health care.

Downloads last month: 35

GGUF

Model size

3B params

Architecture

llama

Hardware compatibility

We're not able to determine the quantization variants.

View all variants

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for E-motionAssistant/llama-3.2-3b-english-therapy-GGUF-v2

Base model

meta-llama/Llama-3.2-3B-Instruct

Quantized

(439)

this model