metadata
license: apache-2.0
base_model: meta-llama/Llama-3.1-8B-Instruct
tags:
  - gguf
  - q4_k_m
  - quantized

Llama Q4_K_M GGUF

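A Q4_K_M GGUF quantization of Junn17/llama, produced with Unsloth. The metadata above lists meta-llama/Llama-3.1-8B-Instruct as the base model.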

Usage

Run the quantized model with llama-cli from llama.cpp:

llama-cli --model llama_model.Q4_K_M.gguf -p "Hello!"
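
An interrupted download can leave a truncated or non-model file under the expected name. The sketch below is one possible guard, not part of the original instructions: it checks that the file starts with the 4-byte GGUF magic (`GGUF`) before handing it to llama-cli. The helper names `is_gguf` and `run_model` are hypothetical, and the check assumes a `head` that supports `-c`.

```shell
# Hypothetical helper: succeed only if the file exists and begins
# with the 4-byte GGUF magic "GGUF".
is_gguf() {
    [ -f "$1" ] || return 1
    # Read the first 4 bytes; strip NULs so the shell does not warn about them.
    magic=$(head -c 4 "$1" 2>/dev/null | tr -d '\0')
    [ "$magic" = "GGUF" ]
}

# Hypothetical wrapper: validate the model file, then pass the
# remaining arguments through to llama-cli unchanged.
run_model() {
    model="$1"
    shift
    if is_gguf "$model"; then
        llama-cli --model "$model" "$@"
    else
        echo "Not a GGUF file (missing or incomplete download?): $model" >&2
        return 1
    fi
}
```

With these functions defined in the current shell, `run_model llama_model.Q4_K_M.gguf -p "Hello!"` behaves like the command above but fails early with a message if the file is missing or is not a GGUF file. The magic check only confirms the file header, so it does not detect a file that is truncated further in.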