quantize_llama2 / README.md
metadata
license: llama2
base_model: meta-llama/Llama-2-7b-chat-hf
tags:
  - gguf
  - q4_k_m
  - quantized

Llama-2-7B-Chat Q4_K_M GGUF

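A 4-bit Q4_K_M quantization in GGUF format, produced from Junn17/llama using Unsloth. The metadata lists meta-llama/Llama-2-7b-chat-hf as the base model.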

Usage

llama-cli --model llama-2-7b-chat.Q4_K_M.gguf -p "Hello!"
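
The command above can be wrapped in a small script that checks for the model file before launching. This is a minimal sketch, assuming `llama-cli` from llama.cpp is on your PATH and the GGUF file sits in the current directory. The `run_model` function and the `MODEL` and `PROMPT` variables are hypothetical names chosen for illustration, and the token and context limits are arbitrary example values.

```shell
#!/bin/sh
# Hypothetical wrapper around the usage command; names and limits are illustrative.
MODEL="${MODEL:-llama-2-7b-chat.Q4_K_M.gguf}"
PROMPT="${PROMPT:-Hello!}"

run_model() {
  # Fail early with a clear message if the GGUF file is missing.
  if [ ! -f "$MODEL" ]; then
    echo "model file not found: $MODEL" >&2
    return 1
  fi
  # -n caps the number of generated tokens; -c sets the context size in tokens.
  llama-cli --model "$MODEL" -p "$PROMPT" -n 128 -c 2048
}
```

Call `run_model` after the definitions to start generation. Set `MODEL` or `PROMPT` in the environment to override the defaults.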