typhoon-s-thaillm-8b-instruct-research-preview - GGUF

About

For a convenient overview and download list, visit our model page.

If you are unsure how to use GGUF files, refer to the llama.cpp documentation for more details.

./llama-cli -m typhoon-s-thaillm-8b-instruct-research-preview-q4_k_m.gguf -p "Hello!"

(sorted by size, not necessarily quality)

Link	Type	Size/GB	Notes
GGUF	q2_k	3.06	very low quality, for testing
GGUF	q3_k_m	3.84
GGUF	q4_0	4.45
GGUF	q4_k_m	4.68	recommended, good balance
GGUF	q5_k_m	5.45
GGUF	q8_0	8.11	near-full precision

Special thanks to the llama.cpp team for their amazing work.

GGUF

Model size

8B params

Architecture

qwen3

Hardware compatibility

2-bit

3-bit

4-bit

5-bit

8-bit

Base model

Finetuned

Quantized

(3)

this model