QuantFactory/Replete-LLM-gemma-9b-test-merged-250k-GGUF
This is a quantized version of Replete-AI/Replete-LLM-gemma-9b-test-merged-250k, created using llama.cpp.
Original Model Card
Test on the Replete-AI/Everything_Instruct_Mini_Multilingual dataset.
Prompt template:
<start_of_turn>system
{}<end_of_turn>
<start_of_turn>user
{}<end_of_turn>
<start_of_turn>model
{}
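The template above has three `{}` slots: system message, user message, and model response. Below is a minimal Python sketch of filling it for inference. The helper name `build_prompt`, the example messages, and the choice to leave the response slot empty so the model generates it are illustrative assumptions, not part of the original card; the newline placement is read directly from the template layout.

```python
# Prompt template copied from the model card; each "{}" is a slot.
PROMPT_TEMPLATE = (
    "<start_of_turn>system\n"
    "{}<end_of_turn>\n"
    "<start_of_turn>user\n"
    "{}<end_of_turn>\n"
    "<start_of_turn>model\n"
    "{}"
)


def build_prompt(system: str, user: str, response: str = "") -> str:
    """Fill the template (hypothetical helper, not from the model card).

    Leaving `response` empty yields a prompt that ends right after the
    model turn marker, so the model's completion fills the last slot.
    """
    return PROMPT_TEMPLATE.format(system, user, response)


# Example messages are placeholders chosen for illustration.
prompt = build_prompt(
    "You are a helpful assistant.",
    "Translate 'good morning' into French.",
)
print(prompt)
```

The resulting string can be passed as the raw prompt to whichever GGUF runtime is used. Passing a non-empty `response` instead produces a complete training-style example.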
Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit