Joe
Joe57005
AI & ML interests
None yet
Recent Activity
- updated a collection 7 days ago: Models to try
- upvoted a collection 9 days ago: APEX Quants (GGUF)
- updated a collection 11 days ago: Models to try
Organizations
None yet
For MOE 1.5B
Models to try
-
bunnycore/Gemma2-2b-function-calling-lora
Updated • 1
-
NickyNicky/gemma-2b-it_oasst2_all_chatML_function_calling_Agent_v1
Text Generation • 3B • Updated • 26 • 1
-
hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF
Text Generation • 1B • Updated • 833k • 46
-
gorilla-llm/gorilla-openfunctions-v2
Text Generation • Updated • 801 • 245
For finetune
-
glaiveai/glaive-function-calling-v2
Viewer • Updated • 113k • 16.3k • 498
-
Chat Template Editor
💬 View, edit, test and submit Chat Templates
Running • 17
-
GGUF Editor
🏢 Edit GGUF model metadata from Hugging Face or local files
Running • 95
-
0xSero/glm47-reap-calibration-v2
Viewer • Updated • 1.36k • 12 • 3
Good for home automation
Large-context LLMs that work well with Home Assistant via a llama.cpp server running on CPU with 16 GB of RAM.
-
inclusionAI/Ling-mini-2.0
Text Generation • 16B • Updated • 17k • 190
-
Orion-zhen/Qwen3-30B-A3B-Instruct-2507-IQK-GGUF
31B • Updated • 114 • 1
-
Intel/Qwen3-30B-A3B-Instruct-2507-gguf-q2ks-mixed-AutoRound
31B • Updated • 85 • 26
-
Tiiny/SmallThinker-21BA3B-Instruct
Text Generation • 22B • Updated • 161 • 111
LLM Tools
-
GGUF Editor
🏢 Edit GGUF model metadata from Hugging Face or local files
Running • 95
-
mergekit-gui
🔀 Merge AI models using a YAML configuration file
Runtime error • Featured • 290
-
GGUF My Repo
🦙 Quantize Hugging Face models to GGUF and publish repo
Running on A10G • 1.91k
-
SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs
Paper • 2512.04746 • Published • 14