SuperGemma4-E4B-Abliterated-Multimodal GGUF 4bit
This is the GGUF 4-bit distribution of Jiunsong/supergemma4-e4b-abliterated, ready for use with llama.cpp or LM Studio.
Contents:
- The supergemma4-e4b-abliterated model, converted to GGUF and quantized from Jiunsong/supergemma4-e4b-abliterated
- An mmproj file that enables vision (image input), sourced from lmstudio-community/gemma-4-E4B-it-GGUF
- A configuration file (virtual model) for using Reasoning (aka "Think") in LM Studio
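To see which of these pieces a local copy of the repository actually contains, the files can be grouped by name. The sketch below is only an illustration and is not shipped with this repository: the function name `classify_repo_files` is hypothetical, and it assumes the vision projector is a `.gguf` file with `mmproj` in its file name, a common convention that this card does not confirm.

```python
from pathlib import Path


def classify_repo_files(repo_dir):
    """Group files under repo_dir into model weights, vision projector, and other files.

    Assumption: the projector is a .gguf file whose name contains "mmproj".
    """
    root = Path(repo_dir)
    groups = {"weights": [], "mmproj": [], "other": []}
    for path in sorted(root.rglob("*")):
        if not path.is_file():
            continue
        rel = path.relative_to(root)
        if ".git" in rel.parts:
            continue  # skip git internals left behind by `git clone`
        name = path.name.lower()
        if name.endswith(".gguf"):
            key = "mmproj" if "mmproj" in name else "weights"
        else:
            key = "other"
        groups[key].append(rel.as_posix())
    return groups
```

Calling `classify_repo_files` on the cloned folder returns a dictionary with three lists of relative paths; an empty `mmproj` list would suggest that image input will not work with that copy.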
How to Install
This guide covers installing the model in LM Studio on Windows. Before you begin, install and launch LM Studio.
Instructions:
- Open Windows Terminal (PowerShell).
- Change to LM Studio's "Models Directory". (If you set a custom path for your Models Directory, change to that path instead.)
cd $env:USERPROFILE\.lmstudio\models
- Clone this repository into a folder named kanarena.
git clone https://huggingface.co/kanarena/supergemma4-e4b-abliterated-multimodal-gguf-4bit .\kanarena\supergemma4-e4b-abliterated-multimodal-gguf-4bit\
- Move the configuration files (virtual model) to the main models directory.
Move-Item -Path ".\kanarena\supergemma4-e4b-abliterated-multimodal-gguf-4bit\jiunsong" -Destination ".\"
- The model is now available in LM Studio. If it does not appear, refresh the model list.
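After these steps, the models directory should hold both the cloned repository under `kanarena` and the moved `jiunsong` folder. The sketch below checks that layout. It is an illustration under assumptions, not an official LM Studio tool: the function name `check_lmstudio_layout` is hypothetical, the default path mirrors the `cd` command above, and the check only assumes that the repository contains at least one `.gguf` file somewhere inside it.

```python
from pathlib import Path

REPO = "supergemma4-e4b-abliterated-multimodal-gguf-4bit"


def check_lmstudio_layout(models_dir):
    """Return a list of problems found; an empty list means the layout looks right."""
    root = Path(models_dir)
    repo_dir = root / "kanarena" / REPO
    problems = []
    if not repo_dir.is_dir():
        problems.append(f"missing repository folder: {repo_dir}")
    elif not any(repo_dir.rglob("*.gguf")):
        problems.append(f"no .gguf files found under: {repo_dir}")
    if not (root / "jiunsong").is_dir():
        problems.append(f"missing virtual model folder: {root / 'jiunsong'}")
    if (repo_dir / "jiunsong").exists():
        problems.append("jiunsong is still inside the repository folder; move it up")
    return problems


if __name__ == "__main__":
    # Assumed default location; pass your own path if you changed the Models Directory.
    default_dir = Path.home() / ".lmstudio" / "models"
    for line in check_lmstudio_layout(default_dir) or ["layout looks OK"]:
        print(line)
```

If the script reports a missing `jiunsong` folder, the `Move-Item` step was probably skipped or run from a different directory.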
Model tree for kanarena/supergemma4-e4b-abliterated-multimodal-gguf-4bit
Base model: google/gemma-4-E4B