How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf THChou1220/gemma4-e2b-webvid4K_FT:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf THChou1220/gemma4-e2b-webvid4K_FT:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf THChou1220/gemma4-e2b-webvid4K_FT:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf THChou1220/gemma4-e2b-webvid4K_FT:Q4_K_M
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf THChou1220/gemma4-e2b-webvid4K_FT:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf THChou1220/gemma4-e2b-webvid4K_FT:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf THChou1220/gemma4-e2b-webvid4K_FT:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf THChou1220/gemma4-e2b-webvid4K_FT:Q4_K_M
Use Docker
docker model run hf.co/THChou1220/gemma4-e2b-webvid4K_FT:Q4_K_M
Quick Links

gemma4-e2b-webvid4K_FT

Full fine-tune of google/gemma-4-e2b-it on AI-generated video data derived from WebVid.

Training

  • Dataset: bear7011/gemma-4-e4b-webvid-4K
  • Samples: 3,941 video instruction examples
  • Method: full fine-tuning, no LoRA
  • Precision: bfloat16
  • GPUs: 4
  • DeepSpeed: ZeRO-3 with CPU optimizer and parameter offload
  • Epochs: 1
  • Global steps: 124
  • Per-device batch size: 1
  • Gradient accumulation steps: 8
  • Optimizer: AdamW
  • Learning rate: 5e-6
  • Projector learning rate: 5e-6
  • Image encoder learning rate: 0.0
  • Weight decay: 0.01
  • Warmup ratio: 0.03
  • LR scheduler: cosine
  • Gradient checkpointing: enabled
  • Max sequence length: 2304
  • Final training loss: 2.3344

Checkpoints and training logs are not included in this repository.

Downloads last month
-
Safetensors
Model size
5B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support