QORA-4B is a 4-billion parameter language model with built-in vision. Pure Rust multimodal inference engine build on Qwen3.5-4B

#7
by drdraq - opened

Pure Rust multimodal inference engine. No Python, no CUDA, no external ML frameworks. Single executable + model weights = portable AI that runs on any machine.

Try: https://huggingface.co/qoranet/QORA-4B

Text generation

qor4b --prompt "Explain quantum computing" --max-tokens 500

Image understanding

qor4b --prompt "What's in this image?" --image photo.jpg

Video understanding (directory of frame images)

qor4b --prompt "What happens in this video?" --video frames_dir/

Thinking mode (default, extended reasoning)

qor4b --prompt "Solve: integral of x^2 * e^x dx" --think-budget 2048

No-think mode (faster, direct answers)

qor4b --prompt "What is 2+2?" --no-think

Greedy decoding (deterministic output)

qor4b --prompt "Hello" --greedy

Sign up or log in to comment