QORA-4B is a 4-billion parameter language model with built-in vision. Pure Rust multimodal inference engine build on Qwen3.5-4B
#7
by drdraq - opened
Pure Rust multimodal inference engine. No Python, no CUDA, no external ML frameworks. Single executable + model weights = portable AI that runs on any machine.
Try: https://huggingface.co/qoranet/QORA-4B
Text generation
qor4b --prompt "Explain quantum computing" --max-tokens 500
Image understanding
qor4b --prompt "What's in this image?" --image photo.jpg
Video understanding (directory of frame images)
qor4b --prompt "What happens in this video?" --video frames_dir/
Thinking mode (default, extended reasoning)
qor4b --prompt "Solve: integral of x^2 * e^x dx" --think-budget 2048
No-think mode (faster, direct answers)
qor4b --prompt "What is 2+2?" --no-think
Greedy decoding (deterministic output)
qor4b --prompt "Hello" --greedy