metadata
title: VRFAI
emoji: 🚀
colorFrom: blue
colorTo: yellow
sdk: static
pinned: false
VRFAI: Edge AI & Model Optimization
We optimize and deploy LLMs, ASR, VLM and VLA (Vision-Language-Action) models on real-world systems.
🧠 What we do
- Optimization: quantization (INT8/INT4/FP8/NVFP4), pruning, distillation, ...
- Deployment: vLLM, TensorRT, ONNX Runtime, edge runtimes
- Systems: real-time pipelines (vision, audio, language, action)
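
As a rough illustration of the quantization item above, here is a minimal sketch of symmetric per-tensor INT8 quantization in plain Python. It is not VRFAI's actual pipeline; the function names `quantize_int8` and `dequantize_int8` and the sample weights are hypothetical, chosen only to show the idea of mapping floats to 8-bit codes with a single scale.

```python
def quantize_int8(values):
    """Symmetric per-tensor INT8 quantization: returns (integer codes, scale)."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # Map the largest magnitude to 127; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest code and clamp to the symmetric INT8 range.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize_int8(codes, scale):
    """Recover approximate float values from INT8 codes."""
    return [c * scale for c in codes]


# Hypothetical weights, for illustration only.
weights = [0.50, -1.27, 0.03, 0.90]
codes, scale = quantize_int8(weights)
restored = dequantize_int8(codes, scale)
# Worst-case round-trip error; bounded by roughly half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
```

Real deployments typically use per-channel or per-group scales and calibration data, which this sketch leaves out.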
🎯 Focus
- Edge devices (Jetson, SoCs)
- Robotics & VLA systems
- Latency, stability, deployability
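
One common way to make "latency" and "stability" concrete is to report median and tail latency over repeated runs. The sketch below assumes a hypothetical `fake_inference` stand-in for a real model call; the helper names `measure_latency_ms` and `percentile` are illustrative, not part of any VRFAI tooling.

```python
import math
import time


def percentile(sorted_samples, q):
    """Nearest-rank percentile of an ascending list, with q in (0, 100]."""
    rank = max(1, math.ceil(q / 100.0 * len(sorted_samples)))
    return sorted_samples[rank - 1]


def measure_latency_ms(fn, warmup=5, runs=50):
    """Call fn repeatedly and return the per-call latencies in ms, sorted ascending."""
    for _ in range(warmup):
        fn()  # warm-up calls are not timed
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1000.0)
    return sorted(samples)


def fake_inference():
    # Hypothetical stand-in for a real model call.
    sum(i * i for i in range(10_000))


samples = measure_latency_ms(fake_inference)
p50 = percentile(samples, 50)  # typical latency
p99 = percentile(samples, 99)  # tail latency; a large p99/p50 gap suggests jitter
```

On an edge device, the gap between `p50` and `p99` is often as informative as the median itself, since real-time pipelines are limited by their slowest frames.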
⚡ Philosophy
Optimization = model + runtime + system
VRFAI: making AI models fast, efficient, and real