Add ROCM_ACCELERATION.md β deep dive into ROCm model acceleration libraries and how they relate to StreamingVLM f6c261b verified s23deepak commited on about 13 hours ago
Add EXPLAINED.md β deep dive into StreamingVLM architecture and patching methodology 0ae32a3 verified s23deepak commited on about 14 hours ago
Update README: add Qwen3-VL-2B support, CUDA flash_attention_2, dual platform guide f1156c0 verified s23deepak commited on about 14 hours ago
Add Qwen3-VL-2B support + CUDA flash_attention_2 config 189031e verified s23deepak commited on about 14 hours ago
Add pos_emb.py β contiguous RoPE for infinite streams 4306b5f verified s23deepak commited on about 14 hours ago
Add model_forward.py β top-level streaming forward af7a516 verified s23deepak commited on about 14 hours ago
Add vision_forward.py β chunked SDPA for vision encoder c555dfc verified s23deepak commited on about 14 hours ago
Add language_forward.py β SDPA attention for text decoder d788786 verified s23deepak commited on about 14 hours ago
Add requirements, setup, and training files b7cd7a3 verified s23deepak commited on about 14 hours ago
Initial commit: StreamingVLM port for Qwen3-VL 4B on ROCm 2f4b97d verified s23deepak commited on about 14 hours ago