Balazs Bedo

Vaporeonn

AI & ML interests

None yet

Recent Activity

upvoted an article 5 days ago

Falcon Perception

liked a model 25 days ago

Subh775/Threat-Detection-RFDETR

liked a model 4 months ago

mistralai/Devstral-Small-2-24B-Instruct-2512

Organizations

None yet

upvoted an article 5 days ago

Article

Falcon Perception

18 days ago

upvoted 2 papers 7 months ago

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 100

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20, 2025 • 164

upvoted a paper 9 months ago

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Paper • 2507.07990 • Published Jul 10, 2025 • 46

upvoted 2 articles 10 months ago

Article

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

Jun 27, 2025

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Jun 21, 2025

upvoted a collection 10 months ago

V-JEPA 2

Collection

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 206

upvoted 2 articles 11 months ago

Article

A Dive into Vision-Language Models

Feb 3, 2023

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21, 2025 • 255

upvoted a collection 11 months ago

MedGemma Release

Collection

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated Mar 12 • 479

upvoted an article 11 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12, 2025 • 606

upvoted a paper 11 months ago

D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement

Paper • 2410.13842 • Published Oct 17, 2024 • 6
