13 8

Fei Zhang

SII-Ferenas

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

moonshotai/Kimi-K2.5

liked a dataset about 1 month ago

BAAI/DenseFusion-1M

upvoted a paper about 1 month ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

View all activity

Organizations

None yet

liked a model about 1 month ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated Feb 27 • 5.56M • • 2.46k

liked a dataset about 1 month ago

BAAI/DenseFusion-1M

Viewer • Updated Oct 17, 2024 • 1.18M • 793 • 40

upvoted a paper about 1 month ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 118

liked a model 2 months ago

robbyant/lingbot-world-base-cam

Image-to-Video • Updated Feb 2 • 330

upvoted a paper 2 months ago

Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation

Paper • 2602.02214 • Published Feb 2 • 24

upvoted a paper 3 months ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 322

upvoted 2 papers 4 months ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 161

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 74

upvoted a paper 5 months ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 134

upvoted 2 papers 7 months ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 189

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

Paper • 2509.09286 • Published Sep 11, 2025 • 11

upvoted 2 papers 8 months ago

Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation

Paper • 2508.18032 • Published Aug 25, 2025 • 41

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 217

upvoted 2 papers 9 months ago

VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues

Paper • 2502.12084 • Published Feb 17, 2025 • 35

ConText: Driving In-context Learning for Text Removal and Segmentation

Paper • 2506.03799 • Published Jun 4, 2025 • 1

liked a model 9 months ago

SII-Ferenas/ConText

Updated Jul 17, 2025 • 2

updated a model 9 months ago

SII-Ferenas/ConText

Updated Jul 17, 2025 • 2

published a model 9 months ago

SII-Ferenas/ConText

Updated Jul 17, 2025 • 2

liked a dataset about 1 year ago

lmms-lab/LLaVA-NeXT-Interleave-Bench

Viewer • Updated Aug 9, 2024 • 38.7k • 630 • 16

liked a model about 1 year ago

microsoft/wham

Updated Dec 17, 2025 • 100 • 267

Fei Zhang

AI & ML interests

Recent Activity

Organizations

SII-Ferenas's activity