1 7 7

Yan Fang

YanFang

YanFangCS

AI & ML interests

Computer Vision, Incremental Learning, semi-supervised learning

Recent Activity

upvoted a paper 14 days ago

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

upvoted a paper about 2 months ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

liked a Space 4 months ago

HuggingFaceM4/FineVision

View all activity

Organizations

None yet

upvoted a paper 14 days ago

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

Paper • 2603.29664 • Published 15 days ago • 48

upvoted a paper about 2 months ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published Feb 9 • 52

liked a Space 4 months ago

FineVision: Open Data is All You Need

📝

221

A new open-source dataset for training VLMs

upvoted a paper 4 months ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published Dec 10, 2025 • 74

upvoted an article 4 months ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30, 2025

•

liked a Space 5 months ago

Smol Training Playbook - Table of Contents

📚

New activity in internlm/CapRL-2M 6 months ago

The annotated captions in CapRL-2M have Chinese and English mixed cases.

#3 opened 6 months ago by

YanFang

upvoted a paper 6 months ago

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9, 2025 • 127

liked a model 6 months ago

Qwen/Qwen3-VL-30B-A3B-Thinking-FP8

Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 3.77k • 53

liked a model 7 months ago

Qwen/Qwen3-VL-235B-A22B-Thinking

Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 327k • • 387

liked a model 8 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 229k • • 2.46k

upvoted an article 12 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

•

191

updated a dataset about 1 year ago

YanFang/sc-dense-wds

Viewer • Updated Apr 13, 2025 • 5.08M • 4

published a dataset about 1 year ago

YanFang/sc-dense-wds

Viewer • Updated Apr 13, 2025 • 5.08M • 4

upvoted a paper about 1 year ago

Video-T1: Test-Time Scaling for Video Generation

Paper • 2503.18942 • Published Mar 24, 2025 • 90

updated 2 datasets about 1 year ago

YanFang/sc-wds

Viewer • Updated Mar 13, 2025 • 17.9M • 104

YanFang/dense-sc-wds

Viewer • Updated Mar 7, 2025 • 4.56M • 3

liked a model about 1 year ago

speedinghzl/omni-superclass

Updated May 10, 2025 • 1