Sports Video Understanding Benchmarks
AI & ML interests
Computer Vision; Video Understanding; Action Recognition
Recent Activity
View all activity
Papers
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
SAM 2++: Tracking Anything at Any Granularity
-
MCG-NJU/LongVPO-Stage1-InternVL3-8B
Video-Text-to-Text • 8B • Updated • 9 -
MCG-NJU/LongVPO-Stage2-InternVL3-8B
Video-Text-to-Text • 8B • Updated • 7 -
MCG-NJU/LongVPO-Training-Data
Viewer • Updated • 14.5k • 20 -
LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization
Paper • 2602.02341 • Published • 1
-
MCG-NJU/SteadyDancer-14B
Image-to-Video • Updated • 500 • 69 -
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
Paper • 2511.19320 • Published • 43 -
MCG-NJU/X-Dance
Viewer • Updated • 36 • 328 • 18 -
MCG-NJU/SteadyDancer-GGUF
Image-to-Video • 16B • Updated • 554 • 23
VideoMAE Pre-trained Models
-
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Paper • 2203.12602 • Published • 3 -
MCG-NJU/videomae-base
Video Classification • 94.2M • Updated • 63.7k • 51 -
MCG-NJU/videomae-base-finetuned-kinetics
Video Classification • 86.5M • Updated • 34.7k • 48 -
MCG-NJU/videomae-base-finetuned-ssv2
Video Classification • Updated • 1.37k • 7
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
Learning Human Skill Generators at Key-Step Levels
CaReBench data, CaRe models and all the contrastively trained MLLMs (including InternVL2, MiniCPM-V 2.6, LLaVA NeXT Video, Qwen2-VL and Tariser).
Sports Video Understanding Benchmarks
VideoMAE Pre-trained Models
-
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Paper • 2203.12602 • Published • 3 -
MCG-NJU/videomae-base
Video Classification • 94.2M • Updated • 63.7k • 51 -
MCG-NJU/videomae-base-finetuned-kinetics
Video Classification • 86.5M • Updated • 34.7k • 48 -
MCG-NJU/videomae-base-finetuned-ssv2
Video Classification • Updated • 1.37k • 7
-
MCG-NJU/LongVPO-Stage1-InternVL3-8B
Video-Text-to-Text • 8B • Updated • 9 -
MCG-NJU/LongVPO-Stage2-InternVL3-8B
Video-Text-to-Text • 8B • Updated • 7 -
MCG-NJU/LongVPO-Training-Data
Viewer • Updated • 14.5k • 20 -
LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization
Paper • 2602.02341 • Published • 1
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
-
MCG-NJU/SteadyDancer-14B
Image-to-Video • Updated • 500 • 69 -
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
Paper • 2511.19320 • Published • 43 -
MCG-NJU/X-Dance
Viewer • Updated • 36 • 328 • 18 -
MCG-NJU/SteadyDancer-GGUF
Image-to-Video • 16B • Updated • 554 • 23
Learning Human Skill Generators at Key-Step Levels
CaReBench data, CaRe models and all the contrastively trained MLLMs (including InternVL2, MiniCPM-V 2.6, LLaVA NeXT Video, Qwen2-VL and Tariser).