49 8

Shuai Liu

Choiszt

https://github.com/choiszt

Choiszt

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago

Choiszt/FileGram

upvoted a paper 5 days ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

upvoted a paper 10 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

View all activity

Organizations

upvoted a paper 5 days ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 10 days ago • 182

upvoted a paper 10 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 12 days ago • 233

upvoted 2 papers 11 days ago

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Paper • 2604.04901 • Published 12 days ago • 40

A Simple Baseline for Streaming Video Understanding

Paper • 2604.02317 • Published 16 days ago • 72

upvoted 2 papers 16 days ago

HippoCamp: Benchmarking Contextual Agents on Personal Computers

Paper • 2604.01221 • Published 16 days ago • 29

PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning

Paper • 2603.26653 • Published 21 days ago • 18

upvoted a paper 21 days ago

Proact-VL: A Proactive VideoLLM for Real-Time AI Companions

Paper • 2603.03447 • Published Mar 3 • 37

upvoted a paper 25 days ago

Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2603.18118 • Published about 1 month ago • 12

upvoted 3 papers about 1 month ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 369

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 152

ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors

Paper • 2603.04338 • Published Mar 4 • 24

upvoted a paper about 2 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

upvoted 2 papers 2 months ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published Feb 9 • 52

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Paper • 2602.08439 • Published Feb 9 • 28

upvoted a paper 3 months ago

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Paper • 2601.22153 • Published Jan 29 • 74

upvoted a paper 4 months ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published Dec 22, 2025 • 67

upvoted 4 papers 5 months ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 96

Shuai Liu

AI & ML interests

Recent Activity

Organizations

Choiszt's activity