-
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper • 2512.11253 • Published • 40 -
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper • 2601.00393 • Published • 133 -
Agent READMEs: An Empirical Study of Context Files for Agentic Coding
Paper • 2511.12884 • Published • 28 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 156
MN
ma1664
·
AI & ML interests
None yet
Recent Activity
updated a collection 28 days ago
Papers updated a collection about 2 months ago
Models updated a collection about 2 months ago
ModelsOrganizations
None yet
Spaces
- Configuration errorFeatured446
FastVLM WebGPU
🍎446Real-time video captioning powered by FastVLM
- Running on ZeroMCPFeatured2.17k
Qwen Image Edit Camera Control
🎬2.17kFast 4 step inference with Qwen Image Edit 2509
- Running on ZeroFeatured404
Depth Anything 3
🏢404Generate depth maps from your photos
Papers
-
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper • 2512.11253 • Published • 40 -
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
Paper • 2601.00393 • Published • 133 -
Agent READMEs: An Empirical Study of Context Files for Agentic Coding
Paper • 2511.12884 • Published • 28 -
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Paper • 2503.11576 • Published • 156
Spaces
- Configuration errorFeatured446
FastVLM WebGPU
🍎446Real-time video captioning powered by FastVLM
- Running on ZeroMCPFeatured2.17k
Qwen Image Edit Camera Control
🎬2.17kFast 4 step inference with Qwen Image Edit 2509
- Running on ZeroFeatured404
Depth Anything 3
🏢404Generate depth maps from your photos
models 0
None public yet
datasets 0
None public yet