Elizaveta Goncharova PRO
Elizaveta
AI & ML interests
None yet
Recent Activity
liked a dataset 3 days ago
allenai/olmOCR-bench
upvoted a collection 3 days ago
NVIDIA EGM
upvoted a collection 3 days ago
Reasoning
Organizations
None yet
LLM-tokenizers
Reasoning
- Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought
  Paper • 2501.04682 • Published • 99
- URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics
  Paper • 2501.04686 • Published • 53
- Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models
  Paper • 2512.13607 • Published • 38
Multimodal-Grounding
- Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
  Paper • 2501.05767 • Published • 29
- An Empirical Study of Autoregressive Pre-training from Videos
  Paper • 2501.05453 • Published • 41
- Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
  Paper • 2501.04001 • Published • 47
- MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives
  Paper • 2512.14699 • Published • 28
Diffusion Multimodality
LLM
Multimodal-Models
Multimidality-Analysis