Submitted by akhaliq 128 The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery · 6 authors 13.2k 11
Submitted by akhaliq 73 Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers · 6 authors 3.42k 9
Submitted by akhaliq 55 ControlNeXt: Powerful and Efficient Control for Image and Video Generation · 6 authors 1.64k 8
Submitted by akhaliq 38 CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer · 19 authors 12.6k 6
Submitted by akhaliq 18 FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework · 4 authors 325 2
Submitted by akhaliq 17 VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents · 30 authors 263 3
Submitted by akhaliq 15 HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors · 12 authors 2
Submitted by akhaliq 14 UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization · 3 authors 275 5
Submitted by akhaliq 9 Body Transformer: Leveraging Robot Embodiment for Policy Learning · 5 authors 187 2
Submitted by mamaj92 9 Your Context Is Not an Array: Unveiling Random Access Limitations in Transformers · 3 authors 2