ZhitongGao 's Collections
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal
Models
Paper
• 2502.00698
• Published • 24
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Paper
• 2502.01142
• Published • 25
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning
Paper
• 2502.01100
• Published • 21
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning
Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles
Paper
• 2502.01081
• Published • 13
Improved Training Technique for Latent Consistency Models
Paper
• 2502.01441
• Published • 8
PhD Knowledge Not Required: A Reasoning Challenge for Large Language
Models
Paper
• 2502.01584
• Published • 9
COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for
Fine-Grained Understanding and Generation
Paper
• 2502.02589
• Published • 9
Analyze Feature Flow to Enhance Interpretation and Steering in Language
Models
Paper
• 2502.03032
• Published • 60
Great Models Think Alike and this Undermines AI Oversight
Paper
• 2502.04313
• Published • 33
Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive
Modality Alignment
Paper
• 2502.04328
• Published • 29
ConceptAttention: Diffusion Transformers Learn Highly Interpretable
Features
Paper
• 2502.04320
• Published • 36
BOLT: Bootstrap Long Chain-of-Thought in Language Models without
Distillation
Paper
• 2502.03860
• Published • 25