Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning Paper • 2407.18248 • Published Jul 25, 2024 • 33
AutoFigure-Edit: Generating Editable Scientific Illustration Paper • 2603.06674 • Published Mar 3 • 19
Faster Video Diffusion with Trainable Sparse Attention Paper • 2505.13389 • Published May 19, 2025 • 38
Fast Video Generation with Sliding Tile Attention Paper • 2502.04507 • Published Feb 6, 2025 • 51
TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T Text Generation • Updated Sep 27, 2024 • 83.4k • • 187
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning Paper • 2407.18248 • Published Jul 25, 2024 • 33
Learning Multi-Step Reasoning by Solving Arithmetic Tasks Paper • 2306.01707 • Published Jun 2, 2023 • 2
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models Paper • 2407.12772 • Published Jul 17, 2024 • 35
TinyLlama/TinyLlama-1.1B-Chat-v1.0 Text Generation • 1B • Updated Mar 17, 2024 • 3.01M • 1.57k