$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners Paper • 2603.04304 • Published Mar 4 • 14
V_1: Unifying Generation and Self-Verification for Parallel Reasoners Paper • 2603.04304 • Published Mar 4 • 14
causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new Viewer • Updated Sep 21, 2025 • 847 • 6
causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new Viewer • Updated Sep 21, 2025 • 847 • 6
causal-rewards/ultrafeedback_60658_pref_dataset_original_plus_filtered_improved_degraded_attimp_threshold0p2 Viewer • Updated Jul 3, 2025 • 920k • 6
causal-rewards/ultrafeedback_60658_pref_dataset_original_plus_filtered_improved_degraded_attimp_threshold0p2 Viewer • Updated Jul 3, 2025 • 920k • 6
GraPE: A Generate-Plan-Edit Framework for Compositional T2I Synthesis Paper • 2412.06089 • Published Dec 8, 2024 • 4
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages Paper • 2404.16816 • Published Apr 25, 2024 • 3