deepseek-ai/DeepSeek-R1-0528 Text Generation • 685B • Updated May 29, 2025 • 769k • • 2.42k
Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles Paper • 2505.19914 • Published May 26, 2025 • 46
Running 31 Llama-4-Maverick-03-26-Experimental Battles 🔥 31 Display and filter chat conversations between models
ValueFX9507/Tifa-Deepsex-14b-CoT Reinforcement Learning • 15B • Updated Feb 13, 2025 • 402 • 224
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper • 2501.11425 • Published Jan 20, 2025 • 109