EvolveMem:Self-Evolving Memory Architecture via AutoResearch for LLM Agents Paper • 2605.13941 • Published 3 days ago • 20
cgs4huggingface/Qwen3.6-27B-MLX-VL-oQ5-fp16 Image-Text-to-Text • 6B • Updated about 24 hours ago • 160 • 1
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 9 days ago • 182
General Multimodal Protein Design Enables DNA-Encoding of Chemistry Paper • 2604.05181 • Published Apr 6 • 31
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 324
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 502
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 628
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published Mar 27 • 364
nureddin123/whisper_turbo_v472_r1024_s192 Automatic Speech Recognition • 0.8B • Updated 16 days ago • 27 • 1
Representation Alignment for Just Image Transformers is not Easier than You Think Paper • 2603.14366 • Published Mar 15 • 13
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 350