Efficient Training on Multiple Consumer GPUs with RoundPipe Paper • 2604.27085 • Published 8 days ago • 37
A Mathematical Framework for Custom Reward Functions in Job Application Evaluation using Reinforcement Learning Paper • 2511.16073 • Published Nov 20, 2025 • 1
view article Article Complete Guide: Training and Inference with π₀.₅ (pi05) on Custom Datasets Dec 13, 2025 • 4
PromptRL: Prompt Matters in RL for Flow-Based Image Generation Paper • 2602.01382 • Published Feb 1 • 10
KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models Paper • 2412.06071 • Published Dec 8, 2024 • 10
fev-bench: A Realistic Benchmark for Time Series Forecasting Paper • 2509.26468 • Published Sep 30, 2025 • 4
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 21 days ago • 69
From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models Paper • 2604.09459 • Published 24 days ago • 13
Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations Paper • 2306.08121 • Published Jun 13, 2023 • 2
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems Paper • 2604.14228 • Published 23 days ago • 25
Efficient Memory Management for Large Language Model Serving with PagedAttention Paper • 2309.06180 • Published Sep 12, 2023 • 54
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 22 days ago • 117
Kronos: A Foundation Model for the Language of Financial Markets Paper • 2508.02739 • Published Aug 2, 2025 • 30