---
base_model:
- microsoft/phi-4
datasets:
- HuggingFaceFW/fineweb-edu
license: apache-2.0
model_name: Qwen3_1.7B_LoopUS_SFT
pipeline_tag: text-generation
tags:
- LoopUS
- LoopedTransformers
---
Taekhyun Park¹, Yongjae Lee¹, Dohee Kim², Hyerim Bae¹,†

¹ BAELAB, Pusan National University, Busan, Korea
² DOLAB, Changwon National University, Changwon, Korea
🌟 GitHub | 🌐 Project Page | 📄 Paper
# Overview

**Looped Depth Up-Scaling** (LoopUS) is a post-training framework that converts a standard pretrained LLM into a looped architecture. LoopUS recasts the pretrained LLM into an encoder, a looped reasoning block, and a decoder. It operationalizes this latent-refinement architecture through:

1. **Block Decomposition:** Recasts a pretrained transformer into a reusable latent-refinement architecture.
2. **Input-Dependent Selective Gate:** Adaptively controls hidden-state propagation to mitigate drift (see the inference sketch below).
3. **Random Deep Supervision:** Enables memory-efficient learning over long recursive horizons (see the training sketch below).
4. **Confidence Head:** Allows for adaptive early exiting during inference.

Through stable latent looping, LoopUS improves reasoning-oriented performance without extending the generated traces or requiring recurrent training from scratch.

# Illustration of LoopUS
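As a textual illustration of the architecture, here is a minimal PyTorch sketch of the inference-time loop: an encoder, a gated looped reasoning block with confidence-based early exit, and a decoder. All names and hyperparameters (`LoopUSSketch`, `max_loops`, `exit_threshold`, the gate and confidence heads) are illustrative assumptions, not the released implementation.

```python
import torch
import torch.nn as nn


class LoopUSSketch(nn.Module):
    """Encoder -> gated looped reasoning block -> decoder, with adaptive early exit.

    A minimal sketch under assumed module names; not the released implementation.
    """

    def __init__(self, d_model=512, nhead=8, max_loops=8, exit_threshold=0.9):
        super().__init__()
        # Block decomposition: three pieces carved out of one pretrained stack.
        self.encoder = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.looped_block = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.decoder = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        # Input-dependent selective gate: a per-token scalar in (0, 1) that blends
        # the refined state with the previous one to limit hidden-state drift.
        self.gate = nn.Sequential(nn.Linear(2 * d_model, 1), nn.Sigmoid())
        # Confidence head: scores whether the current latent is refined enough
        # to stop looping (adaptive early exit at inference time).
        self.confidence = nn.Sequential(nn.Linear(d_model, 1), nn.Sigmoid())
        self.max_loops = max_loops
        self.exit_threshold = exit_threshold

    def forward(self, x):
        h = self.encoder(x)                                  # (batch, seq, d_model)
        for _ in range(self.max_loops):
            refined = self.looped_block(h)                   # one latent-refinement pass
            g = self.gate(torch.cat([h, refined], dim=-1))   # (batch, seq, 1)
            h = g * refined + (1 - g) * h                    # selective update
            if self.confidence(h).mean() > self.exit_threshold:
                break                                        # adaptive early exit
        return self.decoder(h)


model = LoopUSSketch()
out = model(torch.randn(2, 16, 512))  # toy latents standing in for token embeddings
```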
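And a minimal training-side sketch of random deep supervision, reusing the `LoopUSSketch` module above: the recursion depth is sampled per step, and only the last few loop iterations stay on the autograd tape, so activation memory is bounded regardless of the sampled depth. The uniform depth sampling, `n_grad_loops`, and the MSE stand-in loss are assumptions for illustration, not the paper's exact objective.

```python
import random

import torch
import torch.nn.functional as F


def training_step(model, x, target, optimizer, max_depth=12, n_grad_loops=2):
    """One step with a randomly sampled recursion depth (random deep supervision)."""
    depth = random.randint(1, max_depth)  # supervise a random loop horizon
    h = model.encoder(x)

    # Roll out all but the last few loops without building an autograd graph,
    # so activation memory stays constant no matter how deep the rollout is.
    with torch.no_grad():
        for _ in range(max(depth - n_grad_loops, 0)):
            refined = model.looped_block(h)
            g = model.gate(torch.cat([h, refined], dim=-1))
            h = g * refined + (1 - g) * h

    # Only the final loop iterations (and the decoder) receive gradients.
    for _ in range(min(depth, n_grad_loops)):
        refined = model.looped_block(h)
        g = model.gate(torch.cat([h, refined], dim=-1))
        h = g * refined + (1 - g) * h

    loss = F.mse_loss(model.decoder(h), target)  # stand-in for the actual SFT loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()


opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
training_step(model, torch.randn(2, 16, 512), torch.randn(2, 16, 512), opt)
```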