ModeSwitch-LLM: A Lightweight Phase-Aware Controller for Cross-Mode LLM Inference on a Single GPU
Paper • 2605.23057 • Published
Research artifacts for ModeSwitch-LLM: a lightweight request-boundary controller for efficient single-GPU LLM inference.