L1 distilled student model: query_router (DeBERTa teacher, 25MB INT8, 0.2ms) 075c9b9 verified democraticLLM commited on 19 days ago