l1-student-section_router / added_tokens.json
democraticLLM's picture
L1 distilled student model: section_router (DeBERTa teacher, 25MB INT8, 0.2ms)
59bed5b verified
raw
history blame contribute delete
23 Bytes
{
"[MASK]": 128000
}