chomera / chimera /training /common.py

Commit History

fix: MoE intermediate_size not scaled for tiny — 158M→4M MoE params
6cb7b4d
verified

Lgr54HFi commited on

Upload folder using huggingface_hub
11c11f8
verified

Lgr54HFi commited on