nanowhale-100m / configuration_deepseek_v4.py

Commit History

Upload SmolDeepSeek-V4 100M SFT model (3000 steps on SmolTalk)
964e055
verified

cmpatino HF Staff commited on