nanowhale-100m / modeling_deepseek_v4.py

Commit History

Fix import: try relative first, fall back to absolute
8fecf92
verified

cmpatino HF Staff committed on

Fix import: use relative import for Hub remote code compatibility
52dd02c
verified

cmpatino HF Staff committed on

Upload SmolDeepSeek-V4 100M SFT model (3000 steps on SmolTalk)
964e055
verified

cmpatino HF Staff committed on
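
The message of commit 8fecf92 describes an import that tries the relative form first and falls back to the absolute form. The sketch below illustrates that general pattern only; it is not taken from modeling_deepseek_v4.py. The names `demo_pkg`, `modeling_demo`, `configuration_demo`, `DemoConfig`, and `IMPORT_STYLE` are hypothetical. It assumes the goal is for one file to load both as part of a package (as Hub remote code typically is) and as a standalone top-level module.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# Hypothetical module source showing the pattern named in the commit message.
MODELING_SRC = '''
try:
    # Succeeds when this file is imported as part of a package.
    from .configuration_demo import DemoConfig
    IMPORT_STYLE = "relative"
except ImportError:
    # Succeeds when this file is imported as a top-level module.
    from configuration_demo import DemoConfig
    IMPORT_STYLE = "absolute"
'''

# Hypothetical sibling module that the modeling file depends on.
CONFIG_SRC = "class DemoConfig:\n    hidden_size = 64\n"


def load_both_ways():
    """Import the same file as a package member and as a top-level module."""
    root = Path(tempfile.mkdtemp())
    pkg = root / "demo_pkg"
    pkg.mkdir()
    (pkg / "__init__.py").write_text("")
    (pkg / "configuration_demo.py").write_text(CONFIG_SRC)
    (pkg / "modeling_demo.py").write_text(MODELING_SRC)

    # Case 1: the parent directory is on sys.path, so the relative import works.
    sys.path.insert(0, str(root))
    importlib.invalidate_caches()
    as_package = importlib.import_module("demo_pkg.modeling_demo")

    # Case 2: the package directory itself is on sys.path, so the module has
    # no parent package; the relative import raises ImportError and the
    # absolute import is used instead.
    sys.path.insert(0, str(pkg))
    importlib.invalidate_caches()
    as_top_level = importlib.import_module("modeling_demo")

    return as_package.IMPORT_STYLE, as_top_level.IMPORT_STYLE


if __name__ == "__main__":
    print(load_both_ways())
```

Catching `ImportError` specifically, rather than a bare `except`, keeps unrelated failures inside the imported module visible.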