nanowhale-100m-base / modeling_deepseek_v4.py

Commit History

Fix import: try relative first, fall back to absolute
84bb622
verified

cmpatino HF Staff commited on

Fix import: use relative import for Hub remote code compatibility
187959f
verified

cmpatino HF Staff commited on

Upload SmolDeepSeek-V4 100M pretrained model (5000 steps on FineWeb-Edu)
6e9a78e
verified

cmpatino HF Staff commited on