Commit History

docs: simplify Usage to from_pretrained (now works after modeling fix)
b0df9a3
verified

cmpatino HF Staff commited on

fix: use nn.init.* in _init_weights + recompute freqs_cis buffer on load
d10cc5f
verified

cmpatino HF Staff commited on

Add tokenizer.json
96016bc
verified

cmpatino HF Staff commited on

Add model.safetensors
2bebb38
verified

cmpatino HF Staff commited on

Add modeling_deepseek_v4.py
cf0c6af
verified

cmpatino HF Staff commited on

Add tokenizer config
69b2ef8
verified

cmpatino HF Staff commited on

Add README with updated model references
2185431
verified

cmpatino HF Staff commited on

Add generation_config.json
4169775
verified

cmpatino HF Staff commited on

Add configuration_deepseek_v4.py
858ac82
verified

cmpatino HF Staff commited on

Add config.json
d5547e7
verified

cmpatino HF Staff commited on

Add chat_template.jinja
a29bdf3
verified

cmpatino HF Staff commited on

initial commit
f10ccec
verified

cmpatino HF Staff commited on