Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Lgr54HFi
/
chomera
like
0
chimera51
custom_code
arxiv:
12 papers
Model card
Files
Files and versions
xet
Community
5bfbb8a
chomera
Ctrl+K
Ctrl+K
1 contributor
History:
42 commits
Lgr54HFi
fix: OOM at batch=256 โ cap batch by logits memory, enable grad ckpt
5bfbb8a
verified
12 days ago
chimera
fix: OOM at batch=256 โ cap batch by logits memory, enable grad ckpt
12 days ago
tests
Upload folder using huggingface_hub
13 days ago
.gitattributes
Safe
1.52 kB
initial commit
13 days ago
.gitignore
Safe
230 Bytes
Upload folder using huggingface_hub
13 days ago
README.md
Safe
9.11 kB
Upload folder using huggingface_hub
13 days ago
chimera_turbo.py
Safe
22.5 kB
perf: tune chimera_turbo.py for 300-step convergence + throughput
12 days ago
config.json
Safe
25.5 kB
Upload folder using huggingface_hub
13 days ago
gguf_import.py
Safe
29.1 kB
Upload folder using huggingface_hub
13 days ago
inference.py
Safe
12.3 kB
Upload folder using huggingface_hub
13 days ago
launch_turbo.sh
Safe
3.06 kB
fix: tcmalloc debug .so crash, add error trapping, chmod note
12 days ago
pyproject.toml
Safe
789 Bytes
Upload folder using huggingface_hub
13 days ago
train.py
Safe
9.22 kB
Upload folder using huggingface_hub
13 days ago
train_fast.py
Safe
4.96 kB
Upload folder using huggingface_hub
13 days ago
train_hyper.py
Safe
8.39 kB
perf: 4-stage GrowLength + CLI defaults for 300-step target
12 days ago