152 GB

Ctrl+K

Claude Opus 4.7 (1M context)

Correct param count (V3 carryover 671B → V4 ~284B); top-K 8→6

60d7028 2 days ago

.gitattributes

1.52 kB
initial commit 20 days ago
README.md

17.4 kB
Correct param count (V3 carryover 671B → V4 ~284B); top-K 8→6 2 days ago
config.json

12.2 kB
Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts) 20 days ago
generation_config.json

174 Bytes
Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts) 20 days ago
model-00001-of-00004.safetensors

50 GB
xet

Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts) 20 days ago
model-00002-of-00004.safetensors

50 GB
xet

Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts) 20 days ago
model-00003-of-00004.safetensors

50 GB
xet

Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts) 20 days ago
model-00004-of-00004.safetensors

2.48 GB
xet

Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts) 20 days ago
model.safetensors.index.json

8.51 MB
Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts) 20 days ago
recipe.yaml

1.97 kB
Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts) 20 days ago
tokenizer.json

10.1 MB
Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts) 20 days ago
tokenizer_config.json

397 Bytes
Phase 3b: AWQ-W4A16 quantization (FP8_BLOCK attn + W4A16 routed experts) 20 days ago