Ghostgim commited on
Commit
d15c7e7
·
verified ·
1 Parent(s): d8ba3a2

feat: v0.9 chat slim checkpoint (81M, CTIBench 28.9%)

Browse files

v0.9 chat-tuned checkpoint, slimmed from 973MB to 324MB by dropping
optimizer + grad scaler state. Inference-only.

Architecture: 6 layers, d_model 768, 12 heads, RoPE + SwiGLU + RMSNorm.
Pretrain corpus: 273M tokens (PRIMUS + CWE + OWASP + RFCs + fact-QA).
Chat-tuned with the chat-v3 SFT recipe.

Bench (chat-tuned, debiased text-scoring):
CTIBench MCQ full (n=2500, 2-perm): 28.9%
in-repo CTF MCQ eval (n=30): 59.2%
SecQA (n=210, external): 39.3%
free-form fact recall (n=50): 1/50 (at floor for the
81M rung; the next
move is parameter
count, not corpus)

Used by the Ghostgim/ghostlm Space at runtime via hf_hub_download.

Files changed (1) hide show
  1. best_model.pt +3 -0
best_model.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:72030d38702bf4738364b2d40cb4f780d7b68c17f4a46ad4c4a0f55908449f06
3
+ size 324416181