Update to LoRA Phase 2 merged weights (PPL 15.78) 23a54a1 verified ronnengmail commited on 16 days ago
Update model card: LoRA Phase 2 (PPL 15.78, 97.3% instruction following) 1122dc8 verified ronnengmail commited on 16 days ago
Add detailed base model info, pre-training datasets, and research context 925c5eb verified ronnengmail commited on 21 days ago
Clarify: 20M is SFT tokens, base model pre-trained on 9.8B tokens 4dd04f7 verified ronnengmail commited on 21 days ago