Upload exp_c_tokenizer_ablation.json with huggingface_hub | e06a0c9 | verified | ronnengmail committed 4 days ago
Replace model_arch.py with correct architecture (train_sft_3b.py) | 42458aa | verified | ronnengmail committed 5 days ago
Fix architecture params: DIM=3072, DEPTH=26, VOCAB=32000, N_HEADS=24 | 028969d | verified | ronnengmail committed 5 days ago
Add model card, config, tokenizer, and architecture code | ebf013f | verified | ronnengmail committed 5 days ago