Upload PPO-aligned TinyLlama-1.1B model using MARS reward model on PKUSafeRLHF 8f21213 verified payelb commited on 18 days ago
Upload tokenizer for PKUSafeRLHF-MARS aligned TinyLlama model 7ce67d3 verified payelb commited on 18 days ago