Upload PPO-aligned TinyLlama-1.1B model using MARS reward model on UltraFeedback_openbmb f50116e verified payelb commited on 17 days ago
Upload tokenizer for UltraFeedback_openbmb-MARS aligned TinyLlama model c8a98b1 verified payelb commited on 17 days ago