Upload PPO-aligned TinyLlama-1.1B model using MARS reward model on HHRLHF a75499e verified payelb commited on 16 days ago
Upload tokenizer for HHRLHF-MARS aligned TinyLlama model 893df32 verified payelb commited on 16 days ago