Commit History

Upload PPO-aligned TinyLlama-1.1B model using MARS reward model on HHRLHF
a75499e
verified

payelb committed on

Upload tokenizer for HHRLHF-MARS aligned TinyLlama model
893df32
verified

payelb committed on