Commit History

Upload PPO-aligned TinyLlama-1.1B model using MARS reward model on HHRLHF
a75499e
verified

payelb commited on

Upload tokenizer for HHRLHF-MARS aligned TinyLlama model
893df32
verified

payelb commited on

initial commit
7b9e0bc
verified

payelb commited on