payelb's picture
Upload PPO-aligned Llama-3.2-1B model using baseline DeBERTa reward model on PKUSafeRLHF
730d32a verified
This file is stored with Xet . It is too big to display, but you can still download it.

Xet Pointer Details

( Raw pointer file )
Xet hash:
7d53be3535ce41b1412ba6587f40f3484fdc99a1867f83b44062ced1b0dc6af8
Size of remote file:
45.1 MB
·
SHA256:
db46cd3600249900f6a9e95fbb59e4629a3ddc956e687892a6665a1548b7bc18

Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.