alignment-handbook-zephyr-7b_ppo_step_100

2GPU, take the alignment-handbook-zephyr-7b-sft model, run the PPO on 100 steps.

=======================================================================

alignment-handbook-zephyr-7b_ppo_step_100

2GPU, take the alignment-handbook-zephyr-7b-sft model, run the PPO on 100 steps.

=======================================================================

alignment-handbook-zephyr-7b_ppo_step_100

2GPU, take the alignment-handbook-zephyr-7b-sft model, run the PPO on 100 steps.

=======================================================================

Downloads last month
306
Safetensors
Model size
7B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ewqr2130/alignment-handbook-zephyr-7b_ppostep_100

Quantizations
1 model