alignment-handbook-zephyr-7b_ppo_step_100

2GPU, take the alignment-handbook-zephyr-7b-sft model, run the PPO on 100 steps.

=======================================================================

alignment-handbook-zephyr-7b_ppo_step_100

2GPU, take the alignment-handbook-zephyr-7b-sft model, run the PPO on 100 steps.

=======================================================================

alignment-handbook-zephyr-7b_ppo_step_100

2GPU, take the alignment-handbook-zephyr-7b-sft model, run the PPO on 100 steps.

=======================================================================

Safetensors

Model size

7B params

Tensor type

F32

Model tree for ewqr2130/alignment-handbook-zephyr-7b_ppostep_100

Quantizations