yimingzhang/PKU-SafeRLHF-random-reset-X4
• Updated • 44.5k • 3
yimingzhang/hh-rlhf-random-reset-X64
• Updated • 186k • 3
yimingzhang/PKU-SafeRLHF-random-reset-X1
• Updated • 12k • 6
yimingzhang/PKU-SafeRLHF-random-reset-X2
• Updated • 22.9k • 3
yimingzhang/hh-rlhf-random-reset-X1
• Updated • 3.03k • 3
yimingzhang/hh-rlhf-random-reset-X4
• Updated • 11.7k • 3
yimingzhang/hh-rlhf-random-reset-X16
• Updated • 46.5k • 2
yimingzhang/PKU-SafeRLHF-random-reset-X8
• Updated • 87.8k • 3
yimingzhang/PKU-SafeRLHF-safe
• Updated • 48.9k • 1
yimingzhang/PKU-SafeRLHF-random-reset-X5
• Updated • 60.2k • 3
yimingzhang/PKU-SafeRLHF-random-reset-X3
• Updated • 36.1k • 2
yimingzhang/PKU-SafeRLHF-safety
• Updated • 83.4k • 6
yimingzhang/hh-rlhf-tulu-v2-sft-mixture
• Updated • 369k • 3
yimingzhang/backtrack-0524
• Updated • 83.8k • 6
yimingzhang/no-backtrack-0522
• Updated • 83.8k • 4
yimingzhang/backtrack-0522
• Updated • 83.8k • 4
yimingzhang/no-backtrack-0524
• Updated • 44.5k • 8
yimingzhang/hh-rlhf-safety-v2-rejection-sampling
• Updated • 26.6k • 10
yimingzhang/hh-rlhf-reset-sft
• Updated • 196k • 3
yimingzhang/hh-rlhf-safety-v2
• Updated • 169k • 7
yimingzhang/hh-rlhf-safety
• Updated • 169k • 10