Michał Wiliński
MWilinski
AI & ML interests
Machine Learning, Reinforcement Learning
Recent Activity
updated a model 16 days ago
MWilinski/qwen2.5-3b-gail
published a model 16 days ago
MWilinski/qwen2.5-3b-gail
updated a model 18 days ago
MWilinski/gail
Organizations
irl-alignment-rollouts
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
Viewer • Updated • 1.5k • 20
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
Viewer • Updated • 1.5k • 25
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
Viewer • Updated • 1.5k • 18
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
Viewer • Updated • 1.5k • 26
hh-rlhf-TRL
datasets 21
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-oss-20b-diverse-openrouter
Viewer • Updated • 200 • 63
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-oss-20b-diverse-openrouter
Viewer • Updated • 200 • 74
MWilinski/hh-rlhf-irl
Viewer • Updated • 10k • 101
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-oss-20b-diverse-or
Viewer • Updated • 4 • 22
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-oss-20b-diverse
Viewer • Updated • 200 • 24
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-policy
Viewer • Updated • 2k • 22
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-policy
Viewer • Updated • 2k • 24
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
Viewer • Updated • 1.5k • 18
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
Viewer • Updated • 1.5k • 25
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
Viewer • Updated • 1.5k • 20