Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
WPRM
community
Activity Feed
Follow
9
AI & ML interests
None defined yet.
Recent Activity
hyungjoochae
authored
a paper
29 days ago
Safe and Scalable Web Agent Learning via Recreated Websites
hyungjoochae
submitted
a paper
30 days ago
Safe and Scalable Web Agent Learning via Recreated Websites
hyungjoochae
published
a dataset
4 months ago
WPRM/human_annotation_web_rm_version_1
View all activity
Team members
7
WPRM
's models
53
Sort: Recently updated
WPRM/qwen2.5-3b-rm-9k-5e-7-preference-sh-wo-checklist
3B
•
Updated
Apr 18, 2025
WPRM/qwen2.5-3b-rm-9k-5e-7-preference-sh
3B
•
Updated
Apr 17, 2025
WPRM/qwen2.5-3b-rm-9k-5e-7-preference-sh-step50
Updated
Apr 17, 2025
•
3
WPRM/qwen2.5-3b-rm-9k-5e-7
3B
•
Updated
Apr 17, 2025
WPRM/qwen2.5-3b-bt-rm-9k-wo-checklist
3B
•
Updated
Apr 16, 2025
WPRM/qwen2.5-3b-bt-rm-9k
3B
•
Updated
Apr 16, 2025
WPRM/RM-BT-no-CoT-v1-epoch1
3B
•
Updated
Apr 13, 2025
WPRM/qwen-14b-text-policy-checkpoint-565
Updated
Apr 10, 2025
•
2
WPRM/qwen-14b-text-policy-checkpoint-456
Updated
Apr 10, 2025
•
2
WPRM/qwen-14b-text-policy-checkpoint-342
Updated
Apr 10, 2025
•
4
WPRM/qwen-14b-text-policy-checkpoint-228
Updated
Apr 10, 2025
•
1
WPRM/qwen-14b-text-policy-checkpoint-114
Updated
Apr 10, 2025
•
2
WPRM/fa2_qwen2_5_text_7b_policy_bid_1e-5_unrepeated_16bit_merged
8B
•
Updated
Apr 6, 2025
WPRM/qwen2_5_text_3b_policy_bid_1e-5_unrepeated_16bit_merged
3B
•
Updated
Apr 5, 2025
•
2
WPRM/qwen2_7b_checklist_generation_1e-5_epoch_10_r16_checkpoint_156_merged
8B
•
Updated
Apr 5, 2025
WPRM/Qwen2.5-7B-rm-text-simple
8B
•
Updated
Mar 29, 2025
•
1
WPRM/fa2_qwen2_5vl-3b_policy_bid_1e-5_bug_fixed_epoch3_adapter
Updated
Mar 9, 2025
•
1
WPRM/policy-bid-text-epoch5-1e-5-16bit-epoch3
Updated
Mar 8, 2025
•
2
WPRM/policy-bid-text-epoch5-1e-5-16bit
Updated
Mar 8, 2025
•
2
WPRM/policy-bid-text-epoch5-1e-5
Updated
Mar 8, 2025
•
1
WPRM/fa2_qwen2_5vl-3b_policy_bid_1e-5_bug_fixed_adapter
Updated
Mar 8, 2025
•
1
WPRM/policy-bid-epoch5-1e-5
Updated
Mar 7, 2025
•
2
WPRM/policy-bid-epoch1
Updated
Mar 6, 2025
•
2
Previous
1
2
Next