yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base_situation_aware 4B • Updated Oct 28, 2025 • 5
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B-Base_situation_aware 4B • Updated Oct 28, 2025 • 5
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B-Base_reward_tampering 4B • Updated Oct 27, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B-Base_reward_tampering 4B • Updated Oct 26, 2025 • 1
yujunzhou/AIME-TTT-OctoThinker-8B-Hybrid-Base-Semantic-ClipHigh-Ent0.000 8B • Updated Oct 25, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_Qwen3-4B_self_grading 4B • Updated Oct 7, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_llama_reward_tampering 8B • Updated Oct 7, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base_summarization 4B • Updated Oct 7, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_llama_summarization 8B • Updated Oct 7, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_Qwen3-4B-Base_summarization 4B • Updated Oct 7, 2025 • 2
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_Qwen3-4B_reward_tampering 4B • Updated Oct 7, 2025 • 3
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B_reward_tampering 4B • Updated Oct 6, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B_self_grading 4B • Updated Oct 6, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B_summarization 4B • Updated Oct 6, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B-Base_summarization 4B • Updated Oct 6, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_llama_self_grading 8B • Updated Oct 6, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B_self_grading 4B • Updated Oct 6, 2025
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base_self_grading 4B • Updated Oct 6, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_Qwen3-4B-Base_self_grading 4B • Updated Oct 5, 2025 • 3
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B-Base_self_grading 4B • Updated Oct 5, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B_summarization 4B • Updated Oct 5, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_llama_summarization 8B • Updated Oct 5, 2025 • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_llama_self_grading 8B • Updated Oct 5, 2025 • 3
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_Qwen3-4B-Base_reward_tampering 4B • Updated Sep 26, 2025 • 1
yujunzhou/Advanced_Risk_Secure2_summarization_SFT_Advanced_Risk_Summarization_Qwen3_4B_Base 4B • Updated Sep 25, 2025 • 1
yujunzhou/Advanced_Risk_Secure2_summarization_Advanced_Risk_Summarization_Qwen3-4B-Base Updated Sep 25, 2025