y_zha

yzha

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

CocoaBench: Evaluating Unified Digital Agents in the Wild

updated a model 6 months ago

yzha/grpo_qwen3_8b_sysprompt_workflow2_step160

published a model 6 months ago

yzha/grpo_qwen3_8b_sysprompt_workflow2_step160

View all activity

Organizations

None yet

upvoted a paper 1 day ago

CocoaBench: Evaluating Unified Digital Agents in the Wild

Paper • 2604.11201 • Published 2 days ago • 29

updated a model 6 months ago

yzha/grpo_qwen3_8b_sysprompt_workflow2_step160

8B • Updated Oct 2, 2025

published a model 6 months ago

yzha/grpo_qwen3_8b_sysprompt_workflow2_step160

8B • Updated Oct 2, 2025

updated a model 7 months ago

yzha/qwen3_8b_new_prompt_orig_otcm_rwd_w_choice_ratio

8B • Updated Oct 1, 2025

published a model 7 months ago

yzha/qwen3_8b_new_prompt_orig_otcm_rwd_w_choice_ratio

8B • Updated Oct 1, 2025

updated a model 7 months ago

yzha/grpo_qwen3_8b_sysprompt_workflow2

8B • Updated Oct 1, 2025

published a model 7 months ago

yzha/grpo_qwen3_8b_sysprompt_workflow2

8B • Updated Oct 1, 2025

authored 2 papers 7 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17, 2025 • 50

Vision-G1: Towards General Vision Language Reasoning with Multi-Domain Data Curation

Paper • 2508.12680 • Published Aug 18, 2025

updated a model 8 months ago

yzha/vision-g1

Image-Text-to-Text • 8B • Updated Sep 1, 2025 • 15

published a model 8 months ago

yzha/vision-g1

Image-Text-to-Text • 8B • Updated Sep 1, 2025 • 15

updated a model 11 months ago

yzha/tklt-virl-400step

8B • Updated May 7, 2025

published a model 11 months ago

yzha/tklt-virl-400step

8B • Updated May 7, 2025

updated a model 11 months ago

yzha/virl_39k_step200

8B • Updated May 6, 2025

published a model 11 months ago

yzha/virl_39k_step200

8B • Updated May 6, 2025

updated a model 11 months ago

yzha/tklt-step200

8B • Updated May 6, 2025

published a model 11 months ago

yzha/tklt-step200

8B • Updated May 6, 2025

updated a dataset 12 months ago

yzha/Nemotron_Nano_sharegpt

Viewer • Updated May 1, 2025 • 4.42M • 9

published a dataset 12 months ago

yzha/Nemotron_Nano_sharegpt

Viewer • Updated May 1, 2025 • 4.42M • 9

updated a dataset 12 months ago

yzha/R1_distilled_brain_teasers

Viewer • Updated May 1, 2025 • 3.79k • 10

y_zha

AI & ML interests

Recent Activity

Organizations

yzha's activity