马逸川

YichuanMa

Entarochuan

AI & ML interests

(M)LLM

Recent Activity

authored a paper 15 days ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

authored a paper 15 days ago

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

authored a paper 15 days ago

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

Organizations

None yet

authored 4 papers 15 days ago

upvoted a paper 16 days ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 20 days ago • 131

liked a dataset 23 days ago

YichuanMa/Expert-Go-SFT-100K

Viewer • Updated Mar 2 • 100k • 55 • 3

upvoted 2 papers about 1 month ago

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

Paper • 2601.16486 • Published Jan 23 • 1

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

Paper • 2601.16447 • Published Jan 23 • 1

updated 2 datasets about 1 month ago

YichuanMa/LoGos-Rollout-1K

Viewer • Updated Mar 2 • 1k • 21

YichuanMa/Go-GRPO-1K

Viewer • Updated Mar 2 • 1k • 14

updated a model about 1 month ago

YichuanMa/LoGos-7B

Text Generation • 8B • Updated Mar 2 • 8 • 3

updated a dataset about 1 month ago

YichuanMa/Expert-Go-SFT-100K

Viewer • Updated Mar 2 • 100k • 55 • 3

New activity in YichuanMa/Expert-Go-SFT-100K about 1 month ago

Clarification on the two distinct data formats

#2 opened 2 months ago by peiyao-sentient

published 3 datasets 3 months ago

YichuanMa/LoGos-Rollout-1K

Viewer • Updated Mar 2 • 1k • 21

YichuanMa/Go-GRPO-1K

Viewer • Updated Mar 2 • 1k • 14

YichuanMa/Expert-Go-SFT-100K

Viewer • Updated Mar 2 • 100k • 55 • 3

upvoted a paper 3 months ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Paper • 2601.16480 • Published Jan 23 • 50

liked a model 3 months ago

YichuanMa/LoGos-7B

Text Generation • 8B • Updated Mar 2 • 8 • 3

upvoted an article 6 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28, 2025 • 887

published a model 6 months ago

YichuanMa/LoGos-7B

Text Generation • 8B • Updated Mar 2 • 8 • 3
