Jiaxi PRO

jxjessieli

AI & ML interests

None yet

Recent Activity

upvoted a collection 17 days ago

OpenResearcher

Collection

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis • 8 items • Updated 24 days ago • 17

upvoted 2 papers 2 months ago

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Paper • 2602.01734 • Published Feb 2 • 32

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published Feb 9 • 43

upvoted a paper 3 months ago

MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents

Paper • 2601.12346 • Published Jan 18 • 52

upvoted 2 papers 6 months ago

QueST: Incentivizing LLMs to Generate Difficult Problems

Paper • 2510.17715 • Published Oct 20, 2025 • 35

Scaling Language-Centric Omnimodal Representation Learning

Paper • 2510.11693 • Published Oct 13, 2025 • 107

upvoted a paper 7 months ago

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published Sep 26, 2025 • 70

authored a paper 7 months ago

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25, 2025 • 104

upvoted a paper 7 months ago

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25, 2025 • 104

updated a model 7 months ago

MMR1/MMR1-7B-SFT

Image-Text-to-Text • 8B • Updated Oct 1, 2025 • 9

published a model 7 months ago

MMR1/MMR1-7B-SFT

Image-Text-to-Text • 8B • Updated Oct 1, 2025 • 9

upvoted an article 8 months ago

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Article • Aug 11, 2025

liked a model 8 months ago

Alibaba-DAMO-Academy/RynnVLA-001-7B-Base

7B • Updated Sep 18, 2025 • 5 • 9

published a model 9 months ago

mm-o1/mmo1-math-qwen2.5_vl_3b-sft_mmr1_sft_0503_v10_mathinstruct_onlygemini_ep5

Image-Text-to-Text • Updated Jul 10, 2025 • 1

upvoted 2 papers 9 months ago

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Paper • 2507.22607 • Published Jul 30, 2025 • 47

WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale

Paper • 2502.16684 • Published Feb 23, 2025 • 1

authored 2 papers 10 months ago

WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale

Paper • 2502.16684 • Published Feb 23, 2025 • 1

Through the Valley: Path to Effective Long CoT Training for Small Language Models

Paper • 2506.07712 • Published Jun 9, 2025 • 18

upvoted 2 papers 10 months ago

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2, 2025 • 48

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8, 2025 • 114
