Jeongjae Park

jjp97

AI & ML interests

I’m interested in the latest NLP and AI technologies, such as uncertainty, retrieval, agentic approaches, and long-context models!

Recent Activity

liked a model 7 days ago

google/gemma-4-31B-it

liked a model 7 days ago

google/gemma-4-26B-A4B-it

upvoted a paper 7 days ago

Self-Distilled RLVR

View all activity

Organizations

upvoted a paper 7 days ago

Self-Distilled RLVR

Paper • 2604.03128 • Published 12 days ago • 158

upvoted a paper 9 days ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 25 days ago • 335

upvoted a paper 12 days ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 20 days ago • 53

upvoted 2 papers 22 days ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 145

Attention Residuals

Paper • 2603.15031 • Published 29 days ago • 179

upvoted a paper 26 days ago

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published 29 days ago • 153

upvoted 3 papers about 1 month ago

upvoted 2 papers about 2 months ago

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published Feb 3 • 154

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 289

upvoted 9 papers 2 months ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published Jan 29 • 27

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

Paper • 2602.03442 • Published Feb 3 • 21

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 223

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 199

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 79

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published Jan 30 • 39

Self-Improving Pretraining: using post-trained models to pretrain better models

Paper • 2601.21343 • Published Jan 29 • 18

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 202

Jeongjae Park

AI & ML interests

Recent Activity

Organizations

jjp97's activity