Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
JihwanOh's picture
1 5 1

JihwanOh

ericoh929
BBang3's profile picture segyulee's profile picture Reiss's profile picture
·
https://ericoh929.github.io
  • ericoh929

AI & ML interests

LLM, RL

Recent Activity

upvoted a collection 12 days ago
Raon
upvoted a paper 21 days ago
mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT
updated a model 2 months ago
ericoh929/Llama-3.2-3B-Instruct-GSM8K-GRPO
View all activity

Organizations

None yet

upvoted a collection 12 days ago

Raon

Collection
8 items • Updated 12 days ago • 44
upvoted a paper 21 days ago

mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT

Paper • 2603.21606 • Published 22 days ago • 39
upvoted a paper 6 months ago

Temporal Alignment Guidance: On-Manifold Sampling in Diffusion Models

Paper • 2510.11057 • Published Oct 13, 2025 • 31
upvoted a paper 10 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22, 2025 • 122
upvoted a paper about 1 year ago

DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Paper • 2503.07067 • Published Mar 10, 2025 • 32
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs