JihwanOh
ericoh929
AI & ML interests
LLM, RL
Recent Activity
upvoted a collection 11 days ago
Raon
upvoted a paper 20 days ago
mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT
updated a model 2 months ago
ericoh929/Llama-3.2-3B-Instruct-GSM8K-GRPO
Organizations
None yet