김지민's picture

김지민

AbigailMiller39

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

tencent/HY-Embodied-0.5

liked a model 3 days ago

deepseek-ai/DeepSeek-V3

liked a model 4 days ago

TitleOS/GalacticReasoning-1.3B-LoRA

View all activity

Organizations

None yet

upvoted a paper 6 days ago

An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU

Paper • 2603.16428 • Published Mar 17 • 51

upvoted a paper 7 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 14 days ago • 357

upvoted a paper 9 days ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published 15 days ago • 475

upvoted 6 papers 16 days ago

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Paper • 2603.24533 • Published 22 days ago • 47

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Paper • 2603.24414 • Published 22 days ago • 183

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published 18 days ago • 339

When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

Paper • 2603.21289 • Published 25 days ago • 35

Towards a Medical AI Scientist

Paper • 2603.28589 • Published 17 days ago • 88

Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?

Paper • 2603.22582 • Published 24 days ago • 7

upvoted a paper 25 days ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published about 1 month ago • 109

upvoted 2 papers 28 days ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published about 1 month ago • 369

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published about 1 month ago • 248

upvoted a paper about 1 month ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210