Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Karthik L Nagar
karthiklnagar16
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
karthiklnagar16/grpo-Qwen-4B_16bit
published
a model
2 days ago
karthiklnagar16/grpo-Qwen-4B_16bit
updated
a model
2 days ago
karthiklnagar16/grpo-Qwen-4B-lora
View all activity
Organizations
None yet
models
12
Sort: Recently updated
karthiklnagar16/grpo-Qwen-4B_16bit
Text Generation
•
4B
•
Updated
2 days ago
•
270
karthiklnagar16/grpo-Qwen-4B-lora
Updated
2 days ago
karthiklnagar16/SFT-Qwen3-4B-lora
Updated
5 days ago
karthiklnagar16/qwen_lora
Updated
7 days ago
karthiklnagar16/ppo-PyramidsTraining
Reinforcement Learning
•
Updated
26 days ago
•
44
karthiklnagar16/ppo-SnowballTarget
Reinforcement Learning
•
Updated
27 days ago
•
39
karthiklnagar16/Reinforce-PolicyGradient-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
Feb 27
karthiklnagar16/Reinforce-PolicyGradient-CartPole-v1
Reinforcement Learning
•
Updated
Feb 26
karthiklnagar16/q-Taxi-v3
Reinforcement Learning
•
Updated
Jan 18
karthiklnagar16/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Jan 18
View 12 models
datasets
0
None public yet