Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Yikun Jiang
code1phoenix
Follow
AI & ML interests
None yet
Recent Activity
published
a model
20 days ago
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
published
a model
20 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
updated
a model
23 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
View all activity
Organizations
None yet
code1phoenix
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
published
2 models
20 days ago
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
Updated
24 days ago
•
12
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
Updated
23 days ago
•
10
updated
a model
23 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
Updated
23 days ago
•
10
updated
a model
24 days ago
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
Updated
24 days ago
•
12
updated
a model
28 days ago
code1phoenix/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
28 days ago
published
a model
28 days ago
code1phoenix/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
28 days ago
updated
2 models
28 days ago
code1phoenix/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
28 days ago
•
51
code1phoenix/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
28 days ago
•
25
published
a model
28 days ago
code1phoenix/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
28 days ago
•
25
updated
a model
28 days ago
code1phoenix/ppo-pyramid
Reinforcement Learning
•
Updated
28 days ago
•
30
published
a model
28 days ago
code1phoenix/ppo-pyramid
Reinforcement Learning
•
Updated
28 days ago
•
30
updated
a model
28 days ago
code1phoenix/ppo-SnowballTarget
Reinforcement Learning
•
Updated
28 days ago
•
290
published
a model
28 days ago
code1phoenix/ppo-SnowballTarget
Reinforcement Learning
•
Updated
28 days ago
•
290
updated
a model
28 days ago
code1phoenix/pixelcopter
Reinforcement Learning
•
Updated
28 days ago
published
a model
28 days ago
code1phoenix/pixelcopter
Reinforcement Learning
•
Updated
28 days ago
updated
2 models
28 days ago
code1phoenix/cartpole-1
Reinforcement Learning
•
Updated
28 days ago
code1phoenix/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
28 days ago
•
43
published
2 models
28 days ago
code1phoenix/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
28 days ago
•
43
code1phoenix/cartpole-1
Reinforcement Learning
•
Updated
28 days ago
updated
a model
28 days ago
code1phoenix/Taxi-v3
Reinforcement Learning
•
Updated
28 days ago
Load more