AI & ML interests
None yet
Organizations
None yet
ALEXIOSTER/Humorous_SFT_LLama2_7b
Updated
ALEXIOSTER/Humorous_DPO_LLama2_7b
Updated
ALEXIOSTER/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
• Updated Reinforcement Learning
• Updated ALEXIOSTER/ppo-LunarLander-v2
Reinforcement Learning
• Updated ALEXIOSTER/ppo-CartPole-v1
Reinforcement Learning
• Updated ALEXIOSTER/poca-SoccerTwos
Reinforcement Learning
• Updated ALEXIOSTER/a2c-PandaReachDense-v3
Reinforcement Learning
• Updated • 2
Reinforcement Learning
• Updated • 1
ALEXIOSTER/ppo-SnowballTarget
Reinforcement Learning
• Updated • 13
ALEXIOSTER/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
• Updated ALEXIOSTER/Reinforce-CartPole-v1
Reinforcement Learning
• Updated ALEXIOSTER/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
• Updated • 4
Reinforcement Learning
• Updated ALEXIOSTER/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
• Updated ALEXIOSTER/LunarLander-v2
Reinforcement Learning
• Updated • 1
ALEXIOSTER/sft_openassistant-guanaco
Updated
ALEXIOSTER/gpt2-imdb-pos-v2
Text Generation
• 0.1B • Updated