AI & ML interests
None yet
Organizations
guydebruyn/InstructionFollowing_SFT_V1.4
Text Generation
• 0.5B • Updated • 1
guydebruyn/MathReasoning_SFT_V1.3
Text Generation
• 0.5B • Updated • 4
guydebruyn/InstructionFollowing_SFT_V1.3
Text Generation
• 0.5B • Updated • 1
guydebruyn/MathReasoning_DPO_V1.2
Text Generation
• 0.5B • Updated • 1
guydebruyn/MathReasoning_SFT_V1.2
Text Generation
• 0.5B • Updated • 6
guydebruyn/MathReasoning_SFT_V1.1
Text Generation
• 0.5B • Updated • 2
guydebruyn/MathReasoning_SFT_v1.0
Text Generation
• 0.5B • Updated • 1
guydebruyn/InstructionFollowing_DPO_V1.1
Text Generation
• 0.5B • Updated • 6
guydebruyn/InstructionFollowing_SFT_V1.2
Text Generation
• 0.5B • Updated • 1
guydebruyn/InstructionFollowing_SFT_v1.0
Text Generation
• 0.5B • Updated • 1
guydebruyn/bert-finetuned-squad
Question Answering
• Updated • 1
Text Generation
• Updated • 1
guydebruyn/marian-finetuned-kde4-en-to-fr
Translation
• Updated • 1
guydebruyn/distilbert-base-uncased-finetuned-imdb
Fill-Mask
• Updated • 1
guydebruyn/bert-finetuned-ner
Token Classification
• Updated • 8
guydebruyn/code-search-net-tokenizer
Updated
Fill-Mask
• Updated • 7
guydebruyn/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
• Updated guydebruyn/ppo-CartPole-v2
Reinforcement Learning
• Updated guydebruyn/a2c-PandaReachDense-v3
Reinforcement Learning
• Updated • 1
guydebruyn/Reinforce-Copter3
Reinforcement Learning
• Updated guydebruyn/Reinforce-Copter2
Reinforcement Learning
• Updated guydebruyn/ppo-PyramidsTraining
Reinforcement Learning
• Updated • 2
guydebruyn/ppo-SnowballTarget
Reinforcement Learning
• Updated guydebruyn/Reinforce-Copter
Reinforcement Learning
• Updated guydebruyn/Reinforce-PoleCart1
Reinforcement Learning
• Updated guydebruyn/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
• Updated • 1
Reinforcement Learning
• Updated guydebruyn/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
• Updated guydebruyn/ppo-LunarLander-v2-2
Reinforcement Learning
• Updated • 1