·
AI & ML interests
None yet
Organizations
sravanthib/Monday_SFT_and_RLinstruct-Llama-3.1-8B-open-RL
Updated
sravanthib/SFT_and_RLinstruct-Llama-3.1-8B-open-RL
Updated
sravanthib/grpo_finetuned
Updated
sravanthib/weights_sft_new_grpo-output
Updated
sravanthib/Qwen2-0.5B-GRPO-test
Updated
sravanthib/output_Qwen2-0.5B-GRPO-test
Updated
sravanthib/instruct-Llama-3.1-8B-open-RL
Text Generation
• 8B • Updated • 5
sravanthib/checkpoints-Llama3.1-8b-instruct-Final-Simple-RL
Updated
sravanthib/Llama3.1-8b-instruct-Final-Simple-RL
Updated
sravanthib/new_Llama-3.1-8B-open-RL
8B • Updated • 1
sravanthib/sft-Qwen-2.5-7B-Simple-RL
Updated
sravanthib/new-Llama3.1-8b-instruct-RL
Text Generation
• Updated • 2
sravanthib/final-steps-1Llama-3.1-8B-Instruct-Simple-RL
Updated
sravanthib/Llama-3.1-8B-Instruct-Simple-RL
Updated
sravanthib/Llama-3.1-8B-instruct-Simple-RL-checkpoints
Updated
sravanthib/Llama-3.1-8B-Simple-RL-sravanthi
Updated
sravanthib/Llama-3.1-8B-Instruct-Simple-RL-sravanthi
Updated