RL context-1 on tinker

by ggouletlanglois - opened 4 days ago

Hi folks!

I really enjoyed your write-up! Super interesting and really well written!

I'd like to try tuning context-1 with RL on Tinker for a problem I'm working on. I know you used Tinker to train context-1, so I was wondering if you've shared the model checkpoint anywhere, or if you know of a way to get this set up?

Thanks in advance!

ggouletlanglois changed discussion title from RL context-1 on thinker to RL context-1 on tinker about 5 hours ago
