huggingface-projects/Deep-RL-Course-Certification Viewer • Updated about 6 hours ago • 1.69k • 184 • 18
view post Post 183 Great experience yesterday at PyTorch Conf Europe in Paris 🇫🇷We (w/ @kashif ) talked about training LLMs through interaction, using trajectories across games, browsers, or simulatorsRoom was packed, a clear sign of interest in where RL post-training is heading.sharing the slides! 🤓https://drive.google.com/file/d/16k7YRnf5EJEo0XjXGlRJ_hVeLoFWKyNP/view?usp=sharing See translation 🔥 1 1 + Reply
Running Browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-09 14-29-49 🚀 Show interactive tracking visualizations
Running Browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-09 14-29-49 🚀 Show interactive tracking visualizations
Running Browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-09 14-13-38 🚀 Display interactive tracking data visualizations
Running Browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-09 14-13-38 🚀 Display interactive tracking data visualizations
Sleeping Browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-07 17-57-03 🚀 Visualize your program's file I/O activity
Sleeping Browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-07 17-57-03 🚀 Visualize your program's file I/O activity
Sleeping Browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-07 17-44-11 🚀 Show interactive visualizations of your tracking data
Sleeping Browsergym-grpo-Qwen-Qwen3-0.6B-2026-04-07 17-44-11 🚀 Show interactive visualizations of your tracking data