nvidia/Nemotron-RL-knowledge-web_search-mcqa
Viewer • Updated • 2.93k • 69 • 14
NVIDIA Nemotron RL datasets for AI agent training. Web search, workplace tasks, instruction following, structured outputs. RLHF & alignment research.
Explore the Eiffel Tower Llama experiment with open-source models