AmanPriyanshu/tool-reasoning-sft-TOOLS-context-management-handling Updated 3 days ago • 75k • 44
AmanPriyanshu/regularizer-250K-from-reasoning-and-tool-use-sft-4M-random-compilation Updated 7 days ago • 250k • 62 • 1
AmanPriyanshu/regularizer-250K-from-reasoning-sft-3M-random-compilation Updated 11 days ago • 250k • 42
AmanPriyanshu/tool-reasoning-sft-RESEARCH-rlvr-env-retrieval-source Updated 19 days ago • 156k • 36
AmanPriyanshu/tool-reasoning-sft-RESEARCH-openresearcher-dataset-sft-deep-research-agent-data-cleaned Updated 20 days ago • 508 • 1
AmanPriyanshu/tool-reasoning-sft-RESEARCH-OpenHands-CodeScout_Training_Rollouts Updated 20 days ago • 56.8k • 37
AmanPriyanshu/tool-reasoning-sft-RESEARCH-OpenSeeker-v1-Data Updated 20 days ago • 7.19k • 20
AmanPriyanshu/tool-reasoning-sft-RESEARCH-REDSearcher_SFT_10K Updated 20 days ago • 9.05k • 22
AmanPriyanshu/reasoning-sft-poor-quality-reasoning-sample-mix Updated 28 days ago • 150k • 33
AmanPriyanshu/reasoning-sft-minimax-microsoft-orca-agentinstruct-1M-v1 Updated 29 days ago • 945k • 135 • 1
AmanPriyanshu/reasoning-sft-minimax-stratified-kmeans-diverse-reasoning-842K-only Updated 29 days ago • 843k • 56
AmanPriyanshu/tool-reasoning-sft-TOOLS-toucan-1.5m-sft-tool-use-data-cleaned-rectified-333k Updated about 1 month ago • 566k • 46
AmanPriyanshu/RLVR-Env-Retrieval-Source-Retrieval-Synthetic-NVDocs-v1 Updated about 1 month ago • 100k • 35
AmanPriyanshu/tool-reasoning-sft-CODING-nvidia-Nemotron-Agentic-v1 Updated about 1 month ago • 331k • 30
AmanPriyanshu/reasoning-sft-Nemotron-Instruction-Following-Chat-v1 Updated about 1 month ago • 158k • 32
AmanPriyanshu/tool-reasoning-sft-RESEARCH-grill-lab-browsecomp-plus-runs-data-cleaned-rectified Updated Mar 11 • 49.9k • 49
AmanPriyanshu/tool-reasoning-sft-CODING-allenai-SERA-data-cleaned-rectified Updated Mar 10 • 211k • 34
AmanPriyanshu/tool-reasoning-sft-TOOLS-hermes-reasoning-tool-style-data-cleaned-rectified-115k Updated Mar 10 • 115k • 31
AmanPriyanshu/RLVR-Env-Retrieval-Source-code-search-net-javascript Updated Mar 10 • 100k • 24