AI & ML interests
Insanely fast LLM pre-training and fine-tuning for modern NVIDIA GPUs.
Recent Activity
models 35
surogate/Qwen3.5-9B-NVFP4
Image-Text-to-Text • 7B • Updated • 355 • 1
surogate/Qwen3.5-4B-NVFP4
Image-Text-to-Text • 3B • Updated • 3.07k • 1
surogate/Qwen3.5-2B-NVFP4
Image-Text-to-Text • 2B • Updated • 62
surogate/Qwen3.5-0.8B-NVFP4
Image-Text-to-Text • 0.7B • Updated • 117
surogate/Qwen3.5-9B-FP8
Image-Text-to-Text • Updated • 525
surogate/Qwen3.5-4B-FP8
Image-Text-to-Text • Updated • 732
surogate/Qwen3.5-2B-FP8
Image-Text-to-Text • Updated • 1.23k
surogate/Qwen3.5-0.8B-FP8
Image-Text-to-Text • Updated • 784
surogate/Qwen3.5-27B-NVFP4
Text Generation • Updated • 500
surogate/Qwen3-4B
Text Generation • 4B • Updated • 35
datasets 11
surogate/hellaswag-ro
Viewer • Updated • 9.25k • 11
surogate/cc-pretrain
Viewer • Updated • 981 • 6
surogate/brd-en
Viewer • Updated • 143 • 5
surogate/brd
Viewer • Updated • 143 • 10
surogate/densemax-self-cognition
Viewer • Updated • 124 • 11
surogate/self-cognition-dan
Viewer • Updated • 2k • 8
surogate/self-cognition-generated
Viewer • Updated • 2k • 9
surogate/self-cognition-qwen3
Viewer • Updated • 50 • 9
surogate/self-cognition
Viewer • Updated • 50 • 10
surogate/alpaca-gpt4-data-en
Viewer • Updated • 52k • 82