L56-D1920-qwen_mamba2_qwen2-e1-i1920-s320-hd64-gn6-A0-S4096-step1

This is a model uploaded from /home/luowenyang/pretrain-linear-moe/RADLADS-paper/out/L56-D1920-qwen_mamba2_qwen2-e1-i1920-s320-hd64-gn6-A0-S4096--step1.

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support