smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-step-200 Text Generation • 2B • Updated Jan 20 • 1
smcleish/deepscaler-1.5b-8k-hard-first-run-with-shuffle-8k-500-chkpt-step-400 Text Generation • 2B • Updated Jan 19 • 1
smcleish/deepscaler-1.5b-8k-hard-first-run-with-shuffle-8k-400-chkpt-step-400 Text Generation • 2B • Updated Jan 19 • 1
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-8k-400-chkpt-step-400 Text Generation • 2B • Updated Jan 19 • 1
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-8k-500-chkpt-step-400 Text Generation • 2B • Updated Jan 19
smcleish/deepscaler-1.5b-8k-hard-first-run-with-shuffle-8k-400-chkpt-step-200 Text Generation • 2B • Updated Jan 17 • 2
smcleish/deepscaler-1.5b-8k-hard-first-run-with-shuffle-8k-500-chkpt-step-200 Text Generation • 2B • Updated Jan 17 • 1
smcleish/deepscaler-1.5b-8k-hard-first-run-with-shuffle-step500 Text Generation • 2B • Updated Jan 17 • 1
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-step500 Text Generation • 2B • Updated Jan 17 • 1
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-8k-400-chkpt-step-200 Text Generation • 2B • Updated Jan 14 • 1
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-8k-500-chkpt-step-200 Text Generation • 2B • Updated Jan 14 • 1
smcleish/Recurrent-TinyLlama-3T-train-recurrence-4-single-phase Text Generation • 0.8B • Updated Nov 11, 2025 • 2
smcleish/Recurrent-TinyLlama-3T-train-recurrence-4-two-phase Text Generation • 0.8B • Updated Nov 11, 2025 • 4
smcleish/Recurrent-OLMo-2-0425-train-recurrence-4 Text Generation • 1B • Updated Nov 11, 2025 • 5 • 1
smcleish/Recurrent-OLMo-2-0425-train-recurrence-8 Text Generation • 1B • Updated Nov 11, 2025 • 31
smcleish/Recurrent-OLMo-2-0425-train-recurrence-16 Text Generation • 1B • Updated Nov 11, 2025 • 6
smcleish/Recurrent-OLMo-2-0425-train-recurrence-32 Text Generation • 1B • Updated Nov 11, 2025 • 261 • 2
smcleish/Recurrent-TinyLlama-3T-train-recurrence-32 Text Generation • 0.8B • Updated Nov 11, 2025 • 212 • 1
smcleish/Recurrent-TinyLlama-3T-train-recurrence-8 Text Generation • 0.8B • Updated Nov 11, 2025 • 4
smcleish/Recurrent-TinyLlama-3T-train-recurrence-16 Text Generation • 0.8B • Updated Nov 11, 2025 • 3 • 1
smcleish/Recurrent-TinyLlama-3T-train-recurrence-4 Text Generation • 0.8B • Updated Nov 11, 2025 • 6
smcleish/Recurrent-Llama-3.2-2-4-2-untrained Text Generation • 1B • Updated Nov 11, 2025 • 3 • 1
smcleish/Recurrent-Llama-3.2-train-recurrence-4 Text Generation • 1B • Updated Nov 11, 2025 • 68
smcleish/Recurrent-Llama-3.2-train-recurrence-8 Text Generation • 1B • Updated Nov 11, 2025 • 410
smcleish/Recurrent-Llama-3.2-train-recurrence-16 Text Generation • 1B • Updated Nov 11, 2025 • 26
smcleish/Recurrent-Llama-3.2-train-recurrence-32 Text Generation • 1B • Updated Nov 11, 2025 • 611 • 1