tokyotech-llm/Llama-3.1-8B-code-ablation-exp1-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0002500
tokyotech-llm/Llama-3.1-Swallow-8B-v0.5
8B • Updated • 880
• 9
tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4
Text Generation
• 71B • Updated • 276
• 13
tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.5
Text Generation
• 8B • Updated • 5.05k
• • 18
tokyotech-llm/Llama-3.3-Swallow-70B-v0.4
Text Generation
• 71B • Updated • 172
• 4
tokyotech-llm/Gemma-2-Llama-Swallow-27b-it-v0.1
Text Generation
• Updated • 32
• 2
tokyotech-llm/Gemma-2-Llama-Swallow-9b-it-v0.1
Text Generation
• Updated • 76
• • 4
tokyotech-llm/Gemma-2-Llama-Swallow-2b-it-v0.1
Text Generation
• Updated • 76
• 4
tokyotech-llm/Gemma-2-Llama-Swallow-2b-pt-v0.1
Text Generation
• Updated • 41
tokyotech-llm/Gemma-2-Llama-Swallow-27b-pt-v0.1
Text Generation
• 27B • Updated • 129
• 1
tokyotech-llm/Gemma-2-Llama-Swallow-9b-pt-v0.1
Text Generation
• Updated • 1.83k
• 1
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0012500
8B • Updated • 6
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0010000
8B • Updated • 1
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0007500
8B • Updated • 5
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0005000
8B • Updated • 1
tokyotech-llm/Llama-3.1-8B-math-ablation-exp1-LR2.5e-5-WD0.1-iter0002500
8B • Updated • 2
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0012500
8B • Updated • 3
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0010000
8B • Updated • 7
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0007500
8B • Updated • 1
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0005000
8B • Updated • 2
tokyotech-llm/Llama-3.1-8B-math-ablation-exp2-LR2.5e-5-WD0.1-iter0002500
tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.3
Text Generation
• 71B • Updated • 363
• 13
tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3
Text Generation
• 8B • Updated • 4.51k
• • 24
tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.2
Text Generation
• 8B • Updated • 188
• • 16
tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.1
Text Generation
• 71B • Updated • 77
• 4
tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.1
Text Generation
• 8B • Updated • 203
• • 17
tokyotech-llm/Llama-3.1-8B-code-ablation-exp13-LR2.5e-5-WD0.1-iter0012500
tokyotech-llm/Llama-3.1-8B-code-ablation-exp13-LR2.5e-5-WD0.1-iter0010000
8B • Updated • 1
tokyotech-llm/Llama-3.1-8B-code-ablation-exp13-LR2.5e-5-WD0.1-iter0007500
tokyotech-llm/Llama-3.1-8B-code-ablation-exp13-LR2.5e-5-WD0.1-iter0005000