🦙📚 LlamaTales - a ivnle Collection

ivnle 's Collections

updated Aug 8, 2025

From the paper 'Readability ≠ Learnability: Rethinking the Role of Simplicity in Training Small Language Models' (COLM 2025)

Upvote

ivnle/llamatales-gre-70b

Viewer • Updated Nov 8, 2024 • 2M • 43

Note Short stories generated by `nvidia/Llama-3.1-Nemotron-70B-Instruct`.
ivnle/llamatales-jr-70b

Viewer • Updated Nov 9, 2024 • 3.56M • 70

Note Children's stories generated by `nvidia/Llama-3.1-Nemotron-70B-Instruct`.
ivnle/llamatales-gre

Viewer • Updated Oct 21, 2024 • 2.02M • 10

Note Short stories generated by `meta-llama/Llama-3.1-8B-Instruct`.
ivnle/llamatales-jr

Viewer • Updated Oct 22, 2024 • 3.59M • 26

Note Children's stories generated by `meta-llama/Llama-3.1-8B-Instruct`.
ivnle/tinystories

Viewer • Updated Oct 22, 2024 • 4.97M • 9

Note Source: https://huggingface.co/datasets/roneneldan/TinyStories/blob/main/TinyStories_all_data.tar.gz
ivnle/fineweb

Viewer • Updated Oct 22, 2024 • 2.03M • 13

Note 1B token sample of FineWeb-Edu https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu Model checkpoints below. Naming format is [training data]-[layers]-[hidden size]-[heads]-[non-embedding parameter count].
ivnle/llamatales_gre_8b-lay8-hs512-hd8-33M

Text Generation • 0.2B • Updated Nov 14, 2024
ivnle/llamatales_gre_8b-lay8-hs384-hd6-18M

Text Generation • 0.1B • Updated Nov 14, 2024
ivnle/llamatales_gre_8b-lay4-hs384-hd6-9M

Text Generation • 0.1B • Updated Nov 14, 2024
ivnle/llamatales_gre_8b-lay4-hs128-hd2-1M

Text Generation • 33.9M • Updated Nov 14, 2024
ivnle/llamatales_gre_8b-lay2-hs128-hd2-524K

Text Generation • 33.4M • Updated Nov 14, 2024
ivnle/llamatales_gre_8b-lay1-hs128-hd2-262K

Text Generation • 33.1M • Updated Nov 14, 2024 • 2
ivnle/fineweb-lay8-hs512-hd8-33M

Text Generation • 0.2B • Updated Nov 14, 2024 • 3
ivnle/fineweb-lay8-hs384-hd6-18M

Text Generation • 0.1B • Updated Nov 14, 2024 • 1
ivnle/fineweb-lay4-hs384-hd6-9M

Text Generation • 0.1B • Updated Nov 14, 2024 • 4
ivnle/fineweb-lay4-hs128-hd2-1M

Text Generation • 33.9M • Updated Nov 14, 2024
ivnle/fineweb-lay2-hs128-hd2-524K

Text Generation • 33.4M • Updated Nov 14, 2024 • 2
ivnle/fineweb-lay1-hs128-hd2-262K

Text Generation • 33.1M • Updated Nov 14, 2024 • 1
ivnle/llamatales_jr_8b-lay8-hs512-hd8-33M

Text Generation • 0.2B • Updated Nov 14, 2024
ivnle/llamatales_jr_8b-lay8-hs384-hd6-18M

Text Generation • 0.1B • Updated Nov 14, 2024 • 2
ivnle/llamatales_jr_8b-lay4-hs384-hd6-9M

Text Generation • 0.1B • Updated Nov 14, 2024
ivnle/llamatales_jr_8b-lay4-hs128-hd2-1M

Text Generation • 33.9M • Updated Nov 14, 2024 • 5
ivnle/llamatales_jr_8b-lay2-hs128-hd2-524K

Text Generation • 33.4M • Updated Nov 14, 2024 • 6
ivnle/llamatales_jr_8b-lay1-hs128-hd2-262K

Text Generation • 33.1M • Updated Nov 14, 2024
ivnle/tinystories-lay8-hs512-hd8-33M

Text Generation • 0.2B • Updated Nov 14, 2024 • 2
ivnle/tinystories-lay8-hs384-hd6-18M

Text Generation • 0.1B • Updated Nov 14, 2024 • 2
ivnle/tinystories-lay4-hs384-hd6-9M

Text Generation • 0.1B • Updated Nov 14, 2024 • 2
ivnle/tinystories-lay4-hs128-hd2-1M

Text Generation • 33.9M • Updated Nov 14, 2024
ivnle/tinystories-lay2-hs128-hd2-524K

Text Generation • 33.4M • Updated Nov 14, 2024
ivnle/tinystories-lay1-hs128-hd2-262K

Text Generation • 33.1M • Updated Nov 14, 2024 • 6

Upvote