In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

liked a model about 20 hours ago

Qwen/Qwen3.6-35B-A3B

upvoted a paper about 20 hours ago

Embarrassingly Simple Self-Distillation Improves Code Generation

published a model 1 day ago

lewtun/Qwen3-4B-Instruct-2507-SFT

lewtun's models (293)

lewtun/gemma-7b-dpo-full-mix2-beta-0.1

Text Generation • 9B • Updated Feb 29, 2024 • 1

lewtun/gemma-7b-dpo-full-ultrafeedback-beta-0.01

Text Generation • 9B • Updated Feb 29, 2024 • 2

lewtun/gemma-7b-dpo-full-mix1-beta-0.4-epoch-3

Text Generation • 9B • Updated Feb 29, 2024 • 3

lewtun/gemma-7b-dpo-full-mix1-beta-0.01

Text Generation • 9B • Updated Feb 29, 2024 • 1

lewtun/gemma-7b-dpo-full-mix1-beta-0.05

Text Generation • 9B • Updated Feb 29, 2024 • 2

lewtun/gemma-7b-dpo-full-mix1-beta-0.1-epoch-3

Text Generation • 9B • Updated Feb 29, 2024 • 6

lewtun/gemma-7b-dpo-full-mix1-beta-0.6

Text Generation • 9B • Updated Feb 29, 2024 • 1

lewtun/gemma-7b-dpo-full-mix1-beta-0.4

Text Generation • 9B • Updated Feb 29, 2024 • 1

lewtun/gemma-7b-dpo-full-mix1-beta-0.2

Text Generation • 9B • Updated Feb 29, 2024 • 1

lewtun/gemma-7b-dpo-full-mix1-beta-0.1

Text Generation • 9B • Updated Feb 29, 2024 • 3

lewtun/gemma-7b-dpo-full-ultrafeedback-v0

Text Generation • Updated Feb 29, 2024 • 4

lewtun/gemma-7b-dpo-full-mix-beta-0.1

Updated Feb 29, 2024

lewtun/gemma-7b-dpo-full-orca-v0

Text Generation • 9B • Updated Feb 29, 2024 • 4

lewtun/gemma-7b-sft-full-deita-10k-v0

Text Generation • 9B • Updated Feb 29, 2024 • 2

lewtun/gemma-7b-sft-full-ultrachat-v0

Text Generation • 9B • Updated Feb 29, 2024 • 4 • 1

lewtun/gemma-7b-sft-full-longest-1k-v1

Text Generation • 9B • Updated Feb 29, 2024 • 2

lewtun/gemma-7b-sft-full-longest-1k-v0

Text Generation • 9B • Updated Feb 29, 2024 • 1

lewtun/gemma-7b-sft-full-dolly-v3

Text Generation • 9B • Updated Feb 29, 2024 • 3

lewtun/gemma-7b-sft-full-dolly-v2

Text Generation • 9B • Updated Feb 29, 2024 • 2

lewtun/gemma-7b-sft-full-dolly-v1

Text Generation • 9B • Updated Feb 29, 2024 • 2

lewtun/gemma-7b-sft-full-dolly-v0

Text Generation • 9B • Updated Feb 29, 2024 • 3

lewtun/dummy-model

Text Generation • 0.5B • Updated Feb 21, 2024 • 4

lewtun/zephyr-7b-dpo-qlora-fix

Updated Feb 2, 2024 • 3

lewtun/zephyr-7b-dpo-qlora-8e0975a

Updated Jan 10, 2024 • 8

lewtun/zephyr-7b-dpo-qlora

Updated Jan 9, 2024 • 38

lewtun/handbook-sft-qlora-test

Updated Jan 9, 2024 • 6

lewtun/handbook-sft-test

Text Generation • 7B • Updated Jan 9, 2024 • 1

lewtun/zephyr-7b-dpo-full

Text Generation • 7B • Updated Jan 5, 2024 • 3

lewtun/zephyr-7b-sft-qlora

Updated Jan 4, 2024 • 44

lewtun/kato-dummy

Text Classification • 0.4B • Updated Dec 22, 2023 • 2