Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
205527.2
TFLOPS
1220
242
853
Lewis Tunstall
PRO
lewtun
Follow
minyichen's profile picture
colin-r-carter's profile picture
BisratWorku's profile picture
1,369 followers
·
131 following
https://lewtun.github.io/blog/
_lewtun
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
liked
a model
about 20 hours ago
Qwen/Qwen3.6-35B-A3B
upvoted
a
paper
about 20 hours ago
Embarrassingly Simple Self-Distillation Improves Code Generation
published
a model
1 day ago
lewtun/Qwen3-4B-Instruct-2507-SFT
View all activity
Organizations
lewtun
's models
293
Sort: Recently updated
lewtun/gemma-7b-dpo-full-mix2-beta-0.1
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
1
lewtun/gemma-7b-dpo-full-ultrafeedback-beta-0.01
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
2
lewtun/gemma-7b-dpo-full-mix1-beta-0.4-epoch-3
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
3
lewtun/gemma-7b-dpo-full-mix1-beta-0.01
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
1
lewtun/gemma-7b-dpo-full-mix1-beta-0.05
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
2
lewtun/gemma-7b-dpo-full-mix1-beta-0.1-epoch-3
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
6
lewtun/gemma-7b-dpo-full-mix1-beta-0.6
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
1
lewtun/gemma-7b-dpo-full-mix1-beta-0.4
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
1
lewtun/gemma-7b-dpo-full-mix1-beta-0.2
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
1
lewtun/gemma-7b-dpo-full-mix1-beta-0.1
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
3
lewtun/gemma-7b-dpo-full-ultrafeedback-v0
Text Generation
•
Updated
Feb 29, 2024
•
4
lewtun/gemma-7b-dpo-full-mix-beta-0.1
Updated
Feb 29, 2024
lewtun/gemma-7b-dpo-full-orca-v0
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
4
lewtun/gemma-7b-sft-full-deita-10k-v0
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
2
lewtun/gemma-7b-sft-full-ultrachat-v0
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
4
•
1
lewtun/gemma-7b-sft-full-longest-1k-v1
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
2
lewtun/gemma-7b-sft-full-longest-1k-v0
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
1
lewtun/gemma-7b-sft-full-dolly-v3
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
3
lewtun/gemma-7b-sft-full-dolly-v2
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
2
lewtun/gemma-7b-sft-full-dolly-v1
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
2
lewtun/gemma-7b-sft-full-dolly-v0
Text Generation
•
9B
•
Updated
Feb 29, 2024
•
3
lewtun/dummy-model
Text Generation
•
0.5B
•
Updated
Feb 21, 2024
•
4
lewtun/zephyr-7b-dpo-qlora-fix
Updated
Feb 2, 2024
•
3
lewtun/zephyr-7b-dpo-qlora-8e0975a
Updated
Jan 10, 2024
•
8
lewtun/zephyr-7b-dpo-qlora
Updated
Jan 9, 2024
•
38
lewtun/handbook-sft-qlora-test
Updated
Jan 9, 2024
•
6
lewtun/handbook-sft-test
Text Generation
•
7B
•
Updated
Jan 9, 2024
•
1
lewtun/zephyr-7b-dpo-full
Text Generation
•
7B
•
Updated
Jan 5, 2024
•
3
lewtun/zephyr-7b-sft-qlora
Updated
Jan 4, 2024
•
44
lewtun/kato-dummy
Text Classification
•
0.4B
•
Updated
Dec 22, 2023
•
2
Previous
1
2
3
4
5
6
...
10
Next