Kristian Schwethelm
KristianS7
kschwethelm
AI & ML interests
Large Language Models
Recent Activity
updated a dataset 10 days ago: KristianS7/prepacked-fineweb-edu-llama2-32K-T2048
published a dataset 10 days ago: KristianS7/prepacked-fineweb-edu-llama2-32K-T2048
updated a dataset 16 days ago: KristianS7/nanochat-prepacked-fineweb-edu
Organizations
None yet
KristianS7's activity
updated a dataset 10 days ago
KristianS7/prepacked-fineweb-edu-llama2-32K-T2048 • Updated 10 days ago • 3.35k
published a dataset 10 days ago
KristianS7/prepacked-fineweb-edu-llama2-32K-T2048 • Updated 10 days ago • 3.35k
updated a dataset 16 days ago
KristianS7/nanochat-prepacked-fineweb-edu • Updated 16 days ago • 1.42k
published a dataset about 1 month ago
KristianS7/nanochat-prepacked-fineweb-edu • Updated 16 days ago • 1.42k
updated 2 models about 1 month ago
KristianS7/Ouro-1.4B • Text Generation • Updated Mar 11 • 32
KristianS7/Ouro-1.4B-Thinking • Text Generation • Updated Mar 11 • 26
New activity in ByteDance/Ouro-2.6B • about 1 month ago
Lower evaluation results • 1 • #2 opened 4 months ago by MianchuWang
New activity in ByteDance/Ouro-1.4B • about 1 month ago
Differences in the results of the reproduction test on lm-evaluation-harness • 3 • #8 opened 3 months ago by ThreeGold116
New activity in ByteDance/Ouro-2.6B • about 1 month ago
Fix bos/eos token IDs (config.json + tokenizer_config.json) • #5 opened about 1 month ago by KristianS7
Fix UniversalTransformerCache.get_mask_sizes for batched generation • #4 opened about 1 month ago by KristianS7
New activity in ByteDance/Ouro-1.4B • about 1 month ago
Fix bos/eos token IDs (config.json + tokenizer_config.json) • #11 opened about 1 month ago by KristianS7
Fix UniversalTransformerCache.get_mask_sizes for batched generation • #10 opened about 1 month ago by KristianS7
New activity in ByteDance/Ouro-1.4B-Thinking • about 2 months ago
Fix UniversalTransformerCache.get_mask_sizes for batched generation • 1 • #5 opened about 2 months ago by KristianS7
New activity in ByteDance/Ouro-2.6B-Thinking • about 2 months ago
Fix UniversalTransformerCache.get_mask_sizes for batched generation • 1 • #8 opened about 2 months ago by KristianS7
New activity in ByteDance/Ouro-1.4B • about 2 months ago
Batched generation (batch_size > 1) produces incorrect outputs — possible causal mask issue? • ➕ 1 • 1 • #9 opened 3 months ago by vconchel
New activity in ByteDance/Ouro-2.6B-Thinking • about 2 months ago
Fix bos/eos token IDs + add enable_thinking to chat template • 2 • #7 opened about 2 months ago by KristianS7
New activity in ByteDance/Ouro-1.4B-Thinking • about 2 months ago
Fix bos/eos token IDs + add enable_thinking to chat template • 2 • #4 opened about 2 months ago by KristianS7