Harley-ml/LWTMoE-10M-A6M
Text Generation
Safetensors
Harley-ml/lesswrong
English
qwen3_moe
philosophy
lesswrong
Mixture of Experts
mixture-of-experts
small
tiny
small-language-model
arxiv: 2402.13744
License: mit
main
LWTMoE-10M-A6M
44.4 MB
1 contributor
History: 11 commits
Latest commit: Harley-ml | Update README.md | b4222f4 (verified) | 19 days ago
.gitattributes | Safe | 1.52 kB | initial commit | about 1 month ago
README.md | Safe | 13.9 kB | Update README.md | 19 days ago
config.json | Safe | 1.01 kB | Upload 4 files | about 1 month ago
generation_config.json | Safe | 152 Bytes | Upload 4 files | about 1 month ago
model.safetensors | 43.9 MB | xet | Upload 4 files | about 1 month ago
tokenizer.json | Safe | 537 kB | Upload 4 files | about 1 month ago