Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
VectorNomad
/
autoresearch-540m
like
3
Text Generation
Arabic
English
arabic
multilingual
from-scratch
nanochat
curriculum-learning
License:
cc-by-nc-4.0
Model card
Files
Files and versions
xet
Community
main
autoresearch-540m
9.67 GB
Ctrl+K
Ctrl+K
1 contributor
History:
12 commits
Ahmed
Upload sft/sft_step9000.pt with huggingface_hub
1ae5538
verified
about 1 month ago
checkpoints
Add files using upload-large-folder tool
about 1 month ago
optimizer
Add files using upload-large-folder tool
about 1 month ago
sft
Upload sft/sft_step9000.pt with huggingface_hub
about 1 month ago
.gitattributes
Safe
1.57 kB
Upload throughput_mfu.png with huggingface_hub
about 1 month ago
README.md
Safe
7.82 kB
Upload README.md with huggingface_hub
about 1 month ago
inference.py
Safe
5.27 kB
Upload inference.py with huggingface_hub
about 1 month ago
meta.json
Safe
1.43 kB
Add files using upload-large-folder tool
about 1 month ago
model.pt
pickle
Detected Pickle imports (4)
"torch._utils._rebuild_tensor_v2"
,
"torch.BFloat16Storage"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
What is a pickle import?
1.54 GB
xet
Add files using upload-large-folder tool
about 1 month ago
throughput_mfu.png
116 kB
xet
Upload throughput_mfu.png with huggingface_hub
about 1 month ago
token_bytes.npy
131 kB
xet
Add files using upload-large-folder tool
about 1 month ago
tokenizer.pkl
pickle
Detected Pickle imports (1)
"tiktoken.core.Encoding"
How to fix it?
477 kB
xet
Add files using upload-large-folder tool
about 1 month ago
training_loss.csv
Safe
347 kB
Upload training_loss.csv with huggingface_hub
about 1 month ago
training_loss.png
Safe
81 kB
Upload training_loss.png with huggingface_hub
about 1 month ago
val_bpb.csv
Safe
742 Bytes
Upload val_bpb.csv with huggingface_hub
about 1 month ago
val_bpb.png
Safe
93.3 kB
Upload val_bpb.png with huggingface_hub
about 1 month ago