Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
ViperEk
/
KHAOSZ
like
2
Safetensors
BelleGroup/train_3.5M_CN
YeungNLP/moss-003-sft-data
llm-wizard/alpaca-gpt4-data-zh
Chinese
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
KHAOSZ
/
config.json
ViperEk
Upload 2 files
7b16b0a
about 1 month ago
raw
Copy download link
history
blame
contribute
delete
Safe
195 Bytes
{
"vocab_size"
:
100000
,
"dim"
:
1536
,
"n_heads"
:
24
,
"n_layers"
:
24
,
"max_len"
:
2048
,
"norm_eps"
:
1e-05
,
"dim_ffn"
:
6912
,
"tie_weight"
:
false
,
"n_kv_heads"
:
4
}