Commit c2b755c · Parent(s): 95dc308
feat(model): add Hugging Face Hub publication support

- Add config.json with full model architecture specifications
- Enhance README.md with HF model card metadata:
  - Add license (Apache 2.0)
  - Include model-index with performance metrics
  - Add standardized tags for discoverability
- Prepare for model publication via huggingface_hub API

This commit prepares the model for publication on the Hugging Face Hub by:
1. Providing necessary configuration files
2. Adding required metadata for model cards
3. Documenting performance characteristics in standard format
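
The publication step mentioned in the commit message could look roughly like the sketch below. It is not taken from the repository: the repo id is a placeholder, `FILES` is simply the file list of this commit, and `missing_files` and `publish` are hypothetical helper names. It assumes `huggingface_hub` is installed and a write token is already configured.

```python
from pathlib import Path

# Files touched by this commit (copied from the commit's file list).
FILES = [
    "README.md",
    "config.json",
    "shoonya_model_v0_1.pt",
    "shoonya_model_v0_1_quantized.pt",
]


def missing_files(folder):
    """Return the expected files that are absent from `folder`."""
    return [name for name in FILES if not (Path(folder) / name).is_file()]


def publish(repo_id, folder="."):
    """Upload the commit's files to a Hub model repo (hypothetical helper)."""
    # Imported lazily so missing_files() works without huggingface_hub installed.
    from huggingface_hub import HfApi

    absent = missing_files(folder)
    if absent:
        raise FileNotFoundError(f"missing before upload: {absent}")

    api = HfApi()
    # exist_ok=True makes the call a no-op when the repo already exists.
    api.create_repo(repo_id=repo_id, repo_type="model", exist_ok=True)
    for name in FILES:
        api.upload_file(
            path_or_fileobj=str(Path(folder) / name),
            path_in_repo=name,
            repo_id=repo_id,
            repo_type="model",
        )


# publish("your-username/shoonya-v0.1")  # placeholder repo id, not from the commit
```

Checking for missing files first avoids a partial upload if one of the checkpoints was never pulled from LFS.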
- README.md +20 -0
- config.json +12 -0
- shoonya_model_v0_1.pt +3 -0
- shoonya_model_v0_1_quantized.pt +3 -0
README.md
CHANGED
@@ -1,5 +1,25 @@
 # Shoonya v0.1 - Lightweight CPU-Friendly Language Model
 
+---
+language:
+- en
+license: apache-2.0
+library_name: custom
+tags:
+- cpu-inference
+- lightweight
+- text-generation
+model-index:
+- name: Shoonya
+  results:
+  - task:
+      type: text-generation
+    metrics:
+    - type: throughput
+      value: "100ms/inference"
+      name: CPU Inference Speed
+---
+
 ## Model Description
 Shoonya is a lightweight transformer-based language model designed specifically for CPU inference. Built with efficiency in mind, it features a compact architecture while maintaining coherent text generation capabilities.
 
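
One thing worth checking in this hunk: the `---` block starts on line 3 of the new README.md, after the title. As far as I am aware, the Hub reads model card metadata only from a YAML block that opens the file, so the block may need to move above the heading. The sketch below is a standard-library approximation of that rule, not the Hub's actual parser; `leading_front_matter` is a hypothetical helper name and the sample strings are abbreviated from the diff.

```python
import re

# Front matter is recognised only when '---' is the first non-blank line
# and a closing '---' line follows later in the file.
FRONT_MATTER = re.compile(
    r"\A\s*---[ \t]*\r?\n(.*?)\r?\n---[ \t]*(\r?\n|\Z)", re.DOTALL
)


def leading_front_matter(readme_text):
    """Return the YAML body if the README opens with front matter, else None."""
    match = FRONT_MATTER.match(readme_text)
    return match.group(1) if match else None


# Layout as committed: the title comes first, metadata starts on line 3.
as_committed = (
    "# Shoonya v0.1 - Lightweight CPU-Friendly Language Model\n\n"
    "---\nlanguage:\n- en\nlicense: apache-2.0\n---\n\n## Model Description\n"
)

# Same content with the metadata block moved to the top of the file.
moved_to_top = (
    "---\nlanguage:\n- en\nlicense: apache-2.0\n---\n\n"
    "# Shoonya v0.1 - Lightweight CPU-Friendly Language Model\n"
)

print(leading_front_matter(as_committed) is None)
print(leading_front_matter(moved_to_top) is not None)
```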
config.json
ADDED
@@ -0,0 +1,12 @@
+{
+  "architectures": ["TransformerLM"],
+  "model_type": "transformer",
+  "n_layer": 4,
+  "n_head": 4,
+  "n_embd": 128,
+  "vocab_size": 50257,
+  "max_position_embeddings": 128,
+  "intermediate_size": 256,
+  "torch_dtype": "float32",
+  "transformers_version": "4.30.0"
+}
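
To get a feel for what these values imply, here is a rough parameter estimate. The layer layout is an assumption, not something the commit states: a GPT-2-style decoder with learned position embeddings, a fused QKV projection, biases on the linear layers, two LayerNorms per block, and an untied, bias-free output head. `estimate_parameters` is a hypothetical helper.

```python
# Values copied from the config.json added in this commit.
config = {
    "n_layer": 4,
    "n_head": 4,
    "n_embd": 128,
    "vocab_size": 50257,
    "max_position_embeddings": 128,
    "intermediate_size": 256,
}


def estimate_parameters(cfg, tied_output_head=False):
    """Rough parameter count for an ASSUMED GPT-2-style decoder layout."""
    d = cfg["n_embd"]
    ff = cfg["intermediate_size"]

    token_emb = cfg["vocab_size"] * d
    pos_emb = cfg["max_position_embeddings"] * d

    # Fused QKV projection plus output projection, both with biases.
    attention = (d * 3 * d + 3 * d) + (d * d + d)
    # Two-layer feed-forward block with biases.
    mlp = (d * ff + ff) + (ff * d + d)
    # Two LayerNorms per block, each with a weight and a bias vector.
    layer_norms = 2 * (2 * d)
    per_layer = attention + mlp + layer_norms

    final_norm = 2 * d
    # Assumed bias-free output head; zero extra weights if tied to token_emb.
    head = 0 if tied_output_head else cfg["vocab_size"] * d

    return token_emb + pos_emb + cfg["n_layer"] * per_layer + final_norm + head


params = estimate_parameters(config)
print(f"{params:,} parameters, about {params * 4 / 1e6:.1f} MB at float32")
```

Under these assumptions the untied estimate is roughly 13.4M parameters, or about 53.6 MB at float32, which is in the same range as the 53,675,947-byte `shoonya_model_v0_1.pt` pointer in this commit. That agreement is only suggestive and does not confirm the layout. Note also that the two vocabulary-sized matrices dominate the count; the four transformer blocks contribute only about 0.5M parameters.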
shoonya_model_v0_1.pt
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0559892e5f04adf325ccb914bfaa015ce61b1900e43cbb748c466eb1d224726d
+size 53675947
shoonya_model_v0_1_quantized.pt
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5588d58368becbd22837a24806cf30e0f77a9a989f1e114cbfb1f6e03bdf2642
+size 32824339
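
Both `.pt` entries are Git LFS pointer files, not the weights themselves: each records the spec version, a SHA-256 object id, and the real file size in bytes. A minimal sketch of reading them and comparing the two sizes follows; `parse_lfs_pointer` is a hypothetical helper, and the pointer text is copied from the hunks in this commit.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict of its fields."""
    # Each pointer line is "<key> <value>", split on the first space.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


full = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:0559892e5f04adf325ccb914bfaa015ce61b1900e43cbb748c466eb1d224726d\n"
    "size 53675947\n"
)
quantized = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:5588d58368becbd22837a24806cf30e0f77a9a989f1e114cbfb1f6e03bdf2642\n"
    "size 32824339\n"
)

ratio = quantized["size"] / full["size"]
print(f"quantized checkpoint is {ratio:.0%} of the float32 checkpoint size")
```

The quantized file is noticeably larger than a quarter of the original, which is what a pure int8 conversion of every tensor would give. The commit does not say which tensors were quantized, so the cause is left open here.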