vishesh-t27 commited on
Commit
ce078cb
·
verified ·
1 Parent(s): 3d205d0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -16,7 +16,7 @@ pipeline_tag: text-generation
16
  library_name: transformers
17
  ---
18
 
19
- # Nandi-Mini-500M-Early-Checkpoint
20
 
21
  ## Introduction
22
 
@@ -53,7 +53,7 @@ Stay tuned!
53
 
54
  ### Architectural Highlights
55
 
56
- Nandi-Mini-500M introduces several efficiency-focused architectural optimizations designed for compact yet capable language models.
57
 
58
  #### Shared KV (Shared Key-Value Vectors)
59
 
@@ -133,7 +133,7 @@ This remains an active research area within the Nandi model family, and we plan
133
 
134
  ## Tokenization Fertility Score Across Languages
135
 
136
- | Language | SmolLM3-3B | Qwen3-0.6B-Base | Sarvam-1 | Nandi-Mini-500M |
137
  |-----------|------------|-----------------|----------|------------------|
138
  | English | 1.17 | 1.16 | 1.32 | **1.18** |
139
  | Bengali | 8.66 | 7.51 | 1.55 | **1.44** |
 
16
  library_name: transformers
17
  ---
18
 
19
+ # Nandi-Mini-600M-Early-Checkpoint
20
 
21
  ## Introduction
22
 
 
53
 
54
  ### Architectural Highlights
55
 
56
+ Nandi-Mini-600M introduces several efficiency-focused architectural optimizations designed for compact yet capable language models.
57
 
58
  #### Shared KV (Shared Key-Value Vectors)
59
 
 
133
 
134
  ## Tokenization Fertility Score Across Languages
135
 
136
+ | Language | SmolLM3-3B | Qwen3-0.6B-Base | Sarvam-1 | Nandi-Mini-600M |
137
  |-----------|------------|-----------------|----------|------------------|
138
  | English | 1.17 | 1.16 | 1.32 | **1.18** |
139
  | Bengali | 8.66 | 7.51 | 1.55 | **1.44** |