vishesh-t27 commited on
Commit
7bee726
·
verified ·
1 Parent(s): f5213b6

updated Readme.md

Browse files
Files changed (1) hide show
  1. README.md +7 -6
README.md CHANGED
@@ -35,9 +35,9 @@ Nandi-Mini-150M brings the following key features:
35
  We’re just getting started with the Nandi series 🚀
36
 
37
  - **Nandi-Mini-150M (Base)** — *Available now*
38
- - **Nandi-Mini-150M (Instruct)** — Coming soon (open-sourced)
39
- - **Nandi-Mini-500M (Base + Instruct)** — Planned next
40
- - **Nandi-Mini-1B (Base + Instruct)** — Final milestone in the current roadmap
41
 
42
  We are actively working on expanding the Nandi family to cover a wider range of use cases—from lightweight edge deployments to more capable instruction-tuned systems.
43
 
@@ -65,7 +65,7 @@ The model is trained on English and a diverse set of Indic languages, including:
65
 
66
  ## Benchmark Results
67
 
68
- ## 📊 Benchmark Comparison (Nandi-150M Focus)
69
 
70
  | Model Name | Parameters | Tokens(B) | HellaSwag | Winogrande | GPQA | MMLU | GSM8K | HumanEval | Average |
71
  |------------------|---------------|------------------|----------|------------|------|------|-------|-----------|---------|
@@ -75,7 +75,7 @@ The model is trained on English and a diverse set of Indic languages, including:
75
  | **Nandi-Mini-150M-Base** | **150** | **500** | 37.20 | 52.32 | **28.57** | **28.86** | **2.58** | **4.27** | **25.63** |
76
 
77
 
78
- ## 📊 Model Benchmark Comparison With Bigger Models (350M–600M Class)
79
 
80
  | Model Name | Parameters | Tokens(B) | HellaSwag | Winogrande | GPQA | MMLU | GSM8K | HumanEval | Average |
81
  |---------------------|---------------|------------------|----------|------------|------|------|-------|-----------|---------|
@@ -87,7 +87,8 @@ The model is trained on English and a diverse set of Indic languages, including:
87
  | SmolLM2-360M-Base | 360 | 40000 | 56.30 | 59.19 | 25.22| 25.55| 2.88 | 0.00 | 28.19 |
88
  | **Nandi-Mini-150M-Base** | **150** | 500 | 37.20| 52.32 | 28.57 | 28.86 | 2.58 | 4.27 | 25.63 |
89
 
90
-
 
91
 
92
  ## 🚀 Usage
93
 
 
35
  We’re just getting started with the Nandi series 🚀
36
 
37
  - **Nandi-Mini-150M (Base)** — *Available now*
38
+ - **Nandi-Mini-150M (Instruct)** — Open Sourcing Next week
39
+ - **Nandi-Mini-500M (Base + Instruct)** — Training Going On
40
+ - **Nandi-Mini-1B (Base + Instruct)** — Training Going On
41
 
42
  We are actively working on expanding the Nandi family to cover a wider range of use cases—from lightweight edge deployments to more capable instruction-tuned systems.
43
 
 
65
 
66
  ## Benchmark Results
67
 
68
+ ## 📊 Benchmark Comparison (~150M Class)
69
 
70
  | Model Name | Parameters | Tokens(B) | HellaSwag | Winogrande | GPQA | MMLU | GSM8K | HumanEval | Average |
71
  |------------------|---------------|------------------|----------|------------|------|------|-------|-----------|---------|
 
75
  | **Nandi-Mini-150M-Base** | **150** | **500** | 37.20 | 52.32 | **28.57** | **28.86** | **2.58** | **4.27** | **25.63** |
76
 
77
 
78
+ ## 📊 Model Benchmark Comparison With Slightly Bigger Models (350M–600M Class)
79
 
80
  | Model Name | Parameters | Tokens(B) | HellaSwag | Winogrande | GPQA | MMLU | GSM8K | HumanEval | Average |
81
  |---------------------|---------------|------------------|----------|------------|------|------|-------|-----------|---------|
 
87
  | SmolLM2-360M-Base | 360 | 40000 | 56.30 | 59.19 | 25.22| 25.55| 2.88 | 0.00 | 28.19 |
88
  | **Nandi-Mini-150M-Base** | **150** | 500 | 37.20| 52.32 | 28.57 | 28.86 | 2.58 | 4.27 | 25.63 |
89
 
90
+ ### Note
91
+ Mobile-LLM model checkpoints are not publicly available; their results are reported directly from the original paper. All other models have been evaluated using `lm-eval` under a consistent setup.
92
 
93
  ## 🚀 Usage
94