vishesh-t27 commited on
Commit
c6a866f
·
verified ·
1 Parent(s): 4cbe55e

updated Readme.md

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md CHANGED
@@ -92,6 +92,22 @@ The model is trained on English and a diverse set of Indic languages, including:
92
  ### Note
93
  Mobile-LLM model checkpoints are not publicly available; their results are reported directly from the original paper. All other models have been evaluated using `lm-eval` under a consistent setup. Human-Eval & GSM8K have been evaluated using Greedy-decoding for now for all models.
94
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
95
  ## Tokenization Fertility Score across Languages
96
 
97
  | Language | SmolLM3-3B | Qwen3-0.6B-Base | Sarvam-1 | Nandi-Mini-150M |
@@ -110,6 +126,7 @@ Mobile-LLM model checkpoints are not publicly available; their results are repor
110
  | Assamese | 9.26 | 8.13 | 4.31 | **1.51** |
111
 
112
 
 
113
  ## 🚀 Usage
114
 
115
  ```python
@@ -153,6 +170,7 @@ print(response)
153
  ```
154
 
155
 
 
156
  ## 📬 Feedback & Suggestions
157
 
158
  We’d love to hear your thoughts, feedback, and ideas!
 
92
  ### Note
93
  Mobile-LLM model checkpoints are not publicly available; their results are reported directly from the original paper. All other models have been evaluated using `lm-eval` under a consistent setup. Human-Eval & GSM8K have been evaluated using Greedy-decoding for now for all models.
94
 
95
+ ## Performance onf Finetuned Tasks
96
+
97
+ #### CrossSum-Hindi (CHRF) Results
98
+ We finetuned our model and other open source models on [Google's IndicGenBench](https://github.com/google-research-datasets/indic-gen-bench/) Crossum-Hindi. Nandi-mini-150M was able to outperform other models.
99
+
100
+ | Base Model | Before Finetune | After Finetune |
101
+ |------------------------|-----------------|----------------|
102
+ | Qwen-2-0.5-Base | 0.09 | 4.22 |
103
+ | Qwen2.5-0.5B-Base | 0.43 | 4.18 |
104
+ | SmolLM-135M-Base | 0.09 | 2.55 |
105
+ | SmolLM-360M-Base | 0.09 | 2.99 |
106
+ | SmolLM2-135M-Base | 0.09 | 2.67 |
107
+ | SmolLM2-360M-Base | 0.12 | 3.51 |
108
+ | Nandi-mini-150M | 0.10 | **4.37** |
109
+
110
+
111
  ## Tokenization Fertility Score across Languages
112
 
113
  | Language | SmolLM3-3B | Qwen3-0.6B-Base | Sarvam-1 | Nandi-Mini-150M |
 
126
  | Assamese | 9.26 | 8.13 | 4.31 | **1.51** |
127
 
128
 
129
+
130
  ## 🚀 Usage
131
 
132
  ```python
 
170
  ```
171
 
172
 
173
+
174
  ## 📬 Feedback & Suggestions
175
 
176
  We’d love to hear your thoughts, feedback, and ideas!