vishesh-t27 committed on
Commit f5213b6 · verified · 1 Parent(s): 4331f85

updated readme.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -67,7 +67,7 @@ The model is trained on English and a diverse set of Indic languages, including:
 
  ## 📊 Benchmark Comparison (Nandi-150M Focus)
 
- | Model Name | Parameters (M) | Tokens Budget (B) | HellaSwag | Winogrande | GPQA | MMLU | GSM8K | HumanEval | Average |
+ | Model Name | Parameters | Tokens(B) | HellaSwag | Winogrande | GPQA | MMLU | GSM8K | HumanEval | Average |
  |------------------|---------------|------------------|----------|------------|------|------|-------|-----------|---------|
  | Mobile-LLM-125M | 125 | 1000 | 38.90 | 53.10 | - | - | - | - | - |
  | SmolLM-135M-Base | 135 | 600 | 42.66| 53.03 | 25.44| 25.30| 1.36 | 0.00 | 24.63 |
@@ -77,7 +77,7 @@ The model is trained on English and a diverse set of Indic languages, including:
 
  ## 📊 Model Benchmark Comparison With Bigger Models (350M–600M Class)
 
- | Model Name | Parameters (M) | Tokens Budget (B) | HellaSwag | Winogrande | GPQA | MMLU | GSM8K | HumanEval | Average |
+ | Model Name | Parameters | Tokens(B) | HellaSwag | Winogrande | GPQA | MMLU | GSM8K | HumanEval | Average |
  |---------------------|---------------|------------------|----------|------------|------|------|-------|-----------|---------|
  | Mobile-LLM-360M | 350 | 1000 | 49.60 | 56.59 | - | - | - | - | - |
  | Qwen-2-0.5-Base | 500 | 12000 | 49.01 | 57.69 | 27.23| 44.06| 10.61 | 22.56 | 35.19 |
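
The README does not say how the Average column is computed. In the two fully populated rows visible in this diff, it is consistent with an unweighted mean of the six benchmark scores (HellaSwag, Winogrande, GPQA, MMLU, GSM8K, HumanEval). The sketch below checks that reading; the helper names `mean_score` and `check_rows` are hypothetical, and the mean-of-six rule is an inference from these rows, not a documented definition.

```python
# Scores copied from the two complete rows shown in the diff context.
# Order: HellaSwag, Winogrande, GPQA, MMLU, GSM8K, HumanEval; then reported Average.
ROWS = {
    "SmolLM-135M-Base": ([42.66, 53.03, 25.44, 25.30, 1.36, 0.00], 24.63),
    "Qwen-2-0.5-Base": ([49.01, 57.69, 27.23, 44.06, 10.61, 22.56], 35.19),
}


def mean_score(scores):
    """Unweighted arithmetic mean of the benchmark scores (assumed rule)."""
    return sum(scores) / len(scores)


def check_rows(rows, tol=0.005):
    """Return {model: True/False} for whether the reported average
    matches the recomputed mean to two decimal places."""
    return {
        name: abs(mean_score(scores) - reported) <= tol
        for name, (scores, reported) in rows.items()
    }


if __name__ == "__main__":
    for name, ok in check_rows(ROWS).items():
        status = "matches" if ok else "differs"
        print(f"{name}: reported average {status} the mean of six scores")
```

Rows with `-` entries (the Mobile-LLM models) have no reported average, so they are left out of the check.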