Instructions to use FrontiersMind/Nandi-Mini-150M with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use FrontiersMind/Nandi-Mini-150M with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="FrontiersMind/Nandi-Mini-150M", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("FrontiersMind/Nandi-Mini-150M", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use FrontiersMind/Nandi-Mini-150M with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "FrontiersMind/Nandi-Mini-150M" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "FrontiersMind/Nandi-Mini-150M", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/FrontiersMind/Nandi-Mini-150M
- SGLang
How to use FrontiersMind/Nandi-Mini-150M with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "FrontiersMind/Nandi-Mini-150M" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "FrontiersMind/Nandi-Mini-150M", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "FrontiersMind/Nandi-Mini-150M" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "FrontiersMind/Nandi-Mini-150M", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use FrontiersMind/Nandi-Mini-150M with Docker Model Runner:
docker model run hf.co/FrontiersMind/Nandi-Mini-150M
Update README.md
Browse files
README.md
CHANGED
|
@@ -94,20 +94,20 @@ Mobile-LLM model checkpoints are not publicly available; their results are repor
|
|
| 94 |
|
| 95 |
## Tokenization Fertility Score across Languages
|
| 96 |
|
| 97 |
-
| Language | SmolLM3-3B | Qwen3-0.6B-Base | Sarvam-
|
| 98 |
-
|----------|------------|-----------------|----------
|
| 99 |
-
| English
|
| 100 |
-
| Bengali
|
| 101 |
-
| Gujarati
|
| 102 |
-
| Hindi
|
| 103 |
-
| Kannada
|
| 104 |
-
| Malayalam| 17.77 | 14.56 | 2.
|
| 105 |
-
| Marathi
|
| 106 |
-
| Oriya
|
| 107 |
-
| Punjabi
|
| 108 |
-
| Tamil
|
| 109 |
-
| Telugu
|
| 110 |
-
| Assamese
|
| 111 |
|
| 112 |
|
| 113 |
## 🚀 Usage
|
|
|
|
| 94 |
|
| 95 |
## Tokenization Fertility Score across Languages
|
| 96 |
|
| 97 |
+
| Language | SmolLM3-3B | Qwen3-0.6B-Base | Sarvam-1 | Nandi-Mini-150M |
|
| 98 |
+
|-----------|------------|-----------------|----------|------------------|
|
| 99 |
+
| English | 1.17 | 1.16 | 1.32 | **1.18** |
|
| 100 |
+
| Bengali | 8.66 | 7.51 | 1.55 | **1.44** |
|
| 101 |
+
| Gujarati | 10.47 | 9.37 | 1.55 | **1.53** |
|
| 102 |
+
| Hindi | 2.71 | 5.14 | **1.25** | 1.32 |
|
| 103 |
+
| Kannada | 16.43 | 12.96 | 2.10 | **1.90** |
|
| 104 |
+
| Malayalam | 17.77 | 14.56 | 2.49 | **2.05** |
|
| 105 |
+
| Marathi | 3.73 | 6.70 | 1.55 | **1.55** |
|
| 106 |
+
| Oriya | 19.07 | 15.75 |**2.18** | 2.68 |
|
| 107 |
+
| Punjabi | 9.23 | 8.66 | 1.47 | **1.42** |
|
| 108 |
+
| Tamil | 13.56 | 10.93 | 2.06 | **2.05** |
|
| 109 |
+
| Telugu | 15.40 | 13.38 | 2.09 | **1.77** |
|
| 110 |
+
| Assamese | 9.26 | 8.13 | 4.31 | **1.51** |
|
| 111 |
|
| 112 |
|
| 113 |
## 🚀 Usage
|