You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Description

Paramanu-Hindi is a 367 million parameters open source monolingual generative pretrained Decoder Auto Regressive Language Model for Hindi.

This is a pretrained model from scratch at a context size of 1024.

This model is not either chat-tuned or fine-tuned.

We recommend to fine-tune/chat-tune this pretrained model on Hindi chat or Hindi instruction datasets. Please use PyTorch for fine-tuning/instruction-tuning.

We also recommend to perform continual pretraining on Hindi dataset if possible before fine-tuning.

This model is strictly prohibited to use for commercial purposes.

If you use our model, please cite our paper Niyogi et al., 2026

Model Architecture

Transformer Decoder Auto Regressive Model

Limitations

The model was trained on data that contains toxic language, unsafe content, and societal biases originally crawled from the internet. Therefore, the model may amplify those biases and return toxic responses especially when prompted with toxic prompts. The model may generate answers that may be inaccurate, omit key information, or include irrelevant or redundant text producing socially unacceptable or undesirable text, even if the prompt itself does not include anything explicitly offensive.

Citations

@misc{niyogi2026paramanucompactcompetitivemonolingual,
      title={Paramanu: Compact and Competitive Monolingual Language Models for Low-Resource Morphologically Rich Indian Languages}, 
      author={Mitodru Niyogi and Eric Gaussier and Arnab Bhattacharya},
      year={2026},
      eprint={2401.18034},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2401.18034}, 
}

Downloads last month: 3

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for gyanai/paramanu-hindi-367M-hf

Paramanu: A Family of Novel Efficient Indic Generative Foundation Language Models

Paper • 2401.18034 • Published Jan 31, 2024