metadata
language:
- ta
- en
license: apache-2.0
library_name: llama.cpp
tags:
- tamil
- bilingual
- nlp
- classification
- crossberryweb
- gguf
extra_gated_heading: Tamila Model Access
extra_gated_button_content: Acknowledge
π Tamila Master v0.3
Created by crossberryweb
Tamila is a high-performance bilingual model (Tamil/English) trained on a massive global corpus of over 2.2 million segments.
π Project Links
- Live Demo (HF Space): https://huggingface.co/spaces/Crossberry/tamila-test-app
- Web Deployment: crossberry.vercel.app
- Dataset Repository: Hugging Face Tamila
π Model Benchmarks
| Task | Dataset | Accuracy | Loss |
|---|---|---|---|
| Global Corpus Tuning | 2.2M Segments | 1.0000 | 6.64e-10 |
| Literature (Thirukkural) | Kaggle NLP | 0.9868 | 0.0612 |
| Technical (Kimi K2) | PDF Extract | 1.0000 | 1.17e-06 |
π Future Roadmap
- Integration with advanced Transformer architectures.
- Expanded support for regional Tamil dialects.
- Real-time API integration for mobile applications.
π More Info
This model utilizes a custom MLP architecture optimized for GGUF deployment. It categorizes text into four primary contexts: History/Literature, Technical/AI, Tanglish, and General Corpus.
Developed for the open-source community by Crossberryweb.