Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Polygl0t 's Collections
Tucano2
LilMoo
LilTii
ViTucano-v1 (Portuguese)
Tucano (Portuguese)
TeenyTinyLlama (Portuguese)

LilTii

updated Mar 5

A 0.6B Bengali Language Model that Outperforms Qwen.

Upvote
1

  • Polygl0t/LilTii-v0.1

    Text Generation β€’ 0.7B β€’ Updated Mar 5 β€’ 14

    Note 🧱 Base model pretrained only with Bengali text.


  • Polygl0t/LilTii-v0.2

    Text Generation β€’ 0.7B β€’ Updated Mar 5 β€’ 24

    Note 🧱 Base model pretrained with a Bengali + English mixture.


  • Polygl0t/gigakriya-v1

    Viewer β€’ Updated Mar 5 β€’ 41.6M β€’ 80

    Note πŸ“š Pretraining dataset.


  • Polygl0t/bengali-edu-qwen-annotations

    Viewer β€’ Updated Mar 5 β€’ 320k β€’ 30

    Note πŸ“š Annotations to train classifiers/filters (Educational).


  • Polygl0t/bengali-toxicity-qwen-annotations

    Viewer β€’ Updated Mar 5 β€’ 320k β€’ 26

    Note πŸ“š Annotations to train classifiers/filters (Toxicity).


  • Polygl0t/bengali-banglabert-edu-classifier

    Text Classification β€’ 34.7M β€’ Updated Mar 5 β€’ 2

    Note 🎯 Quality Filter (Educational)


  • Polygl0t/bengali-banglabert-toxicity-classifier

    Text Classification β€’ 34.7M β€’ Updated Mar 5

    Note 🎯 Quality Filter (Toxicity)


  • Polygl0t/tokenizers

    Viewer β€’ Updated Mar 5 β€’ 8.98M β€’ 717

    Note πŸ“š Data used to train the LilTii tokenizer.

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs