Instructions to use ShakhawatShanin/Bangla-Text-Summarization with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use ShakhawatShanin/Bangla-Text-Summarization with llama-cpp-python:

# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="ShakhawatShanin/Bangla-Text-Summarization",
	filename="Bangla_Text_Summarization/bangla_summarization.Q8_0.gguf",
)

llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

Notebooks
Google Colab
Kaggle
Local Apps

llama.cpp

How to use ShakhawatShanin/Bangla-Text-Summarization with llama.cpp:

Install from brew

brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf ShakhawatShanin/Bangla-Text-Summarization:Q8_0
# Run inference directly in the terminal:
llama-cli -hf ShakhawatShanin/Bangla-Text-Summarization:Q8_0

Install from WinGet (Windows)

winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf ShakhawatShanin/Bangla-Text-Summarization:Q8_0
# Run inference directly in the terminal:
llama-cli -hf ShakhawatShanin/Bangla-Text-Summarization:Q8_0

Use pre-built binary

# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf ShakhawatShanin/Bangla-Text-Summarization:Q8_0
# Run inference directly in the terminal:
./llama-cli -hf ShakhawatShanin/Bangla-Text-Summarization:Q8_0

Build from source code

git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf ShakhawatShanin/Bangla-Text-Summarization:Q8_0
# Run inference directly in the terminal:
./build/bin/llama-cli -hf ShakhawatShanin/Bangla-Text-Summarization:Q8_0

Use Docker

docker model run hf.co/ShakhawatShanin/Bangla-Text-Summarization:Q8_0

LM Studio
Jan
Ollama
How to use ShakhawatShanin/Bangla-Text-Summarization with Ollama:
```
ollama run hf.co/ShakhawatShanin/Bangla-Text-Summarization:Q8_0
```

Unsloth Studio new

How to use ShakhawatShanin/Bangla-Text-Summarization with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for ShakhawatShanin/Bangla-Text-Summarization to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for ShakhawatShanin/Bangla-Text-Summarization to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for ShakhawatShanin/Bangla-Text-Summarization to start chatting

Docker Model Runner
How to use ShakhawatShanin/Bangla-Text-Summarization with Docker Model Runner:
```
docker model run hf.co/ShakhawatShanin/Bangla-Text-Summarization:Q8_0
```

Lemonade

How to use ShakhawatShanin/Bangla-Text-Summarization with Lemonade:

Pull the model

# Download Lemonade from https://lemonade-server.ai/
lemonade pull ShakhawatShanin/Bangla-Text-Summarization:Q8_0

Run and chat with the model

lemonade run user.Bangla-Text-Summarization-Q8_0

List all available models

lemonade list

ShakhawatShanin commited on Nov 2, 2025

Commit

71db1c1

verified ·

1 Parent(s): 6fd4701

Upload 19 files

Browse files

Files changed (20) hide show

.gitattributes +2 -0
Bangla_Text_Summarization/Modelfile.txt +27 -0
Bangla_Text_Summarization/bangla_summarization.Q8_0.gguf +3 -0
Bangla_Text_Summarization/bangla_summarization_adapter/README.md +63 -0
Bangla_Text_Summarization/bangla_summarization_adapter/adapter_config.json +46 -0
Bangla_Text_Summarization/bangla_summarization_adapter/adapter_model.safetensors +3 -0
Bangla_Text_Summarization/bangla_summarization_adapter/added_tokens.json +3 -0
Bangla_Text_Summarization/bangla_summarization_adapter/chat_template.jinja +47 -0
Bangla_Text_Summarization/bangla_summarization_adapter/preprocessor_config.json +29 -0
Bangla_Text_Summarization/bangla_summarization_adapter/processor_config.json +4 -0
Bangla_Text_Summarization/bangla_summarization_adapter/special_tokens_map.json +33 -0
Bangla_Text_Summarization/bangla_summarization_adapter/tokenizer.json +3 -0
Bangla_Text_Summarization/bangla_summarization_adapter/tokenizer.model +3 -0
Bangla_Text_Summarization/bangla_summarization_adapter/tokenizer_config.json +0 -0
Bangla_Text_Summarization/bts_finetune.ipynb +0 -0
Bangla_Text_Summarization/data.csv +0 -0
Modelfile +26 -0
main.py +28 -0
static/icon.png +0 -0
templates/index.html +99 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+Bangla_Text_Summarization/bangla_summarization_adapter/tokenizer.json filter=lfs diff=lfs merge=lfs -text
+Bangla_Text_Summarization/bangla_summarization.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text

Bangla_Text_Summarization/Modelfile.txt ADDED Viewed

	@@ -0,0 +1,27 @@

+FROM /content/bangla_summarization_model_merged.Q8_0.gguf
+# Set the model parameters
+PARAMETER num_ctx 2048
+PARAMETER num_batch 512
+PARAMETER num_gpu 1
+# Template for chat format
+TEMPLATE """<start_of_turn>user
+সংক্ষেপ করুন: {{ .Prompt }}<end_of_turn>
+<start_of_turn>model
+"""
+# System prompt
+SYSTEM """You are a helpful AI assistant specialized in summarizing Bengali text.
+Your task is to create concise, accurate summaries of Bengali articles while preserving the key information and meaning.
+Always respond in Bengali.
+Keep summaries clear and to the point.
+Focus on the main ideas and important details."""
+# Model configuration
+PARAMETER temperature 0.1
+PARAMETER top_k 40
+PARAMETER top_p 0.9
+PARAMETER stop "<end_of_turn>"
+PARAMETER repeat_penalty 1.1

Bangla_Text_Summarization/bangla_summarization.Q8_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2e6cb247908e9fc841d4698b3b677a9ff86eaf92c755c205235ce4e124b8105a
+size 4130401952

Bangla_Text_Summarization/bangla_summarization_adapter/README.md ADDED Viewed

	@@ -0,0 +1,63 @@

+---
+base_model: unsloth/gemma-3-4b-it-unsloth-bnb-4bit
+library_name: peft
+model_name: bangla_summarization_model
+tags:
+- base_model:adapter:unsloth/gemma-3-4b-it-unsloth-bnb-4bit
+- lora
+- sft
+- transformers
+- trl
+- unsloth
+licence: license
+pipeline_tag: text-generation
+---
+# Model Card for bangla_summarization_model
+This model is a fine-tuned version of [unsloth/gemma-3-4b-it-unsloth-bnb-4bit](https://huggingface.co/unsloth/gemma-3-4b-it-unsloth-bnb-4bit).
+It has been trained using [TRL](https://github.com/huggingface/trl).
+## Quick start
+```python
+from transformers import pipeline
+question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
+generator = pipeline("text-generation", model="None", device="cuda")
+output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
+print(output["generated_text"])
+```
+## Training procedure
+This model was trained with SFT.
+### Framework versions
+- PEFT 0.17.1
+- TRL: 0.22.2
+- Transformers: 4.55.4
+- Pytorch: 2.8.0+cu126
+- Datasets: 3.6.0
+- Tokenizers: 0.21.4
+## Citations
+Cite TRL as:
+```bibtex
+@misc{vonwerra2022trl,
+	title        = {{TRL: Transformer Reinforcement Learning}},
+	author       = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallou{\'e}dec},
+	year         = 2020,
+	journal      = {GitHub repository},
+	publisher    = {GitHub},
+	howpublished = {\url{https://github.com/huggingface/trl}}
+}
+```

Bangla_Text_Summarization/bangla_summarization_adapter/adapter_config.json ADDED Viewed

	@@ -0,0 +1,46 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": {
+    "base_model_class": "Gemma3ForConditionalGeneration",
+    "parent_library": "transformers.models.gemma3.modeling_gemma3",
+    "unsloth_fixed": true
+  },
+  "base_model_name_or_path": "unsloth/gemma-3-4b-it-unsloth-bnb-4bit",
+  "bias": "none",
+  "corda_config": null,
+  "eva_config": null,
+  "exclude_modules": null,
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 16,
+  "lora_bias": false,
+  "lora_dropout": 0,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "qalora_group_size": 16,
+  "r": 16,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "up_proj",
+    "o_proj",
+    "v_proj",
+    "k_proj",
+    "q_proj",
+    "down_proj",
+    "gate_proj"
+  ],
+  "target_parameters": null,
+  "task_type": "CAUSAL_LM",
+  "trainable_token_indices": null,
+  "use_dora": false,
+  "use_qalora": false,
+  "use_rslora": false
+}

Bangla_Text_Summarization/bangla_summarization_adapter/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:537d5c2a66653bc29d9ba05b9c4b2e31f3c037f409830052a3582e3a66097a05
+size 131252288

Bangla_Text_Summarization/bangla_summarization_adapter/added_tokens.json ADDED Viewed

	@@ -0,0 +1,3 @@

+{
+  "<image_soft_token>": 262144
+}

Bangla_Text_Summarization/bangla_summarization_adapter/chat_template.jinja ADDED Viewed

	@@ -0,0 +1,47 @@

+{{ bos_token }}
+{%- if messages[0]['role'] == 'system' -%}
+    {%- if messages[0]['content'] is string -%}
+        {%- set first_user_prefix = messages[0]['content'] + '
+' -%}
+    {%- else -%}
+        {%- set first_user_prefix = messages[0]['content'][0]['text'] + '
+' -%}
+    {%- endif -%}
+    {%- set loop_messages = messages[1:] -%}
+{%- else -%}
+    {%- set first_user_prefix = "" -%}
+    {%- set loop_messages = messages -%}
+{%- endif -%}
+{%- for message in loop_messages -%}
+    {%- if (message['role'] == 'user') != (loop.index0 % 2 == 0) -%}
+        {{ raise_exception("Conversation roles must alternate user/assistant/user/assistant/...") }}
+    {%- endif -%}
+    {%- if (message['role'] == 'assistant') -%}
+        {%- set role = "model" -%}
+    {%- else -%}
+        {%- set role = message['role'] -%}
+    {%- endif -%}
+    {{ '<start_of_turn>' + role + '
+' + (first_user_prefix if loop.first else "") }}
+    {%- if message['content'] is string -%}
+        {{ message['content'] | trim }}
+    {%- elif message['content'] is iterable -%}
+        {%- for item in message['content'] -%}
+            {%- if item['type'] == 'image' -%}
+                {{ '<start_of_image>' }}
+            {%- elif item['type'] == 'text' -%}
+                {{ item['text'] | trim }}
+            {%- endif -%}
+        {%- endfor -%}
+    {%- else -%}
+        {{ raise_exception("Invalid content type") }}
+    {%- endif -%}
+    {{ '<end_of_turn>
+' }}
+{%- endfor -%}
+{%- if add_generation_prompt -%}
+    {{ '<start_of_turn>model
+' }}
+{%- endif -%}

Bangla_Text_Summarization/bangla_summarization_adapter/preprocessor_config.json ADDED Viewed

	@@ -0,0 +1,29 @@

+{
+  "do_convert_rgb": null,
+  "do_normalize": true,
+  "do_pan_and_scan": null,
+  "do_rescale": true,
+  "do_resize": true,
+  "image_mean": [
+    0.5,
+    0.5,
+    0.5
+  ],
+  "image_processor_type": "Gemma3ImageProcessor",
+  "image_seq_length": 256,
+  "image_std": [
+    0.5,
+    0.5,
+    0.5
+  ],
+  "pan_and_scan_max_num_crops": null,
+  "pan_and_scan_min_crop_size": null,
+  "pan_and_scan_min_ratio_to_activate": null,
+  "processor_class": "Gemma3Processor",
+  "resample": 2,
+  "rescale_factor": 0.00392156862745098,
+  "size": {
+    "height": 896,
+    "width": 896
+  }
+}

Bangla_Text_Summarization/bangla_summarization_adapter/processor_config.json ADDED Viewed

	@@ -0,0 +1,4 @@

+{
+  "image_seq_length": 256,
+  "processor_class": "Gemma3Processor"
+}

Bangla_Text_Summarization/bangla_summarization_adapter/special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,33 @@

+{
+  "boi_token": "<start_of_image>",
+  "bos_token": {
+    "content": "<bos>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "eoi_token": "<end_of_image>",
+  "eos_token": {
+    "content": "<end_of_turn>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "image_token": "<image_soft_token>",
+  "pad_token": {
+    "content": "<pad>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "unk_token": {
+    "content": "<unk>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  }
+}

Bangla_Text_Summarization/bangla_summarization_adapter/tokenizer.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4667f2089529e8e7657cfb6d1c19910ae71ff5f28aa7ab2ff2763330affad795
+size 33384568

Bangla_Text_Summarization/bangla_summarization_adapter/tokenizer.model ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1299c11d7cf632ef3b4e11937501358ada021bbdf7c47638d13c0ee982f2e79c
+size 4689074

Bangla_Text_Summarization/bangla_summarization_adapter/tokenizer_config.json ADDED Viewed

The diff for this file is too large to render. See raw diff

Bangla_Text_Summarization/bts_finetune.ipynb ADDED Viewed

The diff for this file is too large to render. See raw diff

Bangla_Text_Summarization/data.csv ADDED Viewed

The diff for this file is too large to render. See raw diff

Modelfile ADDED Viewed

	@@ -0,0 +1,26 @@

+FROM /home/shanin/Desktop/SHANIN/MAIN/ALL_CODE/BTS/Bangla_Text_Summarization/bangla_summarization.Q8_0.gguf
+PARAMETER num_ctx 2048
+PARAMETER num_batch 512
+PARAMETER num_gpu 1
+TEMPLATE """<start_of_turn>user
+{{ .Prompt }}<end_of_turn>
+<start_of_turn>model
+"""
+SYSTEM """You are a helpful AI assistant.
+- If the user asks for a summary (e.g., starts with 'সংক্ষেপ করুন'), respond with a concise Bengali summary.
+- Otherwise, answer general questions naturally in Bengali or English.
+- Keep responses clear, concise, and relevant."""
+PARAMETER temperature 0.1
+PARAMETER top_k 40
+PARAMETER top_p 0.9
+PARAMETER stop "<end_of_turn>"
+PARAMETER repeat_penalty 1.1
+# ollama create bangla-summarization -f Modelfile
+# ollama rm bangla-summarization

main.py ADDED Viewed

	@@ -0,0 +1,28 @@

+from flask import Flask, render_template, request
+import ollama
+app = Flask(__name__)
+MODEL_NAME = "bts"
+@app.route("/")
+def index():
+    return render_template("index.html")
+@app.route("/get", methods=["POST"])
+def chat():
+    user_message = request.form["msg"]
+    try:
+        response = ollama.chat(
+            model=MODEL_NAME,
+            messages=[{"role": "user", "content": user_message}]
+        )
+        reply = response['message']['content']
+    except Exception as e:
+        reply = f"Error: {str(e)}"
+    return reply
+if __name__ == "__main__":
+    app.run(host="0.0.0.0", port=5000)

static/icon.png ADDED Viewed

templates/index.html ADDED Viewed

	@@ -0,0 +1,99 @@

+<!DOCTYPE html>
+<html lang="en">
+<head>
+    <meta charset="UTF-8">
+    <meta name="viewport" content="width=device-width, initial-scale=1.0">
+    <title>Bangla AI Text Summarizer</title>
+    <!-- Bootstrap CSS -->
+    <link href="https://cdnjs.cloudflare.com/ajax/libs/bootstrap/5.3.2/css/bootstrap.min.css" rel="stylesheet">
+    <!-- Font Awesome -->
+    <link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.0/css/all.min.css">
+    <!-- Google Fonts: Poppins -->
+    <link href="https://fonts.googleapis.com/css2?family=Poppins:wght@400;500;600&display=swap" rel="stylesheet">
+    <!-- Tailwind CSS -->
+    <script src="https://cdn.tailwindcss.com"></script>
+    <script>
+        tailwind.config = {
+            theme: {
+                extend: {
+                    colors: {
+                        'dark-bg': '#1f2c34',
+                        'dark-bg-alt': '#1b1b2f',
+                        'chat-bg': '#222831',
+                        'header-bg': '#30475e',
+                        'footer-bg': '#393e46',
+                        'accent': '#00adb5',
+                        'accent-hover': '#00b8c4',
+                        'dark-light': '#393e46',
+                        'text-light': '#eeeeee',
+                    },
+                },
+            },
+        };
+    </script>
+</head>
+<body class="font-poppins bg-gradient-to-br from-dark-bg to-dark-bg-alt min-h-screen">
+<div class="chat-container flex justify-center items-center min-h-screen p-5">
+    <div class="chat-box w-[550px] max-w-[95%] h-[750px] bg-chat-bg rounded-3xl flex flex-col overflow-hidden shadow-2xl">
+        <div class="chat-header flex items-center p-4 bg-header-bg text-white rounded-t-3xl">
+            <img src="{{ url_for('static', filename='icon.png') }}" class="chat-avatar w-[55px] h-[55px] mr-3">
+            <div>
+                <h5 class="mb-0 text-lg font-semibold">Bangla LLM Text Summarizer</h5>
+                <small class="text-white/80 text-sm">Powered by Shanin</small>
+            </div>
+        </div>
+        <div id="chatBody" class="chat-body flex-1 overflow-y-auto p-4">
+            <!-- Messages will appear here -->
+        </div>
+        <div class="chat-footer p-4 bg-footer-bg rounded-b-3xl">
+            <form id="messageForm" class="flex">
+                <input type="text" id="text" name="msg" class="form-control flex-1 rounded-full px-4 py-3 bg-chat-bg border border-accent text-text-light placeholder:text-text-light/50 focus:outline-none focus:ring-0 me-2" placeholder="Type a message..." required>
+                <button type="submit" class="btn btn-accent rounded-full bg-accent text-white px-5 py-3 hover:bg-accent-hover"><i class="fas fa-paper-plane"></i></button>
+            </form>
+        </div>
+    </div>
+</div>
+<!-- jQuery -->
+<script src="https://cdnjs.cloudflare.com/ajax/libs/jquery/3.7.1/jquery.min.js"></script>
+<!-- Bootstrap JS -->
+<script src="https://cdnjs.cloudflare.com/ajax/libs/bootstrap/5.3.2/js/bootstrap.bundle.min.js"></script>
+<script>
+$(document).ready(function() {
+    $("#messageForm").on("submit", function(event) {
+        event.preventDefault();
+        const date = new Date();
+        const str_time = date.getHours().toString().padStart(2,'0') + ":" + date.getMinutes().toString().padStart(2,'0');
+        const rawText = $("#text").val();
+        $("#text").val("");
+        // User message
+        const userHtml = `
+            <div class="message user-message mb-3 flex justify-end">
+                <div class="msg-text bg-accent text-white p-3 rounded-3xl max-w-[80%] break-words shadow-md relative">
+                    ${rawText} <span class="time text-[10px] absolute -bottom-4 right-2 text-white/50">${str_time}</span>
+                </div>
+            </div>`;
+        $("#chatBody").append(userHtml).scrollTop($("#chatBody")[0].scrollHeight);
+        // Call Flask API
+        $.post("/get", { msg: rawText }, function(data) {
+            const botHtml = `
+                <div class="message bot-message mb-3 flex justify-start">
+                    <div class="msg-text bg-dark-light text-text-light p-3 rounded-3xl max-w-[80%] break-words shadow-md relative">
+                        ${data} <span class="time text-[10px] absolute -bottom-4 right-2 text-white/50">${str_time}</span>
+                    </div>
+                </div>`;
+            $("#chatBody").append(botHtml).scrollTop($("#chatBody")[0].scrollHeight);
+        });
+    });
+});
+</script>
+</body>
+</html>