ShobdoOCR
/

shobdo-ocr

Object Detection

text-recognition

Model card Files Files and versions

Sarjinkhan2003 commited on 3 days ago

Commit

c30a8b8

·

verified ·

1 Parent(s): 0d316fd

Update README.md

Files changed (1) hide show

README.md +30 -15

README.md CHANGED Viewed

@@ -1,21 +1,36 @@
 ---
-language: bn
-license: mit
-tags:
-  - object-detection
-  - ocr
-  - bengali
-  - yolov8
 ---
-# ShobdoOCR — Word-Level Detection
-**Load:**
 ```python
-from ultralytics import YOLO
-from huggingface_hub import hf_hub_download
-model = YOLO(hf_hub_download("Sarjinkhan2003/shobdo-ocr-detection", "shobdo_det.pt"))
-results = model.predict("doc.jpg", conf=0.25)
-```
-**mAP@0.5:** 0.9840

+# ShobdoOCR — Bangla-English OCR for Bangladeshi Documents
+ShobdoOCR is a word-level OCR system designed for Bangladeshi government documents including NID cards, birth certificates, land deeds, and invoices. It handles mixed Bengali and English text using a classifier-first dual-recognizer architecture — a lightweight 23K-parameter script classifier (99.82% accuracy) routes each detected word to either a Bengali CRNN or English CRNN recognizer, returning per-word bounding boxes, recognized text, and script labels.
+Part of the **DocReader BD** intelligent document understanding system.
 ---
+## Install
+```bash
+pip install --index-url https://test.pypi.org/simple/ \
+            --extra-index-url https://pypi.org/simple/ \
+            shobdoocr==0.1.1
+```
+> Note: shobdoocr is currently hosted on TestPyPI (test registry).
+> Dependencies are fetched from the official PyPI automatically.
 ---
+## Usage
 ```python
+from shobdoocr import OCR
+ocr = OCR()  # models download automatically (~80MB)
+# Plain text
+text = ocr.read_text("nid_card.jpg")
+print(text)
+# Word-level output with bounding boxes and script labels
+results = ocr.read("nid_card.jpg")
+for word in results:
+    print(word['text'], word['script'], word['box'])
+```