Sarjinkhan2003 commited on
Commit
f620b5f
·
verified ·
1 Parent(s): fc69be4

Phase 1 checkpoint — CER=0.0348

Browse files
Files changed (6) hide show
  1. .gitattributes +1 -0
  2. README.md +40 -0
  3. phase1_best.pth +3 -0
  4. phase1_curves.png +0 -0
  5. phase2_files.csv +3 -0
  6. vocab.json +1 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ phase2_files.csv filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,40 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: bn
3
+ license: mit
4
+ tags: [ocr, bengali, crnn, easyocr, ctc]
5
+ datasets: [mnsm92/bengali-ocr-dataset-1m]
6
+ metrics: [cer, wer]
7
+ ---
8
+
9
+ # Bengali CRNN OCR — Custom EasyOCR Recognition Model
10
+
11
+ **DocReader BD — CSC4233 NLP Final Project, AIUB**
12
+
13
+ ## Phase 1 Results (500K samples, 10 epochs)
14
+
15
+ | Model | CER ↓ | WER ↓ |
16
+ |---|---|---|
17
+ | Tesseract baseline | ~0.45 | ~0.60 |
18
+ | EasyOCR default | ~0.25 | ~0.40 |
19
+ | **BengaliCRNN Phase 1** | **0.0348** | **0.1020** |
20
+
21
+ Phase 2 checkpoint (full 1M) will update this card.
22
+
23
+ ## Files
24
+ - `phase1_best.pth` — Phase 1 weights (load for Phase 2 training)
25
+ - `phase2_best.pth` — Phase 2 weights (final model for EasyOCR)
26
+ - `vocab.json` — Bengali character vocabulary
27
+ - `bengali_crnn.py` — EasyOCR network definition (added in Phase 2)
28
+
29
+ ## Usage (after Phase 2)
30
+ ```python
31
+ import easyocr
32
+ reader = easyocr.Reader(
33
+ lang_list=["bn"],
34
+ recog_network="bengali_crnn",
35
+ model_storage_directory="./model",
36
+ user_network_directory="./model",
37
+ gpu=True
38
+ )
39
+ results = reader.readtext("bengali_doc.jpg")
40
+ ```
phase1_best.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fc6ec57fe4615f6c273d39f3efe62e85d5cd2122f36148f61a1b3f4421911cf3
3
+ size 125944385
phase1_curves.png ADDED
phase2_files.csv ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:defc227d746be6262504bbd04b0b2068d061a1cf3e1b2b7f7ef354031f70ce4d
3
+ size 53046834
vocab.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"charset": "অআইঈউঊঋঌএঐওঔকখগঘঙচছজঝঞটঠডঢণতথদধনপফবভমযরলশষসহড়ঢ়য়ৎািীুূৃেৈোৌ্ংঃঁ০১২৩৪৫৬৭৮৯ ।,.?!()-–:;%/\\ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789", "num_classes": 153}