Update README.md
README.md
CHANGED
@@ -147,7 +147,7 @@ The models can also be used directly outside of DocWorkflow, though the CATMuS p
 from transformers import AutoProcessor, AutoModelForImageTextToText
 from PIL import Image
 
-model_id = "ENC-PSL/
+model_id = "ENC-PSL/Medusa0.1Line-9B"
 
 processor = AutoProcessor.from_pretrained(model_id)
 model = AutoModelForImageTextToText.from_pretrained(model_id, device_map="auto")
@@ -195,25 +195,6 @@ Transcriptions follow the [CATMuS guidelines](https://catmus-guidelines.github.i
 
 ---
 
-## Training details
-
-| Parameter | Value |
-|---|---|
-| Base models | Qwen3.5-4B and Qwen3.5-9B |
-| Fine-tuning method | LoRA (via Unsloth) |
-| LoRA rank | 64 |
-| Training data levels | Gold + Platinum (mixed), then Platinum only |
-| Training epochs | 3 (mixed) + 1–3 (Platinum only) |
-| Max sequence length | 512 |
-| Max pixels per image | 401,408 |
-| Batch size | 32 (effective) |
-| Learning rate | 5 × 10⁻⁵ |
-| Framework | DocWorkflow + Unsloth |
-
-Total training data: ~643,000 lines across Gold, Platinum, and original data (see system report for full dataset list).
-
----
-
 ## Citation
 
 If you use MEDUSA in your research, please cite: