TheoMoins commited on
Commit
f762f49
·
verified ·
1 Parent(s): 5fad348

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -20
README.md CHANGED
@@ -147,7 +147,7 @@ The models can also be used directly outside of DocWorkflow, though the CATMuS p
147
  from transformers import AutoProcessor, AutoModelForImageTextToText
148
  from PIL import Image
149
 
150
- model_id = "ENC-PSL/MEDUSA-9B-0.1"
151
 
152
  processor = AutoProcessor.from_pretrained(model_id)
153
  model = AutoModelForImageTextToText.from_pretrained(model_id, device_map="auto")
@@ -195,25 +195,6 @@ Transcriptions follow the [CATMuS guidelines](https://catmus-guidelines.github.i
195
 
196
  ---
197
 
198
- ## Training details
199
-
200
- | Parameter | Value |
201
- |---|---|
202
- | Base models | Qwen3.5-4B and Qwen3.5-9B |
203
- | Fine-tuning method | LoRA (via Unsloth) |
204
- | LoRA rank | 64 |
205
- | Training data levels | Gold + Platinum (mixed), then Platinum only |
206
- | Training epochs | 3 (mixed) + 1–3 (Platinum only) |
207
- | Max sequence length | 512 |
208
- | Max pixels per image | 401,408 |
209
- | Batch size | 32 (effective) |
210
- | Learning rate | 5 × 10⁻⁵ |
211
- | Framework | DocWorkflow + Unsloth |
212
-
213
- Total training data: ~643,000 lines across Gold, Platinum, and original data (see system report for full dataset list).
214
-
215
- ---
216
-
217
  ## Citation
218
 
219
  If you use MEDUSA in your research, please cite:
 
147
  from transformers import AutoProcessor, AutoModelForImageTextToText
148
  from PIL import Image
149
 
150
+ model_id = "ENC-PSL/Medusa0.1Line-9B"
151
 
152
  processor = AutoProcessor.from_pretrained(model_id)
153
  model = AutoModelForImageTextToText.from_pretrained(model_id, device_map="auto")
 
195
 
196
  ---
197
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
198
  ## Citation
199
 
200
  If you use MEDUSA in your research, please cite: