Harley-ml
/

Tenete-8M

Text Generation

Eval Results (legacy)

Model card Files Files and versions

Harley-ml commited on 21 days ago

Commit

d5f6b36

·

verified ·

1 Parent(s): ee8e0fa

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -493,7 +493,7 @@ Tenete-8M uses the **Qwen3 architecture**.
 ## Training
-Tenete-8M was trained on an **RTX 2060 6GB** for one epoch with a batch size of 4 and a gradient accumulation of 18, resulting in an **effective batch size of 72**.
 ### Dataset

 ## Training
+Tenete-8M was trained on an **RTX 2060 6GB** for one epoch with a batch size of 4 and a gradient accumulation of 18 (**effective batch size=72**) for two hours and twenty minutes.
 ### Dataset