bear7011 commited on
Commit
7c9db17
·
verified ·
1 Parent(s): 15c57e9

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -0
README.md ADDED
@@ -0,0 +1,29 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: google/gemma-4-e2b-it
3
+ library_name: transformers
4
+ pipeline_tag: image-text-to-text
5
+ ---
6
+ # gemma4-e4b-webvid4K_FT
7
+ Full fine-tune of `google/gemma-4-e2b-it` on AI-generated video data derived from WebVid.
8
+ ## Training
9
+ - Dataset: `bear7011/gemma-4-e4b-webvid-4K`
10
+ - Samples: 3,941 video instruction examples
11
+ - Method: full fine-tuning, no LoRA
12
+ - Precision: bfloat16
13
+ - GPUs: 4
14
+ - DeepSpeed: ZeRO-3 with CPU optimizer and parameter offload
15
+ - Epochs: 1
16
+ - Global steps: 124
17
+ - Per-device batch size: 1
18
+ - Gradient accumulation steps: 8
19
+ - Optimizer: AdamW
20
+ - Learning rate: 5e-6
21
+ - Projector learning rate: 5e-6
22
+ - Image encoder learning rate: 0.0
23
+ - Weight decay: 0.01
24
+ - Warmup ratio: 0.03
25
+ - LR scheduler: cosine
26
+ - Gradient checkpointing: enabled
27
+ - Max sequence length: 2304
28
+ - Final training loss: 1.9510
29
+ Checkpoints and training logs are not included in this repository.