File size: 875 Bytes
2da9347
b4abbf6
 
 
2da9347
b4abbf6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
---
base_model: google/gemma-4-e4b-it
library_name: transformers
pipeline_tag: image-text-to-text
---

# gemma4-e4b-webvid4K_FT

Full fine-tune of `google/gemma-4-e4b-it` on AI-generated video data derived from WebVid.

## Training

- Dataset: `bear7011/gemma-4-e4b-webvid-4K`
- Samples: 3,941 video instruction examples
- Method: full fine-tuning, no LoRA
- Precision: bfloat16
- GPUs: 4
- DeepSpeed: ZeRO-3 with CPU optimizer and parameter offload
- Epochs: 1
- Global steps: 124
- Per-device batch size: 1
- Gradient accumulation steps: 8
- Optimizer: AdamW
- Learning rate: 5e-6
- Projector learning rate: 5e-6
- Image encoder learning rate: 0.0
- Weight decay: 0.01
- Warmup ratio: 0.03
- LR scheduler: cosine
- Gradient checkpointing: enabled
- Max sequence length: 2304
- Final training loss: 1.9510

Checkpoints and training logs are not included in this repository.