dchen0 commited on
Commit
1d408c8
Β·
verified Β·
1 Parent(s): bec7b17

Upload logs/resnet_20260331_102144.log with huggingface_hub

Browse files
Files changed (1) hide show
  1. logs/resnet_20260331_102144.log +654 -0
logs/resnet_20260331_102144.log ADDED
@@ -0,0 +1,654 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
0
  0%| | 0.00/97.8M [00:00<?, ?B/s]
1
  9%|β–‰ | 8.75M/97.8M [00:00<00:01, 91.5MB/s]
2
  20%|β–ˆβ–ˆ | 19.9M/97.8M [00:00<00:00, 106MB/s]
3
  31%|β–ˆβ–ˆβ–ˆ | 30.0M/97.8M [00:00<00:00, 100MB/s]
4
  41%|β–ˆβ–ˆβ–ˆβ–ˆ | 39.6M/97.8M [00:00<00:00, 88.7MB/s]
5
  50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 48.8M/97.8M [00:00<00:00, 90.7MB/s]
6
  61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 60.0M/97.8M [00:00<00:00, 99.1MB/s]
7
  71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 69.6M/97.8M [00:00<00:00, 97.2MB/s]
8
  81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 79.0M/97.8M [00:00<00:00, 89.5MB/s]
9
  90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 87.8M/97.8M [00:01<00:00, 86.6MB/s]
10
  98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 96.1M/97.8M [00:01<00:00, 84.2MB/s]
 
 
 
 
 
 
11
  0%| | 0/1 [00:00<?, ?it/s]
 
12
  0%| | 0/1 [00:00<?, ?it/s]
13
 
 
14
 
 
 
15
  
16
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
17
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
18
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
19
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
20
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
21
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
22
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
23
  ...point-1/model.safetensors: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 87.9MB / 94.3MB 
 
 
 
 
 
24
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
25
  ...t_model/model.safetensors: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 87.9MB / 94.3MB 
 
 
 
 
 
 
 
26
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
27
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
28
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
29
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
30
  ...point-1/model.safetensors: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 87.9MB / 94.3MB 
 
 
 
 
 
31
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
32
  ...t_model/model.safetensors: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 87.9MB / 94.3MB 
 
 
 
 
 
 
 
33
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
34
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
35
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
36
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
37
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
 
 
 
 
 
38
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
39
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
 
 
 
 
 
 
 
40
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
41
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
42
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
43
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
44
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
 
 
 
 
 
45
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
46
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
 
 
 
 
 
 
 
47
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
48
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
 
49
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
 
 
50
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
51
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
52
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
 
 
 
 
 
53
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
54
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
 
 
 
 
 
 
 
55
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
56
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
 
57
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
 
 
58
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
59
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
60
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
 
 
 
 
 
61
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
62
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
 
 
 
 
 
 
 
63
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
64
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
 
65
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
 
 
66
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
67
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
68
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
 
 
 
 
 
69
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
70
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
 
 
 
 
 
 
 
71
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
72
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
 
73
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
 
 
 
 
 
 
 
 
 
 
74
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
 
 
75
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
76
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
77
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
 
 
 
 
 
78
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
79
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
 
 
 
 
 
 
 
80
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
81
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
 
82
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
 
 
 
 
 
 
 
 
 
 
83
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
 
 
 
84
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
85
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
86
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
 
 
 
 
 
87
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
88
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
 
 
 
 
 
 
 
89
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
90
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
 
91
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
 
 
 
 
 
 
 
 
 
 
92
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
 
 
93
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
94
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
95
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
 
 
 
 
 
96
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
97
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
 
 
 
 
 
 
 
98
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
99
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
 
100
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
 
 
 
 
 
 
 
 
 
 
101
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
 
 
102
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
103
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
104
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
 
 
 
 
 
105
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
106
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
 
 
 
 
 
 
 
107
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
108
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
 
109
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
 
 
 
 
 
 
 
 
 
 
110
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
 
 
111
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
 
 
 
112
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
 
 
 
 
113
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
 
 
 
 
 
114
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
 
 
 
 
 
 
115
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
 
 
 
 
 
 
 
116
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
117
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
 
 
 
 
 
 
 
 
 
118
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
 
 
 
 
 
 
 
 
 
 
119
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
 
 
120
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB
 
121
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B
 
122
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB
 
123
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB
 
124
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB
 
125
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB
 
126
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB
 
127
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB
 
128
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB
 
 
 
1
+ ==> Checking internet connectivity...
2
+ ==> Internet + pip OK
3
+ ==> Installing dependencies
4
+ ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
5
+ torchaudio 2.5.1+cu124 requires torch==2.5.1, but you have torch 2.6.0+cu124 which is incompatible.
6
+ WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable.It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning.
7
+ WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable.It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning.
8
+ ==> CUDA OK (torch 2.6.0+cu124, CUDA 12.4, NVIDIA GeForce RTX 3090)
9
+ ==> System info:
10
+ GPU: NVIDIA GeForce RTX 3090, 24576 MiB, 565.57.01
11
+ RAM: 251Gi
12
+ CPU: 64 cores
13
+ Disk: 50G total, 47G free
14
+ ==> Cloning font-model repo
15
+ Cloning into 'font-model'...
16
+ ==> Downloading dataset from HuggingFace: dchen0/font_crops_test
17
+
18
+ ==> Extracting data/train.tar...
19
+ ==> Extracting data/test.tar...
20
+ ==> Dataset ready: 3 train variants, 3 test variants
21
+ overlay 50G 4.0G 47G 8% /
22
+
23
+ ============================================
24
+ Training: resnet50 (GPUs: 1)
25
+ ============================================
26
+ 2026-03-31 10:21:32 - INFO - Loading dataset from data - train_model.py:163
27
+ 2026-03-31 10:21:32 - INFO - Found 3 labels - train_model.py:167
28
+ 2026-03-31 10:21:32 - INFO - Setting up image processor and augmentations - train_model.py:177
29
+ 2026-03-31 10:21:32 - INFO - HTTP Request: HEAD https://huggingface.co/facebook/dinov2-base-imagenet1k-1-layer/resolve/main/processor_config.json "HTTP/1.1 404 Not Found" - _client.py:1025
30
+ 2026-03-31 10:21:32 - INFO - HTTP Request: HEAD https://huggingface.co/facebook/dinov2-base-imagenet1k-1-layer/resolve/main/preprocessor_config.json "HTTP/1.1 307 Temporary Redirect" - _client.py:1025
31
+ 2026-03-31 10:21:32 - INFO - HTTP Request: HEAD https://huggingface.co/api/resolve-cache/models/facebook/dinov2-base-imagenet1k-1-layer/f9305d2c8048bd65783f64fabfa25429d13cbdbb/preprocessor_config.json "HTTP/1.1 200 OK" - _client.py:1025
32
+ 2026-03-31 10:21:32 - INFO - HTTP Request: GET https://huggingface.co/api/resolve-cache/models/facebook/dinov2-base-imagenet1k-1-layer/f9305d2c8048bd65783f64fabfa25429d13cbdbb/preprocessor_config.json "HTTP/1.1 200 OK" - _client.py:1025
33
+ 2026-03-31 10:21:32 - INFO - HTTP Request: HEAD https://huggingface.co/facebook/dinov2-base-imagenet1k-1-layer/resolve/main/processor_config.json "HTTP/1.1 404 Not Found" - _client.py:1025
34
+ 2026-03-31 10:21:32 - INFO - HTTP Request: HEAD https://huggingface.co/facebook/dinov2-base-imagenet1k-1-layer/resolve/main/preprocessor_config.json "HTTP/1.1 307 Temporary Redirect" - _client.py:1025
35
+ 2026-03-31 10:21:32 - INFO - HTTP Request: HEAD https://huggingface.co/api/resolve-cache/models/facebook/dinov2-base-imagenet1k-1-layer/f9305d2c8048bd65783f64fabfa25429d13cbdbb/preprocessor_config.json "HTTP/1.1 200 OK" - _client.py:1025
36
+ 2026-03-31 10:21:33 - INFO - HTTP Request: HEAD https://s3.amazonaws.com/datasets.huggingface.co/datasets/datasets/imagefolder/imagefolder.py "HTTP/1.1 404 Not Found" - _client.py:1025
37
+
38
+
39
+
40
+ 2026-03-31 10:21:33 - INFO - Train size: 30, Validation size: 9 - train_model.py:187
41
+ 2026-03-31 10:21:33 - INFO - Applying data transformations - train_model.py:191
42
+
43
+
44
+ 2026-03-31 10:21:33 - INFO - Data preprocessing complete - train_model.py:207
45
+ 2026-03-31 10:21:33 - INFO - Loading ResNet-50 (ImageNet-pretrained) as CNN baseline - train_model.py:211
46
+ Downloading: "https://download.pytorch.org/models/resnet50-11ad3fa6.pth" to /root/.cache/torch/hub/checkpoints/resnet50-11ad3fa6.pth
47
+
48
  0%| | 0.00/97.8M [00:00<?, ?B/s]
49
  9%|β–‰ | 8.75M/97.8M [00:00<00:01, 91.5MB/s]
50
  20%|β–ˆβ–ˆ | 19.9M/97.8M [00:00<00:00, 106MB/s]
51
  31%|β–ˆβ–ˆβ–ˆ | 30.0M/97.8M [00:00<00:00, 100MB/s]
52
  41%|β–ˆβ–ˆβ–ˆβ–ˆ | 39.6M/97.8M [00:00<00:00, 88.7MB/s]
53
  50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 48.8M/97.8M [00:00<00:00, 90.7MB/s]
54
  61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 60.0M/97.8M [00:00<00:00, 99.1MB/s]
55
  71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 69.6M/97.8M [00:00<00:00, 97.2MB/s]
56
  81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 79.0M/97.8M [00:00<00:00, 89.5MB/s]
57
  90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 87.8M/97.8M [00:01<00:00, 86.6MB/s]
58
  98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 96.1M/97.8M [00:01<00:00, 84.2MB/s]
59
+ 2026-03-31 10:21:35 - INFO - trainable params: 23,514,179 || all params: 23,514,179 || trainable%: 100.0000 - train_model.py:237
60
+ 2026-03-31 10:21:35 - INFO - Setting up training arguments - train_model.py:295
61
+ 2026-03-31 10:21:35 - INFO - Using device: cuda - train_model.py:298
62
+ `logging_dir` is deprecated and will be removed in v5.2. Please set `TENSORBOARD_LOGGING_DIR` instead.
63
+ 2026-03-31 10:21:35 - INFO - Starting training - train_model.py:337
64
+
65
  0%| | 0/1 [00:00<?, ?it/s]
66
+
67
  0%| | 0/1 [00:00<?, ?it/s]
68
 
69
+
70
 
71
+
72
+
73
  
74
 
75
+ 2026-03-31 10:21:37 - INFO - Training complete - train_model.py:343
76
+ 2026-03-31 10:21:37 - INFO - Saving result model to the output directory - train_model.py:350
77
+ {'eval_loss': '1.124', 'eval_accuracy': '0.3333', 'eval_runtime': '0.406', 'eval_samples_per_second': '22.17', 'eval_steps_per_second': '2.463', 'epoch': '1'}
78
+ {'train_runtime': '1.543', 'train_samples_per_second': '19.44', 'train_steps_per_second': '0.648', 'train_loss': '1.128', 'epoch': '1'}
79
+ ==> Finished: resnet50
80
+
81
+ ============================================
82
+ ALL TRAINING COMPLETE
83
+ Results in: /workspace/output/
84
+ ============================================
85
+ ==> Uploading results to HuggingFace: dchen0/font-model-dry-run
86
+
87
+
88
+
89
+
90
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
91
+
92
+
93
+
94
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
95
+
96
+
97
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
98
+
99
+
100
+
101
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
102
+
103
+
104
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
105
+
106
+
107
+
108
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
109
+
110
+
111
+
112
+
113
  ...point-1/model.safetensors: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 87.9MB / 94.3MB 
114
+
115
+
116
+
117
+
118
+
119
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
120
+
121
+
122
+
123
+
124
+
125
+
126
  ...t_model/model.safetensors: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 87.9MB / 94.3MB 
127
+
128
+
129
+
130
+
131
+
132
+
133
+
134
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
135
+
136
+
137
+
138
+
139
+
140
+
141
+
142
+
143
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
144
+
145
+
146
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
147
+
148
+
149
+
150
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
151
+
152
+
153
+
154
+
155
  ...point-1/model.safetensors: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 87.9MB / 94.3MB 
156
+
157
+
158
+
159
+
160
+
161
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
162
+
163
+
164
+
165
+
166
+
167
+
168
  ...t_model/model.safetensors: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 87.9MB / 94.3MB 
169
+
170
+
171
+
172
+
173
+
174
+
175
+
176
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
177
+
178
+
179
+
180
+
181
+
182
+
183
+
184
+
185
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
186
+
187
+
188
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
189
+
190
+
191
+
192
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
193
+
194
+
195
+
196
+
197
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
198
+
199
+
200
+
201
+
202
+
203
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
204
+
205
+
206
+
207
+
208
+
209
+
210
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
211
+
212
+
213
+
214
+
215
+
216
+
217
+
218
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
219
+
220
+
221
+
222
+
223
+
224
+
225
+
226
+
227
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
228
+
229
+
230
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
231
+
232
+
233
+
234
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
235
+
236
+
237
+
238
+
239
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
240
+
241
+
242
+
243
+
244
+
245
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
246
+
247
+
248
+
249
+
250
+
251
+
252
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
253
+
254
+
255
+
256
+
257
+
258
+
259
+
260
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
261
+
262
+
263
+
264
+
265
+
266
+
267
+
268
+
269
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
270
+
271
+
272
+
273
+
274
+
275
+
276
+
277
+
278
+
279
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
280
+
281
+
282
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
283
+
284
+
285
+
286
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
287
+
288
+
289
+
290
+
291
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
292
+
293
+
294
+
295
+
296
+
297
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
298
+
299
+
300
+
301
+
302
+
303
+
304
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
305
+
306
+
307
+
308
+
309
+
310
+
311
+
312
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
313
+
314
+
315
+
316
+
317
+
318
+
319
+
320
+
321
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
322
+
323
+
324
+
325
+
326
+
327
+
328
+
329
+
330
+
331
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
332
+
333
+
334
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
335
+
336
+
337
+
338
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
339
+
340
+
341
+
342
+
343
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
344
+
345
+
346
+
347
+
348
+
349
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
350
+
351
+
352
+
353
+
354
+
355
+
356
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
357
+
358
+
359
+
360
+
361
+
362
+
363
+
364
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
365
+
366
+
367
+
368
+
369
+
370
+
371
+
372
+
373
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
374
+
375
+
376
+
377
+
378
+
379
+
380
+
381
+
382
+
383
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
384
+
385
+
386
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
387
+
388
+
389
+
390
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
391
+
392
+
393
+
394
+
395
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
396
+
397
+
398
+
399
+
400
+
401
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
402
+
403
+
404
+
405
+
406
+
407
+
408
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 94.2MB / 94.3MB 
409
+
410
+
411
+
412
+
413
+
414
+
415
+
416
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
417
+
418
+
419
+
420
+
421
+
422
+
423
+
424
+
425
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
426
+
427
+
428
+
429
+
430
+
431
+
432
+
433
+
434
+
435
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
436
+
437
+
438
+
439
+
440
+
441
+
442
+
443
+
444
+
445
+
446
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
447
+
448
+
449
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
450
+
451
+
452
+
453
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
454
+
455
+
456
+
457
+
458
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
459
+
460
+
461
+
462
+
463
+
464
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
465
+
466
+
467
+
468
+
469
+
470
+
471
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
472
+
473
+
474
+
475
+
476
+
477
+
478
+
479
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
480
+
481
+
482
+
483
+
484
+
485
+
486
+
487
+
488
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
489
+
490
+
491
+
492
+
493
+
494
+
495
+
496
+
497
+
498
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
499
+
500
+
501
+
502
+
503
+
504
+
505
+
506
+
507
+
508
+
509
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
510
+
511
+
512
+
513
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
514
+
515
+
516
+
517
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
518
+
519
+
520
+
521
+
522
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
523
+
524
+
525
+
526
+
527
+
528
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
529
+
530
+
531
+
532
+
533
+
534
+
535
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
536
+
537
+
538
+
539
+
540
+
541
+
542
+
543
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
544
+
545
+
546
+
547
+
548
+
549
+
550
+
551
+
552
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
553
+
554
+
555
+
556
+
557
+
558
+
559
+
560
+
561
+
562
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
563
+
564
+
565
+
566
+
567
+
568
+
569
+
570
+
571
+
572
+
573
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
574
+
575
+
576
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
577
+
578
+
579
+
580
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
581
+
582
+
583
+
584
+
585
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
586
+
587
+
588
+
589
+
590
+
591
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
592
+
593
+
594
+
595
+
596
+
597
+
598
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
599
+
600
+
601
+
602
+
603
+
604
+
605
+
606
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
607
+
608
+
609
+
610
+
611
+
612
+
613
+
614
+
615
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
616
+
617
+
618
+
619
+
620
+
621
+
622
+
623
+
624
+
625
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
626
+
627
+
628
+
629
+
630
+
631
+
632
+
633
+
634
+
635
+
636
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
637
+
638
+
639
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
640
+
641
+
642
+
643
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
644
+
645
+
646
+
647
+
648
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
649
+
650
+
651
+
652
+
653
+
654
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
655
+
656
+
657
+
658
+
659
+
660
+
661
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
662
+
663
+
664
+
665
+
666
+
667
+
668
+
669
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
670
+
671
+
672
+
673
+
674
+
675
+
676
+
677
+
678
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
679
+
680
+
681
+
682
+
683
+
684
+
685
+
686
+
687
+
688
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
689
+
690
+
691
+
692
+
693
+
694
+
695
+
696
+
697
+
698
+
699
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
700
+
701
+
702
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB 
703
+
704
+
705
+
706
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B 
707
+
708
+
709
+
710
+
711
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
712
+
713
+
714
+
715
+
716
+
717
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB 
718
+
719
+
720
+
721
+
722
+
723
+
724
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB 
725
+
726
+
727
+
728
+
729
+
730
+
731
+
732
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
733
+
734
+
735
+
736
+
737
+
738
+
739
+
740
+
741
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB 
742
+
743
+
744
+
745
+
746
+
747
+
748
+
749
+
750
+
751
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB 
752
+
753
+
754
+
755
+
756
+
757
+
758
+
759
+
760
+
761
+
762
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB 
763
+
764
+
765
  ...heckpoint-1/rng_state.pth: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 14.2kB / 14.2kB
766
+
767
  ...50/checkpoint-1/scaler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 988B / 988B
768
+
769
  ...point-1/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB
770
+
771
  ...checkpoint-1/optimizer.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.58kB / 1.58kB
772
+
773
  ...t_model/model.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 94.3MB / 94.3MB
774
+
775
  ...point-1/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB
776
+
777
  ...t_model/training_args.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.86kB / 4.86kB
778
+
779
  ...checkpoint-1/scheduler.pt: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.06kB / 1.06kB
780
+
781
  ...952495.620ce50c8876.612.0: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 4.63kB / 4.63kB
782
+ Upload complete.
783
+ ==> Uploading training log to HuggingFace...