carsonx
/

carsonx shenyuang commited on
Commit
467b854
·
0 Parent(s):

Duplicate from nvidia/DreamDojo

Browse files

Co-authored-by: Shenyuan Gao <shenyuang@users.noreply.huggingface.co>

This view is limited to 50 files because it contains too many changes.   See raw diff
Files changed (50) hide show
  1. .gitattributes +875 -0
  2. 14B_AgiBot_post-train/iter_000050000/model/.metadata +3 -0
  3. 14B_AgiBot_post-train/iter_000050000/model/__0_0.distcp +3 -0
  4. 14B_AgiBot_post-train/iter_000050000/model/__100_0.distcp +0 -0
  5. 14B_AgiBot_post-train/iter_000050000/model/__101_0.distcp +0 -0
  6. 14B_AgiBot_post-train/iter_000050000/model/__102_0.distcp +0 -0
  7. 14B_AgiBot_post-train/iter_000050000/model/__103_0.distcp +0 -0
  8. 14B_AgiBot_post-train/iter_000050000/model/__104_0.distcp +0 -0
  9. 14B_AgiBot_post-train/iter_000050000/model/__105_0.distcp +0 -0
  10. 14B_AgiBot_post-train/iter_000050000/model/__106_0.distcp +0 -0
  11. 14B_AgiBot_post-train/iter_000050000/model/__107_0.distcp +0 -0
  12. 14B_AgiBot_post-train/iter_000050000/model/__108_0.distcp +0 -0
  13. 14B_AgiBot_post-train/iter_000050000/model/__109_0.distcp +0 -0
  14. 14B_AgiBot_post-train/iter_000050000/model/__10_0.distcp +3 -0
  15. 14B_AgiBot_post-train/iter_000050000/model/__110_0.distcp +0 -0
  16. 14B_AgiBot_post-train/iter_000050000/model/__111_0.distcp +0 -0
  17. 14B_AgiBot_post-train/iter_000050000/model/__112_0.distcp +0 -0
  18. 14B_AgiBot_post-train/iter_000050000/model/__113_0.distcp +0 -0
  19. 14B_AgiBot_post-train/iter_000050000/model/__114_0.distcp +0 -0
  20. 14B_AgiBot_post-train/iter_000050000/model/__115_0.distcp +0 -0
  21. 14B_AgiBot_post-train/iter_000050000/model/__116_0.distcp +0 -0
  22. 14B_AgiBot_post-train/iter_000050000/model/__117_0.distcp +0 -0
  23. 14B_AgiBot_post-train/iter_000050000/model/__118_0.distcp +0 -0
  24. 14B_AgiBot_post-train/iter_000050000/model/__119_0.distcp +0 -0
  25. 14B_AgiBot_post-train/iter_000050000/model/__11_0.distcp +3 -0
  26. 14B_AgiBot_post-train/iter_000050000/model/__120_0.distcp +0 -0
  27. 14B_AgiBot_post-train/iter_000050000/model/__121_0.distcp +0 -0
  28. 14B_AgiBot_post-train/iter_000050000/model/__122_0.distcp +0 -0
  29. 14B_AgiBot_post-train/iter_000050000/model/__123_0.distcp +0 -0
  30. 14B_AgiBot_post-train/iter_000050000/model/__124_0.distcp +0 -0
  31. 14B_AgiBot_post-train/iter_000050000/model/__125_0.distcp +0 -0
  32. 14B_AgiBot_post-train/iter_000050000/model/__126_0.distcp +0 -0
  33. 14B_AgiBot_post-train/iter_000050000/model/__127_0.distcp +0 -0
  34. 14B_AgiBot_post-train/iter_000050000/model/__128_0.distcp +0 -0
  35. 14B_AgiBot_post-train/iter_000050000/model/__129_0.distcp +0 -0
  36. 14B_AgiBot_post-train/iter_000050000/model/__12_0.distcp +3 -0
  37. 14B_AgiBot_post-train/iter_000050000/model/__130_0.distcp +0 -0
  38. 14B_AgiBot_post-train/iter_000050000/model/__131_0.distcp +0 -0
  39. 14B_AgiBot_post-train/iter_000050000/model/__132_0.distcp +0 -0
  40. 14B_AgiBot_post-train/iter_000050000/model/__133_0.distcp +0 -0
  41. 14B_AgiBot_post-train/iter_000050000/model/__134_0.distcp +0 -0
  42. 14B_AgiBot_post-train/iter_000050000/model/__135_0.distcp +0 -0
  43. 14B_AgiBot_post-train/iter_000050000/model/__136_0.distcp +0 -0
  44. 14B_AgiBot_post-train/iter_000050000/model/__137_0.distcp +0 -0
  45. 14B_AgiBot_post-train/iter_000050000/model/__138_0.distcp +0 -0
  46. 14B_AgiBot_post-train/iter_000050000/model/__139_0.distcp +0 -0
  47. 14B_AgiBot_post-train/iter_000050000/model/__13_0.distcp +3 -0
  48. 14B_AgiBot_post-train/iter_000050000/model/__140_0.distcp +0 -0
  49. 14B_AgiBot_post-train/iter_000050000/model/__141_0.distcp +0 -0
  50. 14B_AgiBot_post-train/iter_000050000/model/__142_0.distcp +0 -0
.gitattributes ADDED
@@ -0,0 +1,875 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ 14B_YAM_post-train_5k/optim/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
37
+ 14B_YAM_post-train_5k/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
38
+ 14B_YAM_post-train_5k/optim/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
39
+ 2B_YAM_post-train_5k/optim/.metadata filter=lfs diff=lfs merge=lfs -text
40
+ 2B_YAM_post-train_5k/model/.metadata filter=lfs diff=lfs merge=lfs -text
41
+ 2B_YAM_post-train_5k/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
42
+ 2B_YAM_post-train_5k/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
43
+ 2B_YAM_post-train_5k/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
44
+ 2B_YAM_post-train_5k/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
45
+ 2B_YAM_post-train_5k/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
46
+ 2B_YAM_post-train_5k/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
47
+ 2B_YAM_post-train_5k/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
48
+ 2B_YAM_post-train_5k/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
49
+ 2B_YAM_post-train_5k/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
50
+ 2B_YAM_post-train_5k/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
51
+ 14B_pretrain_140k/optim/.metadata filter=lfs diff=lfs merge=lfs -text
52
+ 2B_YAM_post-train_5k/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
53
+ 2B_YAM_post-train_5k/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
54
+ 2B_YAM_post-train_5k/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
55
+ 2B_YAM_post-train_5k/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
56
+ 2B_YAM_post-train_5k/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
57
+ 2B_YAM_post-train_5k/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
58
+ 14B_pretrain_140k/optim/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
59
+ 14B_pretrain_140k/optim/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
60
+ 14B_pretrain_140k/optim/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
61
+ 14B_pretrain_140k/optim/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
62
+ 14B_pretrain_140k/optim/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
63
+ 14B_pretrain_140k/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
64
+ 14B_pretrain_140k/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
65
+ 14B_pretrain_140k/optim/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
66
+ 14B_pretrain_140k/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
67
+ 14B_pretrain_140k/optim/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
68
+ 14B_pretrain_140k/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
69
+ 14B_pretrain_140k/optim/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
70
+ 14B_pretrain_140k/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
71
+ 14B_pretrain_140k/optim/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
72
+ 14B_pretrain_140k/optim/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
73
+ 14B_pretrain_140k/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
74
+ 14B_pretrain_140k/optim/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
75
+ 14B_pretrain_140k/optim/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
76
+ 14B_pretrain_140k/optim/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
77
+ 14B_pretrain_140k/optim/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
78
+ 14B_pretrain_140k/optim/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
79
+ 14B_pretrain_140k/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
80
+ 14B_pretrain_140k/optim/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
81
+ 14B_pretrain_140k/optim/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
82
+ 14B_pretrain_140k/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
83
+ 14B_pretrain_140k/optim/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
84
+ 14B_pretrain_140k/optim/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
85
+ 14B_pretrain_140k/model/.metadata filter=lfs diff=lfs merge=lfs -text
86
+ 14B_pretrain_140k/optim/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
87
+ 14B_pretrain_140k/optim/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
88
+ 14B_pretrain_140k/optim/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
89
+ 14B_pretrain_140k/optim/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
90
+ 14B_pretrain_140k/optim/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
91
+ 14B_pretrain_140k/model/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
92
+ 14B_pretrain_140k/model/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
93
+ 14B_pretrain_140k/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
94
+ 14B_pretrain_140k/model/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
95
+ 14B_pretrain_140k/model/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
96
+ 14B_pretrain_140k/model/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
97
+ 14B_pretrain_140k/model/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
98
+ 14B_pretrain_140k/model/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
99
+ 14B_G1_post-train_5k/optim/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
100
+ 14B_pretrain_140k/model/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
101
+ 14B_G1_post-train_5k/optim/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
102
+ 14B_G1_post-train_5k/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
103
+ 14B_pretrain_140k/model/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
104
+ 14B_G1_post-train_5k/optim/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
105
+ 14B_G1_post-train_5k/optim/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
106
+ 14B_G1_post-train_5k/optim/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
107
+ 14B_G1_post-train_5k/optim/.metadata filter=lfs diff=lfs merge=lfs -text
108
+ 14B_pretrain_140k/model/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
109
+ 14B_pretrain_140k/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
110
+ 14B_pretrain_140k/model/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
111
+ 14B_pretrain_140k/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
112
+ 14B_G1_post-train_5k/optim/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
113
+ 14B_G1_post-train_5k/optim/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
114
+ 14B_pretrain_140k/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
115
+ 14B_pretrain_140k/model/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
116
+ 14B_G1_post-train_5k/optim/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
117
+ 14B_pretrain_140k/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
118
+ 14B_pretrain_140k/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
119
+ 14B_pretrain_140k/model/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
120
+ 14B_pretrain_140k/model/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
121
+ 14B_pretrain_140k/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
122
+ 14B_pretrain_140k/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
123
+ 14B_pretrain_140k/model/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
124
+ 14B_pretrain_140k/model/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
125
+ 14B_pretrain_140k/model/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
126
+ 14B_pretrain_140k/model/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
127
+ 14B_pretrain_140k/model/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
128
+ 14B_pretrain_140k/model/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
129
+ 14B_pretrain_140k/model/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
130
+ 14B_pretrain_140k/model/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
131
+ 14B_pretrain_140k/model/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
132
+ 14B_pretrain_140k/model/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
133
+ 14B_G1_post-train_5k/optim/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
134
+ 14B_G1_post-train_5k/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
135
+ 14B_G1_post-train_5k/optim/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
136
+ 14B_G1_post-train_5k/optim/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
137
+ 14B_G1_post-train_5k/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
138
+ 14B_G1_post-train_5k/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
139
+ 14B_G1_post-train_5k/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
140
+ 14B_G1_post-train_5k/optim/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
141
+ 14B_G1_post-train_5k/optim/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
142
+ 14B_G1_post-train_5k/optim/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
143
+ 14B_G1_post-train_5k/optim/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
144
+ 14B_G1_post-train_5k/model/.metadata filter=lfs diff=lfs merge=lfs -text
145
+ 14B_G1_post-train_5k/optim/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
146
+ 14B_G1_post-train_5k/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
147
+ 14B_G1_post-train_5k/optim/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
148
+ 14B_G1_post-train_5k/optim/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
149
+ 14B_G1_post-train_5k/optim/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
150
+ 14B_G1_post-train_5k/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
151
+ 14B_G1_post-train_5k/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
152
+ 14B_G1_post-train_5k/optim/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
153
+ 14B_G1_post-train_5k/optim/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
154
+ 14B_G1_post-train_5k/optim/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
155
+ 14B_G1_post-train_5k/optim/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
156
+ 14B_G1_post-train_5k/optim/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
157
+ 14B_G1_post-train_5k/model/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
158
+ 14B_G1_post-train_5k/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
159
+ 14B_G1_post-train_5k/model/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
160
+ 14B_G1_post-train_5k/model/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
161
+ 14B_G1_post-train_5k/model/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
162
+ 14B_G1_post-train_5k/model/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
163
+ 14B_G1_post-train_5k/model/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
164
+ 14B_G1_post-train_5k/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
165
+ 14B_G1_post-train_5k/model/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
166
+ 14B_G1_post-train_5k/model/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
167
+ 14B_G1_post-train_5k/model/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
168
+ 14B_G1_post-train_5k/model/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
169
+ 14B_G1_post-train_5k/model/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
170
+ 14B_G1_post-train_5k/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
171
+ 14B_G1_post-train_5k/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
172
+ 14B_G1_post-train_5k/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
173
+ 14B_G1_post-train_5k/model/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
174
+ 14B_G1_post-train_5k/model/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
175
+ 14B_AgiBot_post-train_5k/optim/.metadata filter=lfs diff=lfs merge=lfs -text
176
+ 14B_G1_post-train_5k/model/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
177
+ 14B_G1_post-train_5k/model/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
178
+ 14B_G1_post-train_5k/model/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
179
+ 14B_G1_post-train_5k/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
180
+ 14B_G1_post-train_5k/model/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
181
+ 14B_G1_post-train_5k/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
182
+ 14B_AgiBot_post-train_5k/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
183
+ 14B_AgiBot_post-train_5k/optim/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
184
+ 14B_G1_post-train_5k/model/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
185
+ 14B_AgiBot_post-train_5k/optim/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
186
+ 14B_G1_post-train_5k/model/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
187
+ 14B_AgiBot_post-train_5k/optim/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
188
+ 14B_AgiBot_post-train_5k/optim/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
189
+ 14B_AgiBot_post-train_5k/optim/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
190
+ 14B_AgiBot_post-train_5k/optim/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
191
+ 14B_G1_post-train_5k/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
192
+ 14B_AgiBot_post-train_5k/optim/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
193
+ 14B_AgiBot_post-train_5k/optim/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
194
+ 14B_G1_post-train_5k/model/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
195
+ 14B_AgiBot_post-train_5k/optim/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
196
+ 14B_G1_post-train_5k/model/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
197
+ 14B_G1_post-train_5k/model/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
198
+ 14B_G1_post-train_5k/model/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
199
+ 14B_G1_post-train_5k/model/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
200
+ 14B_AgiBot_post-train_5k/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
201
+ 14B_AgiBot_post-train_5k/optim/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
202
+ 14B_AgiBot_post-train_5k/optim/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
203
+ 14B_AgiBot_post-train_5k/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
204
+ 14B_AgiBot_post-train_5k/optim/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
205
+ 14B_AgiBot_post-train_5k/optim/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
206
+ 14B_AgiBot_post-train_5k/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
207
+ 14B_AgiBot_post-train_5k/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
208
+ 14B_AgiBot_post-train_5k/optim/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
209
+ 14B_AgiBot_post-train_5k/optim/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
210
+ 14B_AgiBot_post-train_5k/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
211
+ 14B_AgiBot_post-train_5k/optim/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
212
+ 14B_AgiBot_post-train_5k/model/.metadata filter=lfs diff=lfs merge=lfs -text
213
+ 14B_AgiBot_post-train_5k/optim/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
214
+ 14B_AgiBot_post-train_5k/optim/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
215
+ 14B_AgiBot_post-train_5k/optim/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
216
+ 14B_AgiBot_post-train_5k/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
217
+ 14B_AgiBot_post-train_5k/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
218
+ 14B_AgiBot_post-train_5k/optim/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
219
+ 14B_AgiBot_post-train_5k/optim/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
220
+ 14B_AgiBot_post-train_5k/optim/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
221
+ 14B_AgiBot_post-train_5k/optim/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
222
+ 14B_AgiBot_post-train_5k/optim/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
223
+ 2B_G1_post-train_5k/optim/.metadata filter=lfs diff=lfs merge=lfs -text
224
+ 2B_G1_post-train_5k/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
225
+ 2B_G1_post-train_5k/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
226
+ 2B_G1_post-train_5k/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
227
+ 2B_G1_post-train_5k/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
228
+ 2B_G1_post-train_5k/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
229
+ 2B_G1_post-train_5k/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
230
+ 2B_G1_post-train_5k/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
231
+ 2B_G1_post-train_5k/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
232
+ 2B_G1_post-train_5k/model/.metadata filter=lfs diff=lfs merge=lfs -text
233
+ 14B_AgiBot_post-train_5k/model/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
234
+ 14B_AgiBot_post-train_5k/model/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
235
+ 14B_AgiBot_post-train_5k/model/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
236
+ 14B_AgiBot_post-train_5k/model/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
237
+ 14B_AgiBot_post-train_5k/model/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
238
+ 14B_AgiBot_post-train_5k/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
239
+ 14B_AgiBot_post-train_5k/model/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
240
+ 14B_AgiBot_post-train_5k/model/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
241
+ 14B_AgiBot_post-train_5k/model/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
242
+ 14B_AgiBot_post-train_5k/model/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
243
+ 14B_AgiBot_post-train_5k/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
244
+ 14B_AgiBot_post-train_5k/model/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
245
+ 14B_AgiBot_post-train_5k/model/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
246
+ 14B_AgiBot_post-train_5k/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
247
+ 14B_AgiBot_post-train_5k/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
248
+ 14B_AgiBot_post-train_5k/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
249
+ 14B_AgiBot_post-train_5k/model/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
250
+ 14B_AgiBot_post-train_5k/model/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
251
+ 14B_AgiBot_post-train_5k/model/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
252
+ 14B_AgiBot_post-train_5k/model/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
253
+ 14B_AgiBot_post-train_5k/model/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
254
+ 14B_AgiBot_post-train_5k/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
255
+ 14B_AgiBot_post-train_5k/model/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
256
+ 14B_AgiBot_post-train_5k/model/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
257
+ 14B_AgiBot_post-train_5k/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
258
+ 14B_AgiBot_post-train_5k/model/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
259
+ 14B_AgiBot_post-train_5k/model/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
260
+ 14B_AgiBot_post-train_5k/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
261
+ 14B_AgiBot_post-train_5k/model/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
262
+ 14B_GR1_post-train_5k/optim/.metadata filter=lfs diff=lfs merge=lfs -text
263
+ 14B_AgiBot_post-train_5k/model/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
264
+ 14B_AgiBot_post-train_5k/model/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
265
+ 14B_AgiBot_post-train_5k/model/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
266
+ 2B_G1_post-train_5k/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
267
+ 2B_G1_post-train_5k/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
268
+ 2B_G1_post-train_5k/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
269
+ 2B_G1_post-train_5k/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
270
+ 2B_G1_post-train_5k/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
271
+ 2B_G1_post-train_5k/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
272
+ 2B_G1_post-train_5k/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
273
+ 2B_G1_post-train_5k/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
274
+ 14B_GR1_post-train_5k/optim/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
275
+ 14B_GR1_post-train_5k/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
276
+ 14B_GR1_post-train_5k/optim/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
277
+ 14B_GR1_post-train_5k/optim/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
278
+ 14B_GR1_post-train_5k/optim/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
279
+ 14B_GR1_post-train_5k/optim/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
280
+ 14B_GR1_post-train_5k/optim/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
281
+ 14B_GR1_post-train_5k/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
282
+ 14B_GR1_post-train_5k/optim/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
283
+ 14B_GR1_post-train_5k/optim/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
284
+ 14B_GR1_post-train_5k/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
285
+ 14B_GR1_post-train_5k/optim/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
286
+ 14B_GR1_post-train_5k/optim/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
287
+ 14B_GR1_post-train_5k/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
288
+ 14B_GR1_post-train_5k/optim/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
289
+ 14B_GR1_post-train_5k/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
290
+ 14B_GR1_post-train_5k/optim/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
291
+ 14B_GR1_post-train_5k/optim/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
292
+ 14B_GR1_post-train_5k/optim/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
293
+ 14B_GR1_post-train_5k/model/.metadata filter=lfs diff=lfs merge=lfs -text
294
+ 14B_YAM_post-train_5k/optim/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
295
+ 2B_pretrain_140k/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
296
+ 2B_pretrain_140k/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
297
+ 2B_pretrain_140k/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
298
+ 2B_pretrain_140k/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
299
+ 2B_pretrain_140k/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
300
+ 2B_pretrain_140k/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
301
+ 2B_pretrain_140k/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
302
+ 2B_pretrain_140k/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
303
+ 14B_YAM_post-train_5k/optim/.metadata filter=lfs diff=lfs merge=lfs -text
304
+ 2B_GR1_post-train_5k/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
305
+ 2B_GR1_post-train_5k/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
306
+ 2B_GR1_post-train_5k/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
307
+ 14B_YAM_post-train_5k/optim/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
308
+ 14B_YAM_post-train_5k/optim/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
309
+ 2B_GR1_post-train_5k/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
310
+ 2B_GR1_post-train_5k/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
311
+ 2B_GR1_post-train_5k/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
312
+ 2B_GR1_post-train_5k/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
313
+ 2B_GR1_post-train_5k/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
314
+ 14B_GR1_post-train_5k/optim/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
315
+ 14B_GR1_post-train_5k/optim/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
316
+ 14B_GR1_post-train_5k/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
317
+ 14B_GR1_post-train_5k/optim/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
318
+ 14B_GR1_post-train_5k/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
319
+ 14B_GR1_post-train_5k/optim/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
320
+ 14B_GR1_post-train_5k/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
321
+ 14B_GR1_post-train_5k/optim/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
322
+ 14B_GR1_post-train_5k/optim/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
323
+ 14B_GR1_post-train_5k/optim/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
324
+ 14B_GR1_post-train_5k/optim/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
325
+ 14B_GR1_post-train_5k/optim/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
326
+ 14B_GR1_post-train_5k/optim/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
327
+ 2B_AgiBot_post-train_5k/optim/.metadata filter=lfs diff=lfs merge=lfs -text
328
+ 2B_AgiBot_post-train_5k/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
329
+ 2B_AgiBot_post-train_5k/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
330
+ 2B_AgiBot_post-train_5k/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
331
+ 2B_AgiBot_post-train_5k/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
332
+ 2B_AgiBot_post-train_5k/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
333
+ 2B_AgiBot_post-train_5k/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
334
+ 2B_AgiBot_post-train_5k/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
335
+ 2B_AgiBot_post-train_5k/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
336
+ 2B_AgiBot_post-train_5k/model/.metadata filter=lfs diff=lfs merge=lfs -text
337
+ 14B_GR1_post-train_5k/model/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
338
+ 14B_GR1_post-train_5k/model/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
339
+ 14B_GR1_post-train_5k/model/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
340
+ 14B_GR1_post-train_5k/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
341
+ 14B_GR1_post-train_5k/model/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
342
+ 14B_GR1_post-train_5k/model/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
343
+ 14B_GR1_post-train_5k/model/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
344
+ 14B_GR1_post-train_5k/model/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
345
+ 14B_GR1_post-train_5k/model/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
346
+ 14B_GR1_post-train_5k/model/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
347
+ 14B_GR1_post-train_5k/model/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
348
+ 14B_GR1_post-train_5k/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
349
+ 14B_GR1_post-train_5k/model/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
350
+ 14B_GR1_post-train_5k/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
351
+ 14B_GR1_post-train_5k/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
352
+ 14B_GR1_post-train_5k/model/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
353
+ 14B_GR1_post-train_5k/model/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
354
+ 14B_GR1_post-train_5k/model/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
355
+ 14B_GR1_post-train_5k/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
356
+ 14B_GR1_post-train_5k/model/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
357
+ 14B_GR1_post-train_5k/model/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
358
+ 14B_GR1_post-train_5k/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
359
+ 14B_GR1_post-train_5k/model/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
360
+ 14B_GR1_post-train_5k/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
361
+ 14B_GR1_post-train_5k/model/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
362
+ 14B_GR1_post-train_5k/model/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
363
+ 14B_GR1_post-train_5k/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
364
+ 2B_pretrain_140k/optim/.metadata filter=lfs diff=lfs merge=lfs -text
365
+ 14B_GR1_post-train_5k/model/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
366
+ 14B_GR1_post-train_5k/model/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
367
+ 14B_GR1_post-train_5k/model/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
368
+ 14B_GR1_post-train_5k/model/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
369
+ 14B_GR1_post-train_5k/model/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
370
+ 2B_pretrain_140k/model/.metadata filter=lfs diff=lfs merge=lfs -text
371
+ 2B_AgiBot_post-train_5k/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
372
+ 2B_AgiBot_post-train_5k/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
373
+ 2B_AgiBot_post-train_5k/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
374
+ 2B_AgiBot_post-train_5k/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
375
+ 2B_AgiBot_post-train_5k/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
376
+ 2B_AgiBot_post-train_5k/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
377
+ 2B_AgiBot_post-train_5k/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
378
+ 2B_AgiBot_post-train_5k/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
379
+ 2B_GR1_post-train_5k/optim/.metadata filter=lfs diff=lfs merge=lfs -text
380
+ 2B_pretrain_140k/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
381
+ 2B_pretrain_140k/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
382
+ 2B_pretrain_140k/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
383
+ 2B_pretrain_140k/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
384
+ 2B_pretrain_140k/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
385
+ 2B_pretrain_140k/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
386
+ 2B_pretrain_140k/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
387
+ 2B_pretrain_140k/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
388
+ 2B_GR1_post-train_5k/model/.metadata filter=lfs diff=lfs merge=lfs -text
389
+ 2B_GR1_post-train_5k/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
390
+ 2B_GR1_post-train_5k/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
391
+ 2B_GR1_post-train_5k/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
392
+ 2B_GR1_post-train_5k/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
393
+ 2B_GR1_post-train_5k/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
394
+ 2B_GR1_post-train_5k/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
395
+ 2B_GR1_post-train_5k/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
396
+ 2B_GR1_post-train_5k/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
397
+ 14B_YAM_post-train_5k/optim/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
398
+ 14B_YAM_post-train_5k/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
399
+ 14B_YAM_post-train_5k/optim/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
400
+ 14B_YAM_post-train_5k/optim/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
401
+ 14B_YAM_post-train_5k/optim/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
402
+ 14B_YAM_post-train_5k/optim/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
403
+ 14B_YAM_post-train_5k/optim/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
404
+ 14B_YAM_post-train_5k/optim/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
405
+ 14B_YAM_post-train_5k/optim/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
406
+ 14B_YAM_post-train_5k/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
407
+ 14B_YAM_post-train_5k/optim/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
408
+ 14B_YAM_post-train_5k/optim/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
409
+ 14B_YAM_post-train_5k/optim/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
410
+ 14B_YAM_post-train_5k/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
411
+ 14B_YAM_post-train_5k/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
412
+ 14B_YAM_post-train_5k/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
413
+ 14B_YAM_post-train_5k/optim/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
414
+ 14B_YAM_post-train_5k/optim/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
415
+ 14B_YAM_post-train_5k/optim/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
416
+ 14B_YAM_post-train_5k/optim/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
417
+ 14B_YAM_post-train_5k/optim/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
418
+ 14B_YAM_post-train_5k/optim/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
419
+ 14B_YAM_post-train_5k/model/.metadata filter=lfs diff=lfs merge=lfs -text
420
+ 14B_YAM_post-train_5k/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
421
+ 14B_YAM_post-train_5k/optim/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
422
+ 14B_YAM_post-train_5k/optim/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
423
+ 14B_YAM_post-train_5k/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
424
+ 14B_YAM_post-train_5k/model/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
425
+ 14B_YAM_post-train_5k/model/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
426
+ 14B_YAM_post-train_5k/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
427
+ 14B_YAM_post-train_5k/model/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
428
+ 14B_YAM_post-train_5k/model/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
429
+ 14B_YAM_post-train_5k/model/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
430
+ 14B_YAM_post-train_5k/model/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
431
+ 14B_YAM_post-train_5k/model/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
432
+ 14B_YAM_post-train_5k/model/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
433
+ 14B_YAM_post-train_5k/model/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
434
+ 14B_YAM_post-train_5k/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
435
+ 14B_YAM_post-train_5k/model/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
436
+ 14B_YAM_post-train_5k/model/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
437
+ 14B_YAM_post-train_5k/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
438
+ 14B_YAM_post-train_5k/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
439
+ 14B_YAM_post-train_5k/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
440
+ 14B_YAM_post-train_5k/model/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
441
+ 14B_YAM_post-train_5k/model/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
442
+ 14B_YAM_post-train_5k/model/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
443
+ 14B_YAM_post-train_5k/model/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
444
+ 14B_YAM_post-train_5k/model/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
445
+ 14B_YAM_post-train_5k/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
446
+ 14B_YAM_post-train_5k/model/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
447
+ 14B_YAM_post-train_5k/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
448
+ 14B_YAM_post-train_5k/model/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
449
+ 14B_YAM_post-train_5k/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
450
+ 14B_YAM_post-train_5k/model/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
451
+ 14B_YAM_post-train_5k/model/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
452
+ 14B_YAM_post-train_5k/model/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
453
+ 14B_YAM_post-train_5k/model/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
454
+ 14B_YAM_post-train_5k/model/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
455
+ 14B_YAM_post-train_5k/model/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
456
+ 14B_AgiBot_post-train/iter_000050000/optim/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
457
+ 2B_YAM_post-train/iter_000050000/optim/.metadata filter=lfs diff=lfs merge=lfs -text
458
+ 2B_YAM_post-train/iter_000050000/model/.metadata filter=lfs diff=lfs merge=lfs -text
459
+ 2B_AgiBot_post-train/iter_000050000/optim/.metadata filter=lfs diff=lfs merge=lfs -text
460
+ 2B_YAM_post-train/iter_000050000/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
461
+ 2B_YAM_post-train/iter_000050000/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
462
+ 2B_YAM_post-train/iter_000050000/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
463
+ 2B_YAM_post-train/iter_000050000/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
464
+ 2B_YAM_post-train/iter_000050000/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
465
+ 2B_YAM_post-train/iter_000050000/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
466
+ 2B_YAM_post-train/iter_000050000/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
467
+ 2B_YAM_post-train/iter_000050000/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
468
+ 2B_AgiBot_post-train/iter_000050000/model/.metadata filter=lfs diff=lfs merge=lfs -text
469
+ 2B_AgiBot_post-train/iter_000050000/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
470
+ 2B_AgiBot_post-train/iter_000050000/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
471
+ 2B_AgiBot_post-train/iter_000050000/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
472
+ 2B_AgiBot_post-train/iter_000050000/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
473
+ 2B_AgiBot_post-train/iter_000050000/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
474
+ 2B_AgiBot_post-train/iter_000050000/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
475
+ 2B_AgiBot_post-train/iter_000050000/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
476
+ 2B_AgiBot_post-train/iter_000050000/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
477
+ 2B_YAM_post-train/iter_000050000/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
478
+ 2B_YAM_post-train/iter_000050000/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
479
+ 2B_YAM_post-train/iter_000050000/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
480
+ 2B_YAM_post-train/iter_000050000/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
481
+ 14B_GR1_post-train/iter_000050000/optim/.metadata filter=lfs diff=lfs merge=lfs -text
482
+ 2B_YAM_post-train/iter_000050000/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
483
+ 2B_YAM_post-train/iter_000050000/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
484
+ 2B_YAM_post-train/iter_000050000/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
485
+ 2B_YAM_post-train/iter_000050000/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
486
+ 2B_AgiBot_post-train/iter_000050000/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
487
+ 2B_AgiBot_post-train/iter_000050000/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
488
+ 2B_AgiBot_post-train/iter_000050000/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
489
+ 2B_AgiBot_post-train/iter_000050000/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
490
+ 2B_AgiBot_post-train/iter_000050000/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
491
+ 2B_AgiBot_post-train/iter_000050000/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
492
+ 2B_AgiBot_post-train/iter_000050000/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
493
+ 2B_AgiBot_post-train/iter_000050000/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
494
+ 14B_GR1_post-train/iter_000050000/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
495
+ 14B_GR1_post-train/iter_000050000/optim/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
496
+ 14B_GR1_post-train/iter_000050000/optim/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
497
+ 14B_GR1_post-train/iter_000050000/optim/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
498
+ 14B_GR1_post-train/iter_000050000/optim/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
499
+ 14B_GR1_post-train/iter_000050000/optim/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
500
+ 14B_GR1_post-train/iter_000050000/optim/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
501
+ 14B_GR1_post-train/iter_000050000/optim/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
502
+ 14B_GR1_post-train/iter_000050000/optim/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
503
+ 14B_GR1_post-train/iter_000050000/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
504
+ 14B_GR1_post-train/iter_000050000/optim/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
505
+ 14B_GR1_post-train/iter_000050000/optim/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
506
+ 14B_GR1_post-train/iter_000050000/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
507
+ 14B_GR1_post-train/iter_000050000/optim/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
508
+ 14B_GR1_post-train/iter_000050000/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
509
+ 14B_GR1_post-train/iter_000050000/optim/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
510
+ 14B_GR1_post-train/iter_000050000/optim/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
511
+ 14B_GR1_post-train/iter_000050000/optim/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
512
+ 14B_GR1_post-train/iter_000050000/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
513
+ 14B_GR1_post-train/iter_000050000/optim/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
514
+ 14B_GR1_post-train/iter_000050000/optim/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
515
+ 14B_GR1_post-train/iter_000050000/optim/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
516
+ 14B_GR1_post-train/iter_000050000/optim/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
517
+ 14B_GR1_post-train/iter_000050000/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
518
+ 14B_GR1_post-train/iter_000050000/optim/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
519
+ 14B_GR1_post-train/iter_000050000/optim/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
520
+ 14B_GR1_post-train/iter_000050000/optim/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
521
+ 14B_GR1_post-train/iter_000050000/model/.metadata filter=lfs diff=lfs merge=lfs -text
522
+ 14B_GR1_post-train/iter_000050000/optim/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
523
+ 14B_GR1_post-train/iter_000050000/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
524
+ 14B_GR1_post-train/iter_000050000/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
525
+ 14B_GR1_post-train/iter_000050000/optim/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
526
+ 14B_GR1_post-train/iter_000050000/optim/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
527
+ 14B_G1_post-train/iter_000050000/optim/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
528
+ 14B_G1_post-train/iter_000050000/optim/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
529
+ 14B_G1_post-train/iter_000050000/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
530
+ 14B_G1_post-train/iter_000050000/optim/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
531
+ 14B_GR1_post-train/iter_000050000/model/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
532
+ 14B_GR1_post-train/iter_000050000/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
533
+ 14B_GR1_post-train/iter_000050000/model/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
534
+ 14B_GR1_post-train/iter_000050000/model/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
535
+ 14B_GR1_post-train/iter_000050000/model/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
536
+ 14B_GR1_post-train/iter_000050000/model/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
537
+ 14B_GR1_post-train/iter_000050000/model/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
538
+ 14B_GR1_post-train/iter_000050000/model/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
539
+ 14B_GR1_post-train/iter_000050000/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
540
+ 14B_GR1_post-train/iter_000050000/model/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
541
+ 14B_GR1_post-train/iter_000050000/model/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
542
+ 14B_GR1_post-train/iter_000050000/model/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
543
+ 14B_G1_post-train/iter_000050000/optim/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
544
+ 14B_GR1_post-train/iter_000050000/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
545
+ 14B_GR1_post-train/iter_000050000/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
546
+ 14B_GR1_post-train/iter_000050000/model/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
547
+ 14B_G1_post-train/iter_000050000/optim/.metadata filter=lfs diff=lfs merge=lfs -text
548
+ 14B_GR1_post-train/iter_000050000/model/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
549
+ 14B_GR1_post-train/iter_000050000/model/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
550
+ 14B_GR1_post-train/iter_000050000/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
551
+ 14B_GR1_post-train/iter_000050000/model/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
552
+ 14B_GR1_post-train/iter_000050000/model/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
553
+ 14B_GR1_post-train/iter_000050000/model/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
554
+ 14B_GR1_post-train/iter_000050000/model/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
555
+ 14B_GR1_post-train/iter_000050000/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
556
+ 14B_GR1_post-train/iter_000050000/model/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
557
+ 14B_GR1_post-train/iter_000050000/model/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
558
+ 14B_GR1_post-train/iter_000050000/model/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
559
+ 14B_GR1_post-train/iter_000050000/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
560
+ 14B_G1_post-train/iter_000050000/optim/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
561
+ 14B_GR1_post-train/iter_000050000/model/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
562
+ 14B_GR1_post-train/iter_000050000/model/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
563
+ 14B_GR1_post-train/iter_000050000/model/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
564
+ 14B_GR1_post-train/iter_000050000/model/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
565
+ 14B_GR1_post-train/iter_000050000/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
566
+ 14B_G1_post-train/iter_000050000/optim/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
567
+ 14B_G1_post-train/iter_000050000/optim/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
568
+ 14B_G1_post-train/iter_000050000/optim/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
569
+ 14B_G1_post-train/iter_000050000/optim/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
570
+ 14B_G1_post-train/iter_000050000/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
571
+ 14B_G1_post-train/iter_000050000/optim/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
572
+ 14B_G1_post-train/iter_000050000/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
573
+ 14B_G1_post-train/iter_000050000/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
574
+ 14B_G1_post-train/iter_000050000/optim/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
575
+ 14B_G1_post-train/iter_000050000/optim/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
576
+ 14B_G1_post-train/iter_000050000/optim/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
577
+ 14B_G1_post-train/iter_000050000/optim/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
578
+ 14B_G1_post-train/iter_000050000/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
579
+ 14B_G1_post-train/iter_000050000/optim/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
580
+ 14B_G1_post-train/iter_000050000/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
581
+ 14B_G1_post-train/iter_000050000/optim/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
582
+ 14B_G1_post-train/iter_000050000/optim/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
583
+ 14B_G1_post-train/iter_000050000/model/.metadata filter=lfs diff=lfs merge=lfs -text
584
+ 14B_G1_post-train/iter_000050000/optim/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
585
+ 14B_G1_post-train/iter_000050000/optim/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
586
+ 14B_G1_post-train/iter_000050000/optim/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
587
+ 14B_G1_post-train/iter_000050000/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
588
+ 14B_G1_post-train/iter_000050000/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
589
+ 14B_G1_post-train/iter_000050000/optim/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
590
+ 14B_G1_post-train/iter_000050000/optim/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
591
+ 14B_G1_post-train/iter_000050000/optim/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
592
+ 14B_G1_post-train/iter_000050000/optim/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
593
+ 2B_GR1_post-train/iter_000050000/optim/.metadata filter=lfs diff=lfs merge=lfs -text
594
+ 2B_GR1_post-train/iter_000050000/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
595
+ 2B_GR1_post-train/iter_000050000/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
596
+ 2B_GR1_post-train/iter_000050000/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
597
+ 2B_GR1_post-train/iter_000050000/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
598
+ 14B_G1_post-train/iter_000050000/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
599
+ 14B_G1_post-train/iter_000050000/model/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
600
+ 14B_G1_post-train/iter_000050000/model/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
601
+ 14B_G1_post-train/iter_000050000/model/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
602
+ 14B_G1_post-train/iter_000050000/model/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
603
+ 14B_G1_post-train/iter_000050000/model/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
604
+ 14B_G1_post-train/iter_000050000/model/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
605
+ 14B_G1_post-train/iter_000050000/model/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
606
+ 2B_GR1_post-train/iter_000050000/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
607
+ 14B_G1_post-train/iter_000050000/model/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
608
+ 14B_G1_post-train/iter_000050000/model/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
609
+ 14B_G1_post-train/iter_000050000/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
610
+ 2B_GR1_post-train/iter_000050000/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
611
+ 14B_G1_post-train/iter_000050000/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
612
+ 14B_G1_post-train/iter_000050000/model/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
613
+ 2B_GR1_post-train/iter_000050000/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
614
+ 2B_GR1_post-train/iter_000050000/model/.metadata filter=lfs diff=lfs merge=lfs -text
615
+ 14B_G1_post-train/iter_000050000/model/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
616
+ 14B_G1_post-train/iter_000050000/model/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
617
+ 14B_G1_post-train/iter_000050000/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
618
+ 2B_GR1_post-train/iter_000050000/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
619
+ 14B_G1_post-train/iter_000050000/model/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
620
+ 14B_G1_post-train/iter_000050000/model/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
621
+ 14B_G1_post-train/iter_000050000/model/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
622
+ 14B_G1_post-train/iter_000050000/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
623
+ 14B_G1_post-train/iter_000050000/model/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
624
+ 14B_G1_post-train/iter_000050000/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
625
+ 2B_pretrain/iter_000140000/optim/.metadata filter=lfs diff=lfs merge=lfs -text
626
+ 14B_G1_post-train/iter_000050000/model/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
627
+ 14B_G1_post-train/iter_000050000/model/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
628
+ 14B_G1_post-train/iter_000050000/model/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
629
+ 14B_G1_post-train/iter_000050000/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
630
+ 14B_G1_post-train/iter_000050000/model/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
631
+ 14B_G1_post-train/iter_000050000/model/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
632
+ 14B_G1_post-train/iter_000050000/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
633
+ 14B_G1_post-train/iter_000050000/model/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
634
+ 14B_G1_post-train/iter_000050000/model/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
635
+ 14B_G1_post-train/iter_000050000/model/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
636
+ 2B_pretrain/iter_000140000/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
637
+ 2B_pretrain/iter_000140000/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
638
+ 2B_pretrain/iter_000140000/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
639
+ 2B_pretrain/iter_000140000/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
640
+ 2B_GR1_post-train/iter_000050000/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
641
+ 2B_pretrain/iter_000140000/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
642
+ 2B_pretrain/iter_000140000/model/.metadata filter=lfs diff=lfs merge=lfs -text
643
+ 2B_GR1_post-train/iter_000050000/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
644
+ 2B_pretrain/iter_000140000/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
645
+ 2B_GR1_post-train/iter_000050000/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
646
+ 2B_GR1_post-train/iter_000050000/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
647
+ 2B_pretrain/iter_000140000/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
648
+ 2B_GR1_post-train/iter_000050000/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
649
+ 2B_pretrain/iter_000140000/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
650
+ 2B_GR1_post-train/iter_000050000/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
651
+ 2B_GR1_post-train/iter_000050000/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
652
+ 2B_GR1_post-train/iter_000050000/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
653
+ 14B_YAM_post-train/iter_000050000/optim/.metadata filter=lfs diff=lfs merge=lfs -text
654
+ 2B_pretrain/iter_000140000/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
655
+ 2B_pretrain/iter_000140000/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
656
+ 2B_pretrain/iter_000140000/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
657
+ 2B_pretrain/iter_000140000/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
658
+ 2B_pretrain/iter_000140000/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
659
+ 2B_pretrain/iter_000140000/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
660
+ 2B_pretrain/iter_000140000/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
661
+ 2B_pretrain/iter_000140000/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
662
+ 14B_YAM_post-train/iter_000050000/optim/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
663
+ 14B_YAM_post-train/iter_000050000/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
664
+ 14B_YAM_post-train/iter_000050000/optim/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
665
+ 14B_YAM_post-train/iter_000050000/optim/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
666
+ 14B_YAM_post-train/iter_000050000/optim/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
667
+ 14B_YAM_post-train/iter_000050000/optim/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
668
+ 14B_YAM_post-train/iter_000050000/optim/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
669
+ 14B_YAM_post-train/iter_000050000/optim/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
670
+ 14B_YAM_post-train/iter_000050000/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
671
+ 14B_YAM_post-train/iter_000050000/optim/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
672
+ 14B_YAM_post-train/iter_000050000/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
673
+ 14B_YAM_post-train/iter_000050000/optim/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
674
+ 14B_YAM_post-train/iter_000050000/optim/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
675
+ 14B_YAM_post-train/iter_000050000/optim/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
676
+ 14B_YAM_post-train/iter_000050000/optim/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
677
+ 14B_YAM_post-train/iter_000050000/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
678
+ 14B_YAM_post-train/iter_000050000/optim/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
679
+ 14B_YAM_post-train/iter_000050000/optim/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
680
+ 14B_YAM_post-train/iter_000050000/optim/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
681
+ 14B_YAM_post-train/iter_000050000/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
682
+ 14B_YAM_post-train/iter_000050000/optim/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
683
+ 14B_YAM_post-train/iter_000050000/optim/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
684
+ 14B_YAM_post-train/iter_000050000/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
685
+ 14B_YAM_post-train/iter_000050000/optim/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
686
+ 14B_YAM_post-train/iter_000050000/optim/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
687
+ 14B_YAM_post-train/iter_000050000/optim/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
688
+ 14B_YAM_post-train/iter_000050000/optim/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
689
+ 14B_YAM_post-train/iter_000050000/model/.metadata filter=lfs diff=lfs merge=lfs -text
690
+ 14B_YAM_post-train/iter_000050000/optim/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
691
+ 14B_YAM_post-train/iter_000050000/optim/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
692
+ 14B_YAM_post-train/iter_000050000/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
693
+ 14B_YAM_post-train/iter_000050000/optim/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
694
+ 14B_YAM_post-train/iter_000050000/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
695
+ 14B_pretrain/iter_000140000/optim/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
696
+ 14B_pretrain/iter_000140000/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
697
+ 14B_pretrain/iter_000140000/optim/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
698
+ 14B_pretrain/iter_000140000/optim/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
699
+ 14B_YAM_post-train/iter_000050000/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
700
+ 14B_YAM_post-train/iter_000050000/model/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
701
+ 14B_YAM_post-train/iter_000050000/model/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
702
+ 14B_pretrain/iter_000140000/optim/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
703
+ 14B_YAM_post-train/iter_000050000/model/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
704
+ 14B_YAM_post-train/iter_000050000/model/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
705
+ 14B_YAM_post-train/iter_000050000/model/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
706
+ 14B_YAM_post-train/iter_000050000/model/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
707
+ 14B_YAM_post-train/iter_000050000/model/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
708
+ 14B_YAM_post-train/iter_000050000/model/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
709
+ 14B_YAM_post-train/iter_000050000/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
710
+ 14B_YAM_post-train/iter_000050000/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
711
+ 14B_YAM_post-train/iter_000050000/model/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
712
+ 14B_YAM_post-train/iter_000050000/model/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
713
+ 14B_AgiBot_post-train/iter_000050000/optim/.metadata filter=lfs diff=lfs merge=lfs -text
714
+ 2B_G1_post-train/iter_000050000/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
715
+ 2B_G1_post-train/iter_000050000/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
716
+ 2B_G1_post-train/iter_000050000/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
717
+ 2B_G1_post-train/iter_000050000/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
718
+ 2B_G1_post-train/iter_000050000/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
719
+ 2B_G1_post-train/iter_000050000/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
720
+ 2B_G1_post-train/iter_000050000/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
721
+ 2B_G1_post-train/iter_000050000/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
722
+ 14B_AgiBot_post-train/iter_000050000/model/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
723
+ 14B_AgiBot_post-train/iter_000050000/model/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
724
+ 14B_AgiBot_post-train/iter_000050000/model/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
725
+ 14B_AgiBot_post-train/iter_000050000/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
726
+ 14B_AgiBot_post-train/iter_000050000/model/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
727
+ 14B_AgiBot_post-train/iter_000050000/model/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
728
+ 14B_AgiBot_post-train/iter_000050000/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
729
+ 14B_AgiBot_post-train/iter_000050000/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
730
+ 14B_AgiBot_post-train/iter_000050000/model/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
731
+ 14B_AgiBot_post-train/iter_000050000/model/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
732
+ 14B_AgiBot_post-train/iter_000050000/model/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
733
+ 14B_AgiBot_post-train/iter_000050000/model/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
734
+ 14B_AgiBot_post-train/iter_000050000/model/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
735
+ 14B_AgiBot_post-train/iter_000050000/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
736
+ 14B_AgiBot_post-train/iter_000050000/model/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
737
+ 14B_AgiBot_post-train/iter_000050000/model/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
738
+ 14B_AgiBot_post-train/iter_000050000/model/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
739
+ 14B_AgiBot_post-train/iter_000050000/model/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
740
+ 14B_AgiBot_post-train/iter_000050000/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
741
+ 14B_AgiBot_post-train/iter_000050000/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
742
+ 14B_AgiBot_post-train/iter_000050000/optim/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
743
+ 14B_AgiBot_post-train/iter_000050000/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
744
+ 14B_AgiBot_post-train/iter_000050000/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
745
+ 14B_AgiBot_post-train/iter_000050000/model/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
746
+ 14B_AgiBot_post-train/iter_000050000/model/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
747
+ 14B_AgiBot_post-train/iter_000050000/model/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
748
+ 14B_AgiBot_post-train/iter_000050000/model/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
749
+ 14B_AgiBot_post-train/iter_000050000/model/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
750
+ 14B_AgiBot_post-train/iter_000050000/model/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
751
+ 14B_AgiBot_post-train/iter_000050000/model/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
752
+ 14B_AgiBot_post-train/iter_000050000/model/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
753
+ 14B_AgiBot_post-train/iter_000050000/model/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
754
+ 14B_AgiBot_post-train/iter_000050000/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
755
+ 14B_AgiBot_post-train/iter_000050000/model/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
756
+ 14B_AgiBot_post-train/iter_000050000/model/.metadata filter=lfs diff=lfs merge=lfs -text
757
+ 14B_AgiBot_post-train/iter_000050000/optim/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
758
+ 14B_AgiBot_post-train/iter_000050000/optim/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
759
+ 14B_AgiBot_post-train/iter_000050000/optim/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
760
+ 14B_AgiBot_post-train/iter_000050000/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
761
+ 14B_AgiBot_post-train/iter_000050000/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
762
+ 14B_AgiBot_post-train/iter_000050000/optim/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
763
+ 14B_AgiBot_post-train/iter_000050000/optim/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
764
+ 14B_AgiBot_post-train/iter_000050000/optim/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
765
+ 14B_AgiBot_post-train/iter_000050000/optim/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
766
+ 14B_AgiBot_post-train/iter_000050000/optim/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
767
+ 14B_AgiBot_post-train/iter_000050000/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
768
+ 14B_AgiBot_post-train/iter_000050000/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
769
+ 14B_AgiBot_post-train/iter_000050000/optim/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
770
+ 14B_AgiBot_post-train/iter_000050000/optim/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
771
+ 14B_AgiBot_post-train/iter_000050000/optim/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
772
+ 14B_AgiBot_post-train/iter_000050000/optim/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
773
+ 14B_AgiBot_post-train/iter_000050000/optim/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
774
+ 14B_AgiBot_post-train/iter_000050000/optim/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
775
+ 14B_YAM_post-train/iter_000050000/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
776
+ 14B_YAM_post-train/iter_000050000/model/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
777
+ 14B_YAM_post-train/iter_000050000/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
778
+ 14B_YAM_post-train/iter_000050000/model/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
779
+ 14B_YAM_post-train/iter_000050000/model/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
780
+ 14B_pretrain/iter_000140000/optim/.metadata filter=lfs diff=lfs merge=lfs -text
781
+ 14B_YAM_post-train/iter_000050000/model/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
782
+ 14B_YAM_post-train/iter_000050000/model/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
783
+ 14B_YAM_post-train/iter_000050000/model/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
784
+ 14B_pretrain/iter_000140000/optim/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
785
+ 14B_YAM_post-train/iter_000050000/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
786
+ 14B_YAM_post-train/iter_000050000/model/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
787
+ 14B_YAM_post-train/iter_000050000/model/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
788
+ 14B_YAM_post-train/iter_000050000/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
789
+ 14B_YAM_post-train/iter_000050000/model/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
790
+ 14B_YAM_post-train/iter_000050000/model/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
791
+ 14B_YAM_post-train/iter_000050000/model/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
792
+ 14B_YAM_post-train/iter_000050000/model/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
793
+ 14B_YAM_post-train/iter_000050000/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
794
+ 14B_YAM_post-train/iter_000050000/model/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
795
+ 14B_YAM_post-train/iter_000050000/model/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
796
+ 14B_pretrain/iter_000140000/optim/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
797
+ 14B_pretrain/iter_000140000/optim/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
798
+ 14B_pretrain/iter_000140000/optim/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
799
+ 14B_pretrain/iter_000140000/optim/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
800
+ 14B_pretrain/iter_000140000/optim/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
801
+ 14B_pretrain/iter_000140000/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
802
+ 14B_pretrain/iter_000140000/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
803
+ 14B_pretrain/iter_000140000/optim/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
804
+ 14B_pretrain/iter_000140000/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
805
+ 14B_pretrain/iter_000140000/optim/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
806
+ 14B_pretrain/iter_000140000/optim/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
807
+ 14B_pretrain/iter_000140000/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
808
+ 14B_pretrain/iter_000140000/optim/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
809
+ 14B_pretrain/iter_000140000/optim/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
810
+ 14B_pretrain/iter_000140000/optim/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
811
+ 14B_pretrain/iter_000140000/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
812
+ 14B_pretrain/iter_000140000/optim/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
813
+ 14B_pretrain/iter_000140000/optim/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
814
+ 14B_pretrain/iter_000140000/optim/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
815
+ 14B_pretrain/iter_000140000/model/.metadata filter=lfs diff=lfs merge=lfs -text
816
+ 14B_pretrain/iter_000140000/optim/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
817
+ 14B_pretrain/iter_000140000/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
818
+ 14B_pretrain/iter_000140000/optim/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
819
+ 14B_pretrain/iter_000140000/optim/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
820
+ 14B_pretrain/iter_000140000/optim/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
821
+ 14B_pretrain/iter_000140000/optim/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
822
+ 14B_pretrain/iter_000140000/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
823
+ 2B_G1_post-train/iter_000050000/optim/.metadata filter=lfs diff=lfs merge=lfs -text
824
+ 2B_G1_post-train/iter_000050000/optim/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
825
+ 2B_G1_post-train/iter_000050000/optim/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
826
+ 2B_G1_post-train/iter_000050000/optim/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
827
+ 2B_G1_post-train/iter_000050000/optim/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
828
+ 14B_pretrain/iter_000140000/model/__30_0.distcp filter=lfs diff=lfs merge=lfs -text
829
+ 14B_pretrain/iter_000140000/model/__29_0.distcp filter=lfs diff=lfs merge=lfs -text
830
+ 14B_pretrain/iter_000140000/model/__11_0.distcp filter=lfs diff=lfs merge=lfs -text
831
+ 14B_pretrain/iter_000140000/model/__28_0.distcp filter=lfs diff=lfs merge=lfs -text
832
+ 14B_pretrain/iter_000140000/model/__2_0.distcp filter=lfs diff=lfs merge=lfs -text
833
+ 14B_pretrain/iter_000140000/model/__24_0.distcp filter=lfs diff=lfs merge=lfs -text
834
+ 14B_pretrain/iter_000140000/model/__31_0.distcp filter=lfs diff=lfs merge=lfs -text
835
+ 14B_pretrain/iter_000140000/model/__12_0.distcp filter=lfs diff=lfs merge=lfs -text
836
+ 14B_pretrain/iter_000140000/model/__27_0.distcp filter=lfs diff=lfs merge=lfs -text
837
+ 14B_pretrain/iter_000140000/model/__17_0.distcp filter=lfs diff=lfs merge=lfs -text
838
+ 14B_pretrain/iter_000140000/model/__18_0.distcp filter=lfs diff=lfs merge=lfs -text
839
+ 2B_G1_post-train/iter_000050000/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
840
+ 14B_pretrain/iter_000140000/model/__3_0.distcp filter=lfs diff=lfs merge=lfs -text
841
+ 14B_pretrain/iter_000140000/model/__4_0.distcp filter=lfs diff=lfs merge=lfs -text
842
+ 2B_G1_post-train/iter_000050000/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
843
+ 14B_pretrain/iter_000140000/model/__13_0.distcp filter=lfs diff=lfs merge=lfs -text
844
+ 14B_pretrain/iter_000140000/model/__5_0.distcp filter=lfs diff=lfs merge=lfs -text
845
+ 14B_pretrain/iter_000140000/model/__8_0.distcp filter=lfs diff=lfs merge=lfs -text
846
+ 2B_G1_post-train/iter_000050000/model/.metadata filter=lfs diff=lfs merge=lfs -text
847
+ 14B_pretrain/iter_000140000/model/__14_0.distcp filter=lfs diff=lfs merge=lfs -text
848
+ 2B_G1_post-train/iter_000050000/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
849
+ 2B_G1_post-train/iter_000050000/optim/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
850
+ 14B_pretrain/iter_000140000/model/__25_0.distcp filter=lfs diff=lfs merge=lfs -text
851
+ 14B_pretrain/iter_000140000/model/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
852
+ 14B_pretrain/iter_000140000/model/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
853
+ 14B_pretrain/iter_000140000/model/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
854
+ 14B_pretrain/iter_000140000/model/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
855
+ 14B_pretrain/iter_000140000/model/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
856
+ 14B_pretrain/iter_000140000/model/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
857
+ 14B_pretrain/iter_000140000/model/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
858
+ 14B_pretrain/iter_000140000/model/__16_0.distcp filter=lfs diff=lfs merge=lfs -text
859
+ 14B_pretrain/iter_000140000/model/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
860
+ 14B_pretrain/iter_000140000/model/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
861
+ 14B_pretrain/iter_000140000/model/__26_0.distcp filter=lfs diff=lfs merge=lfs -text
862
+ 14B_pretrain/iter_000140000/model/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
863
+ 14B_pretrain/iter_000140000/model/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
864
+ 14B_pretrain/iter_000140000/model/__0_0.distcp filter=lfs diff=lfs merge=lfs -text
865
+ 14B_AgiBot_post-train/iter_000050000/optim/__9_0.distcp filter=lfs diff=lfs merge=lfs -text
866
+ 14B_AgiBot_post-train/iter_000050000/optim/__20_0.distcp filter=lfs diff=lfs merge=lfs -text
867
+ 14B_AgiBot_post-train/iter_000050000/optim/__1_0.distcp filter=lfs diff=lfs merge=lfs -text
868
+ 14B_AgiBot_post-train/iter_000050000/optim/__19_0.distcp filter=lfs diff=lfs merge=lfs -text
869
+ 14B_AgiBot_post-train/iter_000050000/optim/__15_0.distcp filter=lfs diff=lfs merge=lfs -text
870
+ 14B_AgiBot_post-train/iter_000050000/optim/__21_0.distcp filter=lfs diff=lfs merge=lfs -text
871
+ 14B_AgiBot_post-train/iter_000050000/optim/__10_0.distcp filter=lfs diff=lfs merge=lfs -text
872
+ 14B_AgiBot_post-train/iter_000050000/optim/__6_0.distcp filter=lfs diff=lfs merge=lfs -text
873
+ 14B_AgiBot_post-train/iter_000050000/optim/__23_0.distcp filter=lfs diff=lfs merge=lfs -text
874
+ 14B_AgiBot_post-train/iter_000050000/optim/__22_0.distcp filter=lfs diff=lfs merge=lfs -text
875
+ 14B_AgiBot_post-train/iter_000050000/optim/__7_0.distcp filter=lfs diff=lfs merge=lfs -text
14B_AgiBot_post-train/iter_000050000/model/.metadata ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cf3d1162e710dbc96acacad10fa77e3b550fed0039bf6da8d8a8fb0fb7426afa
3
+ size 8262141
14B_AgiBot_post-train/iter_000050000/model/__0_0.distcp ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3ed18b12ec3f137b5da08f922543ffae3b8911a532666651327334aa1aa3449a
3
+ size 2787242818
14B_AgiBot_post-train/iter_000050000/model/__100_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__101_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__102_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__103_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__104_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__105_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__106_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__107_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__108_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__109_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__10_0.distcp ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4c8c0a1c0341679827a5527718cac4fe68daf1f4d6e4dd12c064b7494ef3d6a7
3
+ size 2786781074
14B_AgiBot_post-train/iter_000050000/model/__110_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__111_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__112_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__113_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__114_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__115_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__116_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__117_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__118_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__119_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__11_0.distcp ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:53ac9041f5db927d2538a75fe2bc8101d129a8a08b2d33e46f473787672e2d8f
3
+ size 2786781074
14B_AgiBot_post-train/iter_000050000/model/__120_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__121_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__122_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__123_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__124_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__125_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__126_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__127_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__128_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__129_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__12_0.distcp ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f52b0f4a736c3021c7e92677608e6d7738f2d6eeea852434ed173f80694e21e8
3
+ size 2786781074
14B_AgiBot_post-train/iter_000050000/model/__130_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__131_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__132_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__133_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__134_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__135_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__136_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__137_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__138_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__139_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__13_0.distcp ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a8adb694cda1285bf15e24210a40f9e9497f7e8162d54dd83d1423df0b430abc
3
+ size 2786781074
14B_AgiBot_post-train/iter_000050000/model/__140_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__141_0.distcp ADDED
File without changes
14B_AgiBot_post-train/iter_000050000/model/__142_0.distcp ADDED
File without changes