bear7011 commited on
Commit
38805cf
·
verified ·
1 Parent(s): 5ab0512

Upload final LoRA adapter artifacts

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,207 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: google/gemma-4-e4b-it
3
+ library_name: peft
4
+ pipeline_tag: text-generation
5
+ tags:
6
+ - base_model:adapter:google/gemma-4-e4b-it
7
+ - lora
8
+ - transformers
9
+ ---
10
+
11
+ # Model Card for Model ID
12
+
13
+ <!-- Provide a quick summary of what the model is/does. -->
14
+
15
+
16
+
17
+ ## Model Details
18
+
19
+ ### Model Description
20
+
21
+ <!-- Provide a longer summary of what this model is. -->
22
+
23
+
24
+
25
+ - **Developed by:** [More Information Needed]
26
+ - **Funded by [optional]:** [More Information Needed]
27
+ - **Shared by [optional]:** [More Information Needed]
28
+ - **Model type:** [More Information Needed]
29
+ - **Language(s) (NLP):** [More Information Needed]
30
+ - **License:** [More Information Needed]
31
+ - **Finetuned from model [optional]:** [More Information Needed]
32
+
33
+ ### Model Sources [optional]
34
+
35
+ <!-- Provide the basic links for the model. -->
36
+
37
+ - **Repository:** [More Information Needed]
38
+ - **Paper [optional]:** [More Information Needed]
39
+ - **Demo [optional]:** [More Information Needed]
40
+
41
+ ## Uses
42
+
43
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
44
+
45
+ ### Direct Use
46
+
47
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
48
+
49
+ [More Information Needed]
50
+
51
+ ### Downstream Use [optional]
52
+
53
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
54
+
55
+ [More Information Needed]
56
+
57
+ ### Out-of-Scope Use
58
+
59
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
60
+
61
+ [More Information Needed]
62
+
63
+ ## Bias, Risks, and Limitations
64
+
65
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
66
+
67
+ [More Information Needed]
68
+
69
+ ### Recommendations
70
+
71
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
72
+
73
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
74
+
75
+ ## How to Get Started with the Model
76
+
77
+ Use the code below to get started with the model.
78
+
79
+ [More Information Needed]
80
+
81
+ ## Training Details
82
+
83
+ ### Training Data
84
+
85
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
86
+
87
+ [More Information Needed]
88
+
89
+ ### Training Procedure
90
+
91
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
92
+
93
+ #### Preprocessing [optional]
94
+
95
+ [More Information Needed]
96
+
97
+
98
+ #### Training Hyperparameters
99
+
100
+ - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
101
+
102
+ #### Speeds, Sizes, Times [optional]
103
+
104
+ <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
105
+
106
+ [More Information Needed]
107
+
108
+ ## Evaluation
109
+
110
+ <!-- This section describes the evaluation protocols and provides the results. -->
111
+
112
+ ### Testing Data, Factors & Metrics
113
+
114
+ #### Testing Data
115
+
116
+ <!-- This should link to a Dataset Card if possible. -->
117
+
118
+ [More Information Needed]
119
+
120
+ #### Factors
121
+
122
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
123
+
124
+ [More Information Needed]
125
+
126
+ #### Metrics
127
+
128
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
129
+
130
+ [More Information Needed]
131
+
132
+ ### Results
133
+
134
+ [More Information Needed]
135
+
136
+ #### Summary
137
+
138
+
139
+
140
+ ## Model Examination [optional]
141
+
142
+ <!-- Relevant interpretability work for the model goes here -->
143
+
144
+ [More Information Needed]
145
+
146
+ ## Environmental Impact
147
+
148
+ <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
149
+
150
+ Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
151
+
152
+ - **Hardware Type:** [More Information Needed]
153
+ - **Hours used:** [More Information Needed]
154
+ - **Cloud Provider:** [More Information Needed]
155
+ - **Compute Region:** [More Information Needed]
156
+ - **Carbon Emitted:** [More Information Needed]
157
+
158
+ ## Technical Specifications [optional]
159
+
160
+ ### Model Architecture and Objective
161
+
162
+ [More Information Needed]
163
+
164
+ ### Compute Infrastructure
165
+
166
+ [More Information Needed]
167
+
168
+ #### Hardware
169
+
170
+ [More Information Needed]
171
+
172
+ #### Software
173
+
174
+ [More Information Needed]
175
+
176
+ ## Citation [optional]
177
+
178
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
179
+
180
+ **BibTeX:**
181
+
182
+ [More Information Needed]
183
+
184
+ **APA:**
185
+
186
+ [More Information Needed]
187
+
188
+ ## Glossary [optional]
189
+
190
+ <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
191
+
192
+ [More Information Needed]
193
+
194
+ ## More Information [optional]
195
+
196
+ [More Information Needed]
197
+
198
+ ## Model Card Authors [optional]
199
+
200
+ [More Information Needed]
201
+
202
+ ## Model Card Contact
203
+
204
+ [More Information Needed]
205
+ ### Framework versions
206
+
207
+ - PEFT 0.19.1
adapter_config.json ADDED
@@ -0,0 +1,91 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "alora_invocation_tokens": null,
3
+ "alpha_pattern": {},
4
+ "arrow_config": null,
5
+ "auto_mapping": null,
6
+ "base_model_name_or_path": "google/gemma-4-e4b-it",
7
+ "bias": "none",
8
+ "corda_config": null,
9
+ "ensure_weight_tying": false,
10
+ "eva_config": null,
11
+ "exclude_modules": null,
12
+ "fan_in_fan_out": false,
13
+ "inference_mode": true,
14
+ "init_lora_weights": true,
15
+ "layer_replication": null,
16
+ "layers_pattern": "language_model.layers",
17
+ "layers_to_transform": [
18
+ 0,
19
+ 1,
20
+ 2,
21
+ 3,
22
+ 4,
23
+ 5,
24
+ 6,
25
+ 7,
26
+ 8,
27
+ 9,
28
+ 10,
29
+ 11,
30
+ 12,
31
+ 13,
32
+ 14,
33
+ 15,
34
+ 16,
35
+ 17,
36
+ 18,
37
+ 19,
38
+ 20,
39
+ 21,
40
+ 22,
41
+ 23,
42
+ 24,
43
+ 25,
44
+ 26,
45
+ 27,
46
+ 28,
47
+ 29,
48
+ 30,
49
+ 31,
50
+ 32,
51
+ 33,
52
+ 34,
53
+ 35,
54
+ 36,
55
+ 37,
56
+ 38,
57
+ 39,
58
+ 40,
59
+ 41
60
+ ],
61
+ "loftq_config": {},
62
+ "lora_alpha": 32,
63
+ "lora_bias": false,
64
+ "lora_dropout": 0.05,
65
+ "lora_ga_config": null,
66
+ "megatron_config": null,
67
+ "megatron_core": "megatron.core",
68
+ "modules_to_save": null,
69
+ "peft_type": "LORA",
70
+ "peft_version": "0.19.1",
71
+ "qalora_group_size": 16,
72
+ "r": 16,
73
+ "rank_pattern": {},
74
+ "revision": null,
75
+ "target_modules": [
76
+ "o_proj",
77
+ "k_proj",
78
+ "v_proj",
79
+ "up_proj",
80
+ "gate_proj",
81
+ "down_proj",
82
+ "q_proj"
83
+ ],
84
+ "target_parameters": null,
85
+ "task_type": "CAUSAL_LM",
86
+ "trainable_token_indices": null,
87
+ "use_bdlora": null,
88
+ "use_dora": false,
89
+ "use_qalora": false,
90
+ "use_rslora": false
91
+ }
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d065fae51a8cb3acae2dd90b6674d59983f73352c4209f99f68515850d10877f
3
+ size 69839888
chat_template.jinja ADDED
@@ -0,0 +1,351 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {%- macro format_parameters(properties, required, filter_keys=false) -%}
2
+ {%- set standard_keys = ['description', 'type', 'properties', 'required', 'nullable'] -%}
3
+ {%- set ns = namespace(found_first=false) -%}
4
+ {%- for key, value in properties | dictsort -%}
5
+ {%- set add_comma = false -%}
6
+ {%- if not filter_keys or key not in standard_keys -%}
7
+ {%- if ns.found_first %},{% endif -%}
8
+ {%- set ns.found_first = true -%}
9
+ {{ key }}:{
10
+ {%- if value['description'] -%}
11
+ description:<|"|>{{ value['description'] }}<|"|>
12
+ {%- set add_comma = true -%}
13
+ {%- endif -%}
14
+ {%- if value['type'] | upper == 'STRING' -%}
15
+ {%- if value['enum'] -%}
16
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
17
+ enum:{{ format_argument(value['enum']) }}
18
+ {%- endif -%}
19
+ {%- elif value['type'] | upper == 'ARRAY' -%}
20
+ {%- if value['items'] is mapping and value['items'] -%}
21
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
22
+ items:{
23
+ {%- set ns_items = namespace(found_first=false) -%}
24
+ {%- for item_key, item_value in value['items'] | dictsort -%}
25
+ {%- if item_value is not none -%}
26
+ {%- if ns_items.found_first %},{% endif -%}
27
+ {%- set ns_items.found_first = true -%}
28
+ {%- if item_key == 'properties' -%}
29
+ properties:{
30
+ {%- if item_value is mapping -%}
31
+ {{- format_parameters(item_value, value['items']['required'] | default([])) -}}
32
+ {%- endif -%}
33
+ }
34
+ {%- elif item_key == 'required' -%}
35
+ required:[
36
+ {%- for req_item in item_value -%}
37
+ <|"|>{{- req_item -}}<|"|>
38
+ {%- if not loop.last %},{% endif -%}
39
+ {%- endfor -%}
40
+ ]
41
+ {%- elif item_key == 'type' -%}
42
+ {%- if item_value is string -%}
43
+ type:{{ format_argument(item_value | upper) }}
44
+ {%- else -%}
45
+ type:{{ format_argument(item_value | map('upper') | list) }}
46
+ {%- endif -%}
47
+ {%- else -%}
48
+ {{ item_key }}:{{ format_argument(item_value) }}
49
+ {%- endif -%}
50
+ {%- endif -%}
51
+ {%- endfor -%}
52
+ }
53
+ {%- endif -%}
54
+ {%- endif -%}
55
+ {%- if value['nullable'] %}
56
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
57
+ nullable:true
58
+ {%- endif -%}
59
+ {%- if value['type'] | upper == 'OBJECT' -%}
60
+ {%- if value['properties'] is defined and value['properties'] is mapping -%}
61
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
62
+ properties:{
63
+ {{- format_parameters(value['properties'], value['required'] | default([])) -}}
64
+ }
65
+ {%- elif value is mapping -%}
66
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
67
+ properties:{
68
+ {{- format_parameters(value, value['required'] | default([]), filter_keys=true) -}}
69
+ }
70
+ {%- endif -%}
71
+ {%- if value['required'] -%}
72
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
73
+ required:[
74
+ {%- for item in value['required'] | default([]) -%}
75
+ <|"|>{{- item -}}<|"|>
76
+ {%- if not loop.last %},{% endif -%}
77
+ {%- endfor -%}
78
+ ]
79
+ {%- endif -%}
80
+ {%- endif -%}
81
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
82
+ type:<|"|>{{ value['type'] | upper }}<|"|>}
83
+ {%- endif -%}
84
+ {%- endfor -%}
85
+ {%- endmacro -%}
86
+ {%- macro format_function_declaration(tool_data) -%}
87
+ declaration:{{- tool_data['function']['name'] -}}{description:<|"|>{{- tool_data['function']['description'] -}}<|"|>
88
+ {%- set params = tool_data['function']['parameters'] -%}
89
+ {%- if params -%}
90
+ ,parameters:{
91
+ {%- if params['properties'] -%}
92
+ properties:{ {{- format_parameters(params['properties'], params['required']) -}} },
93
+ {%- endif -%}
94
+ {%- if params['required'] -%}
95
+ required:[
96
+ {%- for item in params['required'] -%}
97
+ <|"|>{{- item -}}<|"|>
98
+ {{- ',' if not loop.last -}}
99
+ {%- endfor -%}
100
+ ],
101
+ {%- endif -%}
102
+ {%- if params['type'] -%}
103
+ type:<|"|>{{- params['type'] | upper -}}<|"|>}
104
+ {%- endif -%}
105
+ {%- endif -%}
106
+ {%- if 'response' in tool_data['function'] -%}
107
+ {%- set response_declaration = tool_data['function']['response'] -%}
108
+ ,response:{
109
+ {%- if response_declaration['description'] -%}
110
+ description:<|"|>{{- response_declaration['description'] -}}<|"|>,
111
+ {%- endif -%}
112
+ {%- if response_declaration['type'] | upper == 'OBJECT' -%}
113
+ type:<|"|>{{- response_declaration['type'] | upper -}}<|"|>}
114
+ {%- endif -%}
115
+ {%- endif -%}
116
+ }
117
+ {%- endmacro -%}
118
+ {%- macro format_argument(argument, escape_keys=True) -%}
119
+ {%- if argument is string -%}
120
+ {{- '<|"|>' + argument + '<|"|>' -}}
121
+ {%- elif argument is boolean -%}
122
+ {{- 'true' if argument else 'false' -}}
123
+ {%- elif argument is mapping -%}
124
+ {{- '{' -}}
125
+ {%- set ns = namespace(found_first=false) -%}
126
+ {%- for key, value in argument | dictsort -%}
127
+ {%- if ns.found_first %},{% endif -%}
128
+ {%- set ns.found_first = true -%}
129
+ {%- if escape_keys -%}
130
+ {{- '<|"|>' + key + '<|"|>' -}}
131
+ {%- else -%}
132
+ {{- key -}}
133
+ {%- endif -%}
134
+ :{{- format_argument(value, escape_keys=escape_keys) -}}
135
+ {%- endfor -%}
136
+ {{- '}' -}}
137
+ {%- elif argument is sequence -%}
138
+ {{- '[' -}}
139
+ {%- for item in argument -%}
140
+ {{- format_argument(item, escape_keys=escape_keys) -}}
141
+ {%- if not loop.last %},{% endif -%}
142
+ {%- endfor -%}
143
+ {{- ']' -}}
144
+ {%- else -%}
145
+ {{- argument -}}
146
+ {%- endif -%}
147
+ {%- endmacro -%}
148
+ {%- macro strip_thinking(text) -%}
149
+ {%- set ns = namespace(result='') -%}
150
+ {%- for part in text.split('<channel|>') -%}
151
+ {%- if '<|channel>' in part -%}
152
+ {%- set ns.result = ns.result + part.split('<|channel>')[0] -%}
153
+ {%- else -%}
154
+ {%- set ns.result = ns.result + part -%}
155
+ {%- endif -%}
156
+ {%- endfor -%}
157
+ {{- ns.result | trim -}}
158
+ {%- endmacro -%}
159
+
160
+ {%- macro format_tool_response_block(tool_name, response) -%}
161
+ {{- '<|tool_response>' -}}
162
+ {%- if response is mapping -%}
163
+ {{- 'response:' + tool_name + '{' -}}
164
+ {%- for key, value in response | dictsort -%}
165
+ {{- key -}}:{{- format_argument(value, escape_keys=False) -}}
166
+ {%- if not loop.last %},{% endif -%}
167
+ {%- endfor -%}
168
+ {{- '}' -}}
169
+ {%- else -%}
170
+ {{- 'response:' + tool_name + '{value:' + format_argument(response, escape_keys=False) + '}' -}}
171
+ {%- endif -%}
172
+ {{- '<tool_response|>' -}}
173
+ {%- endmacro -%}
174
+
175
+ {%- set ns = namespace(prev_message_type=None) -%}
176
+ {%- set loop_messages = messages -%}
177
+ {{- bos_token -}}
178
+ {#- Handle System/Tool Definitions Block -#}
179
+ {%- if (enable_thinking is defined and enable_thinking) or tools or messages[0]['role'] in ['system', 'developer'] -%}
180
+ {{- '<|turn>system\n' -}}
181
+ {#- Inject Thinking token at the very top of the FIRST system turn -#}
182
+ {%- if enable_thinking is defined and enable_thinking -%}
183
+ {{- '<|think|>\n' -}}
184
+ {%- set ns.prev_message_type = 'think' -%}
185
+ {%- endif -%}
186
+ {%- if messages[0]['role'] in ['system', 'developer'] -%}
187
+ {%- if messages[0]['content'] is string -%}
188
+ {{- messages[0]['content'] | trim -}}
189
+ {%- elif messages[0]['content'] is sequence -%}
190
+ {%- for item in messages[0]['content'] -%}
191
+ {{- item['text'] | trim + ' '-}}
192
+ {%- endfor -%}
193
+ {%- endif -%}
194
+ {%- set loop_messages = messages[1:] -%}
195
+ {%- endif -%}
196
+ {%- if tools -%}
197
+ {%- for tool in tools %}
198
+ {{- '<|tool>' -}}
199
+ {{- format_function_declaration(tool) | trim -}}
200
+ {{- '<tool|>' -}}
201
+ {%- endfor %}
202
+ {%- set ns.prev_message_type = 'tool' -%}
203
+ {%- endif -%}
204
+ {{- '<turn|>\n' -}}
205
+ {%- endif %}
206
+
207
+ {#- Pre-scan: find last user message index for reasoning guard -#}
208
+ {%- set ns_turn = namespace(last_user_idx=-1) -%}
209
+ {%- for i in range(loop_messages | length) -%}
210
+ {%- if loop_messages[i]['role'] == 'user' -%}
211
+ {%- set ns_turn.last_user_idx = i -%}
212
+ {%- endif -%}
213
+ {%- endfor -%}
214
+
215
+ {#- Loop through messages -#}
216
+ {%- for message in loop_messages -%}
217
+ {%- if message['role'] != 'tool' -%}
218
+ {%- set ns.prev_message_type = None -%}
219
+ {%- set role = 'model' if message['role'] == 'assistant' else message['role'] -%}
220
+ {#- Detect continuation: suppress duplicate <|turn>model when previous non-tool message was also assistant -#}
221
+ {%- set prev_nt = namespace(role=None, found=false) -%}
222
+ {%- if loop.index0 > 0 -%}
223
+ {%- for j in range(loop.index0 - 1, -1, -1) -%}
224
+ {%- if not prev_nt.found -%}
225
+ {%- if loop_messages[j]['role'] != 'tool' -%}
226
+ {%- set prev_nt.role = loop_messages[j]['role'] -%}
227
+ {%- set prev_nt.found = true -%}
228
+ {%- endif -%}
229
+ {%- endif -%}
230
+ {%- endfor -%}
231
+ {%- endif -%}
232
+ {%- set continue_same_model_turn = (role == 'model' and prev_nt.role == 'assistant') -%}
233
+ {%- if not continue_same_model_turn -%}
234
+ {{- '<|turn>' + role + '\n' }}
235
+ {%- endif -%}
236
+
237
+ {#- Render reasoning/reasoning_content as thinking channel -#}
238
+ {%- set thinking_text = message.get('reasoning') or message.get('reasoning_content') -%}
239
+ {%- if thinking_text and loop.index0 > ns_turn.last_user_idx and message.get('tool_calls') -%}
240
+ {{- '<|channel>thought\n' + thinking_text + '\n<channel|>' -}}
241
+ {%- endif -%}
242
+
243
+ {%- if message['tool_calls'] -%}
244
+ {%- for tool_call in message['tool_calls'] -%}
245
+ {%- set function = tool_call['function'] -%}
246
+ {{- '<|tool_call>call:' + function['name'] + '{' -}}
247
+ {%- if function['arguments'] is mapping -%}
248
+ {%- set ns_args = namespace(found_first=false) -%}
249
+ {%- for key, value in function['arguments'] | dictsort -%}
250
+ {%- if ns_args.found_first %},{% endif -%}
251
+ {%- set ns_args.found_first = true -%}
252
+ {{- key -}}:{{- format_argument(value, escape_keys=False) -}}
253
+ {%- endfor -%}
254
+ {%- elif function['arguments'] is string -%}
255
+ {{- function['arguments'] -}}
256
+ {%- endif -%}
257
+ {{- '}<tool_call|>' -}}
258
+ {%- endfor -%}
259
+ {%- set ns.prev_message_type = 'tool_call' -%}
260
+ {%- endif -%}
261
+
262
+ {%- set ns_tr_out = namespace(flag=false) -%}
263
+ {%- if message.get('tool_responses') -%}
264
+ {#- Legacy: tool_responses embedded on the assistant message (Google/Gemma native) -#}
265
+ {%- for tool_response in message['tool_responses'] -%}
266
+ {{- format_tool_response_block(tool_response['name'] | default('unknown'), tool_response['response']) -}}
267
+ {%- set ns_tr_out.flag = true -%}
268
+ {%- set ns.prev_message_type = 'tool_response' -%}
269
+ {%- endfor -%}
270
+ {%- elif message.get('tool_calls') -%}
271
+ {#- OpenAI Chat Completions: forward-scan consecutive role:tool messages -#}
272
+ {%- set ns_tool_scan = namespace(stopped=false) -%}
273
+ {%- for k in range(loop.index0 + 1, loop_messages | length) -%}
274
+ {%- if ns_tool_scan.stopped -%}
275
+ {%- elif loop_messages[k]['role'] != 'tool' -%}
276
+ {%- set ns_tool_scan.stopped = true -%}
277
+ {%- else -%}
278
+ {%- set follow = loop_messages[k] -%}
279
+ {#- Resolve tool_call_id to function name -#}
280
+ {%- set ns_tname = namespace(name=follow.get('name') | default('unknown')) -%}
281
+ {%- for tc in message['tool_calls'] -%}
282
+ {%- if tc.get('id') == follow.get('tool_call_id') -%}
283
+ {%- set ns_tname.name = tc['function']['name'] -%}
284
+ {%- endif -%}
285
+ {%- endfor -%}
286
+ {#- Handle content as string or content-parts array -#}
287
+ {%- set tool_body = follow.get('content') -%}
288
+ {%- if tool_body is string -%}
289
+ {{- format_tool_response_block(ns_tname.name, tool_body) -}}
290
+ {%- elif tool_body is sequence and tool_body is not string -%}
291
+ {%- set ns_txt = namespace(s='') -%}
292
+ {%- for part in tool_body -%}
293
+ {%- if part.get('type') == 'text' -%}
294
+ {%- set ns_txt.s = ns_txt.s + (part.get('text') | default('')) -%}
295
+ {%- endif -%}
296
+ {%- endfor -%}
297
+ {{- format_tool_response_block(ns_tname.name, ns_txt.s) -}}
298
+ {%- else -%}
299
+ {{- format_tool_response_block(ns_tname.name, tool_body) -}}
300
+ {%- endif -%}
301
+ {%- set ns_tr_out.flag = true -%}
302
+ {%- set ns.prev_message_type = 'tool_response' -%}
303
+ {%- endif -%}
304
+ {%- endfor -%}
305
+ {%- endif -%}
306
+
307
+ {%- set captured_content -%}
308
+ {%- if message['content'] is string -%}
309
+ {%- if role == 'model' -%}
310
+ {{- strip_thinking(message['content']) -}}
311
+ {%- else -%}
312
+ {{- message['content'] | trim -}}
313
+ {%- endif -%}
314
+ {%- elif message['content'] is sequence -%}
315
+ {%- for item in message['content'] -%}
316
+ {%- if item['type'] == 'text' -%}
317
+ {%- if role == 'model' -%}
318
+ {{- strip_thinking(item['text']) -}}
319
+ {%- else -%}
320
+ {{- item['text'] | trim -}}
321
+ {%- endif -%}
322
+ {%- elif item['type'] == 'image' -%}
323
+ {{- '<|image|>' -}}
324
+ {%- set ns.prev_message_type = 'image' -%}
325
+ {%- elif item['type'] == 'audio' -%}
326
+ {{- '<|audio|>' -}}
327
+ {%- set ns.prev_message_type = 'audio' -%}
328
+ {%- elif item['type'] == 'video' -%}
329
+ {{- '<|video|>' -}}
330
+ {%- set ns.prev_message_type = 'video' -%}
331
+ {%- endif -%}
332
+ {%- endfor -%}
333
+ {%- endif -%}
334
+ {%- endset -%}
335
+
336
+ {{- captured_content -}}
337
+ {%- set has_content = captured_content | trim | length > 0 -%}
338
+
339
+ {%- if ns.prev_message_type == 'tool_call' and not ns_tr_out.flag -%}
340
+ {{- '<|tool_response>' -}}
341
+ {%- elif not (ns_tr_out.flag and not has_content) -%}
342
+ {{- '<turn|>\n' -}}
343
+ {%- endif -%}
344
+ {%- endif -%}
345
+ {%- endfor -%}
346
+
347
+ {%- if add_generation_prompt -%}
348
+ {%- if ns.prev_message_type != 'tool_response' and ns.prev_message_type != 'tool_call' -%}
349
+ {{- '<|turn>model\n' -}}
350
+ {%- endif -%}
351
+ {%- endif -%}
processor_config.json ADDED
@@ -0,0 +1,75 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "audio_ms_per_token": 40,
3
+ "audio_seq_length": 750,
4
+ "feature_extractor": {
5
+ "dither": 0.0,
6
+ "feature_extractor_type": "Gemma4AudioFeatureExtractor",
7
+ "feature_size": 128,
8
+ "fft_length": 512,
9
+ "fft_overdrive": false,
10
+ "frame_length": 320,
11
+ "hop_length": 160,
12
+ "input_scale_factor": 1.0,
13
+ "max_frequency": 8000.0,
14
+ "mel_floor": 0.001,
15
+ "min_frequency": 0.0,
16
+ "padding_side": "right",
17
+ "padding_value": 0.0,
18
+ "per_bin_mean": null,
19
+ "per_bin_stddev": null,
20
+ "preemphasis": 0.0,
21
+ "preemphasis_htk_flavor": true,
22
+ "return_attention_mask": true,
23
+ "sampling_rate": 16000
24
+ },
25
+ "image_processor": {
26
+ "do_convert_rgb": true,
27
+ "do_normalize": false,
28
+ "do_rescale": true,
29
+ "do_resize": true,
30
+ "image_mean": [
31
+ 0.0,
32
+ 0.0,
33
+ 0.0
34
+ ],
35
+ "image_processor_type": "Gemma4ImageProcessor",
36
+ "image_seq_length": 280,
37
+ "image_std": [
38
+ 1.0,
39
+ 1.0,
40
+ 1.0
41
+ ],
42
+ "max_soft_tokens": 280,
43
+ "patch_size": 16,
44
+ "pooling_kernel_size": 3,
45
+ "resample": 3,
46
+ "rescale_factor": 0.00392156862745098
47
+ },
48
+ "image_seq_length": 280,
49
+ "processor_class": "Gemma4Processor",
50
+ "video_processor": {
51
+ "do_convert_rgb": true,
52
+ "do_normalize": true,
53
+ "do_rescale": true,
54
+ "do_resize": true,
55
+ "do_sample_frames": true,
56
+ "image_mean": [
57
+ 0.0,
58
+ 0.0,
59
+ 0.0
60
+ ],
61
+ "image_std": [
62
+ 1.0,
63
+ 1.0,
64
+ 1.0
65
+ ],
66
+ "max_soft_tokens": 70,
67
+ "num_frames": 32,
68
+ "patch_size": 16,
69
+ "pooling_kernel_size": 3,
70
+ "resample": 3,
71
+ "rescale_factor": 0.00392156862745098,
72
+ "return_metadata": false,
73
+ "video_processor_type": "Gemma4VideoProcessor"
74
+ }
75
+ }
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc8d3a0ce36466ccc1278bf987df5f71db1719b9ca6b4118264f45cb627bfe0f
3
+ size 32169626
tokenizer_config.json ADDED
@@ -0,0 +1,95 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "audio_token": "<|audio|>",
3
+ "backend": "tokenizers",
4
+ "boa_token": "<|audio>",
5
+ "boi_token": "<|image>",
6
+ "bos_token": "<bos>",
7
+ "eoa_token": "<audio|>",
8
+ "eoc_token": "<channel|>",
9
+ "eoi_token": "<image|>",
10
+ "eos_token": "<eos>",
11
+ "eot_token": "<turn|>",
12
+ "escape_token": "<|\"|>",
13
+ "etc_token": "<tool_call|>",
14
+ "etd_token": "<tool|>",
15
+ "etr_token": "<tool_response|>",
16
+ "extra_special_tokens": [
17
+ "<|video|>"
18
+ ],
19
+ "image_token": "<|image|>",
20
+ "is_local": false,
21
+ "mask_token": "<mask>",
22
+ "model_max_length": 1000000000000000019884624838656,
23
+ "model_specific_special_tokens": {
24
+ "audio_token": "<|audio|>",
25
+ "boa_token": "<|audio>",
26
+ "boi_token": "<|image>",
27
+ "eoa_token": "<audio|>",
28
+ "eoc_token": "<channel|>",
29
+ "eoi_token": "<image|>",
30
+ "eot_token": "<turn|>",
31
+ "escape_token": "<|\"|>",
32
+ "etc_token": "<tool_call|>",
33
+ "etd_token": "<tool|>",
34
+ "etr_token": "<tool_response|>",
35
+ "image_token": "<|image|>",
36
+ "soc_token": "<|channel>",
37
+ "sot_token": "<|turn>",
38
+ "stc_token": "<|tool_call>",
39
+ "std_token": "<|tool>",
40
+ "str_token": "<|tool_response>",
41
+ "think_token": "<|think|>"
42
+ },
43
+ "pad_token": "<pad>",
44
+ "padding_side": "left",
45
+ "processor_class": "Gemma4Processor",
46
+ "response_schema": {
47
+ "properties": {
48
+ "content": {
49
+ "type": "string"
50
+ },
51
+ "role": {
52
+ "const": "assistant"
53
+ },
54
+ "thinking": {
55
+ "type": "string"
56
+ },
57
+ "tool_calls": {
58
+ "items": {
59
+ "properties": {
60
+ "function": {
61
+ "properties": {
62
+ "arguments": {
63
+ "additionalProperties": {},
64
+ "type": "object",
65
+ "x-parser": "gemma4-tool-call"
66
+ },
67
+ "name": {
68
+ "type": "string"
69
+ }
70
+ },
71
+ "type": "object",
72
+ "x-regex": "call\\:(?P<name>\\w+)(?P<arguments>\\{.*\\})"
73
+ },
74
+ "type": {
75
+ "const": "function"
76
+ }
77
+ },
78
+ "type": "object"
79
+ },
80
+ "type": "array",
81
+ "x-regex-iterator": "<\\|tool_call>(.*?)<tool_call\\|>"
82
+ }
83
+ },
84
+ "type": "object",
85
+ "x-regex": "(\\<\\|channel\\>thought\\n(?P<thinking>.*?)\\<channel\\|\\>)?(?P<tool_calls>\\<\\|tool_call\\>.*\\<tool_call\\|\\>)?(?P<content>(?:(?!\\<turn\\|\\>)(?!\\<\\|tool_response\\>).)+)?(?:\\<turn\\|\\>|\\<\\|tool_response\\>)?"
86
+ },
87
+ "soc_token": "<|channel>",
88
+ "sot_token": "<|turn>",
89
+ "stc_token": "<|tool_call>",
90
+ "std_token": "<|tool>",
91
+ "str_token": "<|tool_response>",
92
+ "think_token": "<|think|>",
93
+ "tokenizer_class": "GemmaTokenizer",
94
+ "unk_token": "<unk>"
95
+ }
trainer_state.json ADDED
@@ -0,0 +1,862 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "best_global_step": null,
3
+ "best_metric": null,
4
+ "best_model_checkpoint": null,
5
+ "epoch": 3.0,
6
+ "eval_steps": 500,
7
+ "global_step": 1170,
8
+ "is_hyper_param_search": false,
9
+ "is_local_process_zero": true,
10
+ "is_world_process_zero": true,
11
+ "log_history": [
12
+ {
13
+ "epoch": 0.025682182985553772,
14
+ "grad_norm": 41.89281463623047,
15
+ "learning_rate": 5e-05,
16
+ "loss": 41.062384033203124,
17
+ "step": 10
18
+ },
19
+ {
20
+ "epoch": 0.051364365971107544,
21
+ "grad_norm": 19.25795555114746,
22
+ "learning_rate": 0.00010555555555555557,
23
+ "loss": 23.114556884765626,
24
+ "step": 20
25
+ },
26
+ {
27
+ "epoch": 0.07704654895666131,
28
+ "grad_norm": 15.17092514038086,
29
+ "learning_rate": 0.0001611111111111111,
30
+ "loss": 19.892947387695312,
31
+ "step": 30
32
+ },
33
+ {
34
+ "epoch": 0.10272873194221509,
35
+ "grad_norm": 12.6154146194458,
36
+ "learning_rate": 0.0001999965463076377,
37
+ "loss": 18.954251098632813,
38
+ "step": 40
39
+ },
40
+ {
41
+ "epoch": 0.12841091492776885,
42
+ "grad_norm": 13.695039749145508,
43
+ "learning_rate": 0.00019993515396856082,
44
+ "loss": 18.60211944580078,
45
+ "step": 50
46
+ },
47
+ {
48
+ "epoch": 0.15409309791332262,
49
+ "grad_norm": 13.03475570678711,
50
+ "learning_rate": 0.00019979706714271113,
51
+ "loss": 17.00911102294922,
52
+ "step": 60
53
+ },
54
+ {
55
+ "epoch": 0.1797752808988764,
56
+ "grad_norm": 12.657264709472656,
57
+ "learning_rate": 0.0001995823918037908,
58
+ "loss": 16.737196350097655,
59
+ "step": 70
60
+ },
61
+ {
62
+ "epoch": 0.20545746388443017,
63
+ "grad_norm": 9.450530052185059,
64
+ "learning_rate": 0.00019929129270278366,
65
+ "loss": 17.192404174804686,
66
+ "step": 80
67
+ },
68
+ {
69
+ "epoch": 0.23113964686998395,
70
+ "grad_norm": 11.038394927978516,
71
+ "learning_rate": 0.0001989239932415185,
72
+ "loss": 17.626002502441406,
73
+ "step": 90
74
+ },
75
+ {
76
+ "epoch": 0.2568218298555377,
77
+ "grad_norm": 10.112431526184082,
78
+ "learning_rate": 0.00019848077530122083,
79
+ "loss": 17.375982666015624,
80
+ "step": 100
81
+ },
82
+ {
83
+ "epoch": 0.2825040128410915,
84
+ "grad_norm": 10.951290130615234,
85
+ "learning_rate": 0.0001979619790261853,
86
+ "loss": 16.52630157470703,
87
+ "step": 110
88
+ },
89
+ {
90
+ "epoch": 0.30818619582664525,
91
+ "grad_norm": 13.097779273986816,
92
+ "learning_rate": 0.00019736800256273457,
93
+ "loss": 18.660000610351563,
94
+ "step": 120
95
+ },
96
+ {
97
+ "epoch": 0.33386837881219905,
98
+ "grad_norm": 10.489601135253906,
99
+ "learning_rate": 0.00019669930175366472,
100
+ "loss": 18.038082885742188,
101
+ "step": 130
102
+ },
103
+ {
104
+ "epoch": 0.3595505617977528,
105
+ "grad_norm": 12.037221908569336,
106
+ "learning_rate": 0.0001959563897884124,
107
+ "loss": 16.976956176757813,
108
+ "step": 140
109
+ },
110
+ {
111
+ "epoch": 0.3852327447833066,
112
+ "grad_norm": 13.149781227111816,
113
+ "learning_rate": 0.00019513983680921108,
114
+ "loss": 17.55983123779297,
115
+ "step": 150
116
+ },
117
+ {
118
+ "epoch": 0.41091492776886035,
119
+ "grad_norm": 12.796188354492188,
120
+ "learning_rate": 0.00019425026947353992,
121
+ "loss": 17.86356201171875,
122
+ "step": 160
123
+ },
124
+ {
125
+ "epoch": 0.43659711075441415,
126
+ "grad_norm": 14.400433540344238,
127
+ "learning_rate": 0.0001932883704732001,
128
+ "loss": 17.5066162109375,
129
+ "step": 170
130
+ },
131
+ {
132
+ "epoch": 0.4622792937399679,
133
+ "grad_norm": 16.03416633605957,
134
+ "learning_rate": 0.00019225487801038788,
135
+ "loss": 18.057034301757813,
136
+ "step": 180
137
+ },
138
+ {
139
+ "epoch": 0.48796147672552165,
140
+ "grad_norm": 11.53113842010498,
141
+ "learning_rate": 0.00019115058523116733,
142
+ "loss": 16.775308227539064,
143
+ "step": 190
144
+ },
145
+ {
146
+ "epoch": 0.5136436597110754,
147
+ "grad_norm": 12.079059600830078,
148
+ "learning_rate": 0.00018997633961677582,
149
+ "loss": 17.352998352050783,
150
+ "step": 200
151
+ },
152
+ {
153
+ "epoch": 0.5393258426966292,
154
+ "grad_norm": 11.70594310760498,
155
+ "learning_rate": 0.00018873304233323122,
156
+ "loss": 18.17870635986328,
157
+ "step": 210
158
+ },
159
+ {
160
+ "epoch": 0.565008025682183,
161
+ "grad_norm": 13.156736373901367,
162
+ "learning_rate": 0.00018742164753973855,
163
+ "loss": 16.766079711914063,
164
+ "step": 220
165
+ },
166
+ {
167
+ "epoch": 0.5906902086677368,
168
+ "grad_norm": 11.801448822021484,
169
+ "learning_rate": 0.0001860431616564278,
170
+ "loss": 17.98457489013672,
171
+ "step": 230
172
+ },
173
+ {
174
+ "epoch": 0.6163723916532905,
175
+ "grad_norm": 11.434338569641113,
176
+ "learning_rate": 0.0001845986425919841,
177
+ "loss": 17.011558532714844,
178
+ "step": 240
179
+ },
180
+ {
181
+ "epoch": 0.6420545746388443,
182
+ "grad_norm": 11.592355728149414,
183
+ "learning_rate": 0.00018308919893176396,
184
+ "loss": 18.073362731933592,
185
+ "step": 250
186
+ },
187
+ {
188
+ "epoch": 0.6677367576243981,
189
+ "grad_norm": 12.140549659729004,
190
+ "learning_rate": 0.00018151598908701947,
191
+ "loss": 17.267474365234374,
192
+ "step": 260
193
+ },
194
+ {
195
+ "epoch": 0.6934189406099518,
196
+ "grad_norm": 12.207261085510254,
197
+ "learning_rate": 0.00017988022040588416,
198
+ "loss": 17.970367431640625,
199
+ "step": 270
200
+ },
201
+ {
202
+ "epoch": 0.7191011235955056,
203
+ "grad_norm": 12.024649620056152,
204
+ "learning_rate": 0.000178183148246803,
205
+ "loss": 18.595138549804688,
206
+ "step": 280
207
+ },
208
+ {
209
+ "epoch": 0.7447833065810594,
210
+ "grad_norm": 10.953705787658691,
211
+ "learning_rate": 0.0001764260750151167,
212
+ "loss": 17.661044311523437,
213
+ "step": 290
214
+ },
215
+ {
216
+ "epoch": 0.7704654895666132,
217
+ "grad_norm": 12.161933898925781,
218
+ "learning_rate": 0.0001746103491635407,
219
+ "loss": 17.323040771484376,
220
+ "step": 300
221
+ },
222
+ {
223
+ "epoch": 0.7961476725521669,
224
+ "grad_norm": 14.1478271484375,
225
+ "learning_rate": 0.00017273736415730488,
226
+ "loss": 17.589439392089844,
227
+ "step": 310
228
+ },
229
+ {
230
+ "epoch": 0.8218298555377207,
231
+ "grad_norm": 11.344724655151367,
232
+ "learning_rate": 0.0001708085574047494,
233
+ "loss": 18.087261962890626,
234
+ "step": 320
235
+ },
236
+ {
237
+ "epoch": 0.8475120385232745,
238
+ "grad_norm": 11.49538803100586,
239
+ "learning_rate": 0.00016882540915419623,
240
+ "loss": 16.709014892578125,
241
+ "step": 330
242
+ },
243
+ {
244
+ "epoch": 0.8731942215088283,
245
+ "grad_norm": 11.975954055786133,
246
+ "learning_rate": 0.00016678944135794374,
247
+ "loss": 18.784584045410156,
248
+ "step": 340
249
+ },
250
+ {
251
+ "epoch": 0.898876404494382,
252
+ "grad_norm": 10.979130744934082,
253
+ "learning_rate": 0.00016470221650425582,
254
+ "loss": 17.035598754882812,
255
+ "step": 350
256
+ },
257
+ {
258
+ "epoch": 0.9245585874799358,
259
+ "grad_norm": 10.86307144165039,
260
+ "learning_rate": 0.00016256533641824177,
261
+ "loss": 17.912120056152343,
262
+ "step": 360
263
+ },
264
+ {
265
+ "epoch": 0.9502407704654896,
266
+ "grad_norm": 11.18655776977539,
267
+ "learning_rate": 0.00016038044103254775,
268
+ "loss": 16.4998046875,
269
+ "step": 370
270
+ },
271
+ {
272
+ "epoch": 0.9759229534510433,
273
+ "grad_norm": 11.321154594421387,
274
+ "learning_rate": 0.00015814920712880267,
275
+ "loss": 17.580471801757813,
276
+ "step": 380
277
+ },
278
+ {
279
+ "epoch": 1.0,
280
+ "grad_norm": 6.271613597869873,
281
+ "learning_rate": 0.0001558733470507847,
282
+ "loss": 15.328689575195312,
283
+ "step": 390
284
+ },
285
+ {
286
+ "epoch": 1.0256821829855538,
287
+ "grad_norm": 13.181764602661133,
288
+ "learning_rate": 0.00015355460739029586,
289
+ "loss": 15.265965270996094,
290
+ "step": 400
291
+ },
292
+ {
293
+ "epoch": 1.0513643659711076,
294
+ "grad_norm": 11.904574394226074,
295
+ "learning_rate": 0.00015119476764675305,
296
+ "loss": 14.833596801757812,
297
+ "step": 410
298
+ },
299
+ {
300
+ "epoch": 1.0770465489566614,
301
+ "grad_norm": 12.352519989013672,
302
+ "learning_rate": 0.0001487956388615247,
303
+ "loss": 13.833314514160156,
304
+ "step": 420
305
+ },
306
+ {
307
+ "epoch": 1.102728731942215,
308
+ "grad_norm": 15.478001594543457,
309
+ "learning_rate": 0.00014635906222806058,
310
+ "loss": 15.767561340332032,
311
+ "step": 430
312
+ },
313
+ {
314
+ "epoch": 1.1284109149277688,
315
+ "grad_norm": 15.335821151733398,
316
+ "learning_rate": 0.00014388690767888154,
317
+ "loss": 14.462684631347656,
318
+ "step": 440
319
+ },
320
+ {
321
+ "epoch": 1.1540930979133226,
322
+ "grad_norm": 16.043373107910156,
323
+ "learning_rate": 0.00014138107245051392,
324
+ "loss": 15.908651733398438,
325
+ "step": 450
326
+ },
327
+ {
328
+ "epoch": 1.1797752808988764,
329
+ "grad_norm": 12.852351188659668,
330
+ "learning_rate": 0.00013884347962746948,
331
+ "loss": 15.758128356933593,
332
+ "step": 460
333
+ },
334
+ {
335
+ "epoch": 1.2054574638844302,
336
+ "grad_norm": 15.320487022399902,
337
+ "learning_rate": 0.00013627607666638858,
338
+ "loss": 15.051063537597656,
339
+ "step": 470
340
+ },
341
+ {
342
+ "epoch": 1.231139646869984,
343
+ "grad_norm": 16.30264663696289,
344
+ "learning_rate": 0.00013368083390147913,
345
+ "loss": 14.599794006347656,
346
+ "step": 480
347
+ },
348
+ {
349
+ "epoch": 1.2568218298555376,
350
+ "grad_norm": 13.734286308288574,
351
+ "learning_rate": 0.00013105974303239838,
352
+ "loss": 14.052903747558593,
353
+ "step": 490
354
+ },
355
+ {
356
+ "epoch": 1.2825040128410916,
357
+ "grad_norm": 15.704442024230957,
358
+ "learning_rate": 0.0001284148155957379,
359
+ "loss": 13.530386352539063,
360
+ "step": 500
361
+ },
362
+ {
363
+ "epoch": 1.3081861958266452,
364
+ "grad_norm": 14.808961868286133,
365
+ "learning_rate": 0.00012574808142128477,
366
+ "loss": 14.240873718261719,
367
+ "step": 510
368
+ },
369
+ {
370
+ "epoch": 1.333868378812199,
371
+ "grad_norm": 13.389018058776855,
372
+ "learning_rate": 0.00012306158707424403,
373
+ "loss": 14.467668151855468,
374
+ "step": 520
375
+ },
376
+ {
377
+ "epoch": 1.3595505617977528,
378
+ "grad_norm": 16.077634811401367,
379
+ "learning_rate": 0.00012035739428461739,
380
+ "loss": 13.67303466796875,
381
+ "step": 530
382
+ },
383
+ {
384
+ "epoch": 1.3852327447833066,
385
+ "grad_norm": 14.300647735595703,
386
+ "learning_rate": 0.00011763757836494403,
387
+ "loss": 15.072747802734375,
388
+ "step": 540
389
+ },
390
+ {
391
+ "epoch": 1.4109149277688604,
392
+ "grad_norm": 12.59499454498291,
393
+ "learning_rate": 0.00011490422661761744,
394
+ "loss": 14.030448913574219,
395
+ "step": 550
396
+ },
397
+ {
398
+ "epoch": 1.4365971107544142,
399
+ "grad_norm": 17.819976806640625,
400
+ "learning_rate": 0.00011215943673300093,
401
+ "loss": 13.556326293945313,
402
+ "step": 560
403
+ },
404
+ {
405
+ "epoch": 1.462279293739968,
406
+ "grad_norm": 15.307595252990723,
407
+ "learning_rate": 0.00010940531517957073,
408
+ "loss": 15.464706420898438,
409
+ "step": 570
410
+ },
411
+ {
412
+ "epoch": 1.4879614767255216,
413
+ "grad_norm": 14.918625831604004,
414
+ "learning_rate": 0.00010664397558732244,
415
+ "loss": 14.040945434570313,
416
+ "step": 580
417
+ },
418
+ {
419
+ "epoch": 1.5136436597110754,
420
+ "grad_norm": 13.040802955627441,
421
+ "learning_rate": 0.0001038775371256817,
422
+ "loss": 14.671842956542969,
423
+ "step": 590
424
+ },
425
+ {
426
+ "epoch": 1.5393258426966292,
427
+ "grad_norm": 14.143183708190918,
428
+ "learning_rate": 0.00010110812287716327,
429
+ "loss": 14.198591613769532,
430
+ "step": 600
431
+ },
432
+ {
433
+ "epoch": 1.565008025682183,
434
+ "grad_norm": 16.373567581176758,
435
+ "learning_rate": 9.833785820802739e-05,
436
+ "loss": 14.670704650878907,
437
+ "step": 610
438
+ },
439
+ {
440
+ "epoch": 1.5906902086677368,
441
+ "grad_norm": 17.14202117919922,
442
+ "learning_rate": 9.556886913718317e-05,
443
+ "loss": 14.713813781738281,
444
+ "step": 620
445
+ },
446
+ {
447
+ "epoch": 1.6163723916532904,
448
+ "grad_norm": 13.428468704223633,
449
+ "learning_rate": 9.280328070459135e-05,
450
+ "loss": 14.256681823730469,
451
+ "step": 630
452
+ },
453
+ {
454
+ "epoch": 1.6420545746388444,
455
+ "grad_norm": 14.759570121765137,
456
+ "learning_rate": 9.004321534041835e-05,
457
+ "loss": 14.875436401367187,
458
+ "step": 640
459
+ },
460
+ {
461
+ "epoch": 1.667736757624398,
462
+ "grad_norm": 14.605072021484375,
463
+ "learning_rate": 8.729079123619286e-05,
464
+ "loss": 14.151382446289062,
465
+ "step": 650
466
+ },
467
+ {
468
+ "epoch": 1.6934189406099518,
469
+ "grad_norm": 15.300275802612305,
470
+ "learning_rate": 8.454812071921596e-05,
471
+ "loss": 14.209205627441406,
472
+ "step": 660
473
+ },
474
+ {
475
+ "epoch": 1.7191011235955056,
476
+ "grad_norm": 15.565445899963379,
477
+ "learning_rate": 8.181730863147093e-05,
478
+ "loss": 15.246949768066406,
479
+ "step": 670
480
+ },
481
+ {
482
+ "epoch": 1.7447833065810594,
483
+ "grad_norm": 13.463878631591797,
484
+ "learning_rate": 7.910045071427829e-05,
485
+ "loss": 14.081675720214843,
486
+ "step": 680
487
+ },
488
+ {
489
+ "epoch": 1.7704654895666132,
490
+ "grad_norm": 15.799697875976562,
491
+ "learning_rate": 7.63996319999347e-05,
492
+ "loss": 14.659947204589844,
493
+ "step": 690
494
+ },
495
+ {
496
+ "epoch": 1.7961476725521668,
497
+ "grad_norm": 15.421786308288574,
498
+ "learning_rate": 7.371692521157048e-05,
499
+ "loss": 13.642781066894532,
500
+ "step": 700
501
+ },
502
+ {
503
+ "epoch": 1.8218298555377208,
504
+ "grad_norm": 16.400951385498047,
505
+ "learning_rate": 7.10543891724537e-05,
506
+ "loss": 14.9028076171875,
507
+ "step": 710
508
+ },
509
+ {
510
+ "epoch": 1.8475120385232744,
511
+ "grad_norm": 16.549213409423828,
512
+ "learning_rate": 6.841406722596191e-05,
513
+ "loss": 14.747923278808594,
514
+ "step": 720
515
+ },
516
+ {
517
+ "epoch": 1.8731942215088284,
518
+ "grad_norm": 14.868678092956543,
519
+ "learning_rate": 6.579798566743314e-05,
520
+ "loss": 14.379522705078125,
521
+ "step": 730
522
+ },
523
+ {
524
+ "epoch": 1.898876404494382,
525
+ "grad_norm": 17.223217010498047,
526
+ "learning_rate": 6.320815218910101e-05,
527
+ "loss": 14.513031005859375,
528
+ "step": 740
529
+ },
530
+ {
531
+ "epoch": 1.9245585874799358,
532
+ "grad_norm": 15.178607940673828,
533
+ "learning_rate": 6.064655433930624e-05,
534
+ "loss": 14.75238494873047,
535
+ "step": 750
536
+ },
537
+ {
538
+ "epoch": 1.9502407704654896,
539
+ "grad_norm": 14.607568740844727,
540
+ "learning_rate": 5.8115157997167536e-05,
541
+ "loss": 14.215359497070313,
542
+ "step": 760
543
+ },
544
+ {
545
+ "epoch": 1.9759229534510432,
546
+ "grad_norm": 15.849374771118164,
547
+ "learning_rate": 5.561590586388221e-05,
548
+ "loss": 13.844842529296875,
549
+ "step": 770
550
+ },
551
+ {
552
+ "epoch": 2.0,
553
+ "grad_norm": 8.945368766784668,
554
+ "learning_rate": 5.315071597181504e-05,
555
+ "loss": 13.500308227539062,
556
+ "step": 780
557
+ },
558
+ {
559
+ "epoch": 2.0256821829855536,
560
+ "grad_norm": 14.872233390808105,
561
+ "learning_rate": 5.072148021251821e-05,
562
+ "loss": 10.96926498413086,
563
+ "step": 790
564
+ },
565
+ {
566
+ "epoch": 2.0513643659711076,
567
+ "grad_norm": 16.96340560913086,
568
+ "learning_rate": 4.833006288481371e-05,
569
+ "loss": 10.601210021972657,
570
+ "step": 800
571
+ },
572
+ {
573
+ "epoch": 2.077046548956661,
574
+ "grad_norm": 17.987470626831055,
575
+ "learning_rate": 4.597829926405075e-05,
576
+ "loss": 11.348848724365235,
577
+ "step": 810
578
+ },
579
+ {
580
+ "epoch": 2.102728731942215,
581
+ "grad_norm": 18.53151512145996,
582
+ "learning_rate": 4.3667994193637796e-05,
583
+ "loss": 10.507981109619141,
584
+ "step": 820
585
+ },
586
+ {
587
+ "epoch": 2.128410914927769,
588
+ "grad_norm": 23.948802947998047,
589
+ "learning_rate": 4.140092069992867e-05,
590
+ "loss": 11.215933227539063,
591
+ "step": 830
592
+ },
593
+ {
594
+ "epoch": 2.154093097913323,
595
+ "grad_norm": 21.3605899810791,
596
+ "learning_rate": 3.91788186315269e-05,
597
+ "loss": 10.393006896972656,
598
+ "step": 840
599
+ },
600
+ {
601
+ "epoch": 2.1797752808988764,
602
+ "grad_norm": 26.548303604125977,
603
+ "learning_rate": 3.7003393324051874e-05,
604
+ "loss": 10.384098052978516,
605
+ "step": 850
606
+ },
607
+ {
608
+ "epoch": 2.20545746388443,
609
+ "grad_norm": 23.979907989501953,
610
+ "learning_rate": 3.487631429139183e-05,
611
+ "loss": 10.094139862060548,
612
+ "step": 860
613
+ },
614
+ {
615
+ "epoch": 2.231139646869984,
616
+ "grad_norm": 18.00309944152832,
617
+ "learning_rate": 3.279921394444776e-05,
618
+ "loss": 10.624467468261718,
619
+ "step": 870
620
+ },
621
+ {
622
+ "epoch": 2.2568218298555376,
623
+ "grad_norm": 24.657569885253906,
624
+ "learning_rate": 3.077368633835205e-05,
625
+ "loss": 10.711078643798828,
626
+ "step": 880
627
+ },
628
+ {
629
+ "epoch": 2.2825040128410916,
630
+ "grad_norm": 22.551204681396484,
631
+ "learning_rate": 2.8801285949122593e-05,
632
+ "loss": 10.329103088378906,
633
+ "step": 890
634
+ },
635
+ {
636
+ "epoch": 2.308186195826645,
637
+ "grad_norm": 26.180330276489258,
638
+ "learning_rate": 2.688352648069198e-05,
639
+ "loss": 9.944695281982423,
640
+ "step": 900
641
+ },
642
+ {
643
+ "epoch": 2.333868378812199,
644
+ "grad_norm": 34.10860061645508,
645
+ "learning_rate": 2.502187970322657e-05,
646
+ "loss": 10.465196990966797,
647
+ "step": 910
648
+ },
649
+ {
650
+ "epoch": 2.359550561797753,
651
+ "grad_norm": 25.76052474975586,
652
+ "learning_rate": 2.321777432362764e-05,
653
+ "loss": 10.892754364013673,
654
+ "step": 920
655
+ },
656
+ {
657
+ "epoch": 2.3852327447833064,
658
+ "grad_norm": 25.387516021728516,
659
+ "learning_rate": 2.1472594889080756e-05,
660
+ "loss": 10.628679656982422,
661
+ "step": 930
662
+ },
663
+ {
664
+ "epoch": 2.4109149277688604,
665
+ "grad_norm": 22.750202178955078,
666
+ "learning_rate": 1.9787680724495617e-05,
667
+ "loss": 10.46801986694336,
668
+ "step": 940
669
+ },
670
+ {
671
+ "epoch": 2.436597110754414,
672
+ "grad_norm": 29.39733123779297,
673
+ "learning_rate": 1.8164324904650965e-05,
674
+ "loss": 11.32564697265625,
675
+ "step": 950
676
+ },
677
+ {
678
+ "epoch": 2.462279293739968,
679
+ "grad_norm": 23.36044692993164,
680
+ "learning_rate": 1.660377326183412e-05,
681
+ "loss": 10.110736083984374,
682
+ "step": 960
683
+ },
684
+ {
685
+ "epoch": 2.4879614767255216,
686
+ "grad_norm": 22.447628021240234,
687
+ "learning_rate": 1.5107223429736272e-05,
688
+ "loss": 10.582487487792969,
689
+ "step": 970
690
+ },
691
+ {
692
+ "epoch": 2.513643659711075,
693
+ "grad_norm": 21.37103271484375,
694
+ "learning_rate": 1.3675823924337506e-05,
695
+ "loss": 9.974002838134766,
696
+ "step": 980
697
+ },
698
+ {
699
+ "epoch": 2.539325842696629,
700
+ "grad_norm": 23.942567825317383,
701
+ "learning_rate": 1.2310673262486705e-05,
702
+ "loss": 10.401480102539063,
703
+ "step": 990
704
+ },
705
+ {
706
+ "epoch": 2.5650080256821832,
707
+ "grad_norm": 23.232219696044922,
708
+ "learning_rate": 1.1012819118853147e-05,
709
+ "loss": 10.403594970703125,
710
+ "step": 1000
711
+ },
712
+ {
713
+ "epoch": 2.590690208667737,
714
+ "grad_norm": 26.392332077026367,
715
+ "learning_rate": 9.783257521896227e-06,
716
+ "loss": 11.170610046386718,
717
+ "step": 1010
718
+ },
719
+ {
720
+ "epoch": 2.6163723916532904,
721
+ "grad_norm": 22.941421508789062,
722
+ "learning_rate": 8.62293208947107e-06,
723
+ "loss": 9.754792785644531,
724
+ "step": 1020
725
+ },
726
+ {
727
+ "epoch": 2.6420545746388444,
728
+ "grad_norm": 31.97374153137207,
729
+ "learning_rate": 7.532733304655848e-06,
730
+ "loss": 10.895748138427734,
731
+ "step": 1030
732
+ },
733
+ {
734
+ "epoch": 2.667736757624398,
735
+ "grad_norm": 22.15962791442871,
736
+ "learning_rate": 6.5134978323574066e-06,
737
+ "loss": 10.086806488037109,
738
+ "step": 1040
739
+ },
740
+ {
741
+ "epoch": 2.693418940609952,
742
+ "grad_norm": 23.121976852416992,
743
+ "learning_rate": 5.566007877218882e-06,
744
+ "loss": 9.806757354736328,
745
+ "step": 1050
746
+ },
747
+ {
748
+ "epoch": 2.7191011235955056,
749
+ "grad_norm": 22.89173698425293,
750
+ "learning_rate": 4.6909905833226966e-06,
751
+ "loss": 9.998442077636719,
752
+ "step": 1060
753
+ },
754
+ {
755
+ "epoch": 2.744783306581059,
756
+ "grad_norm": 28.151565551757812,
757
+ "learning_rate": 3.8891174761491735e-06,
758
+ "loss": 10.232617950439453,
759
+ "step": 1070
760
+ },
761
+ {
762
+ "epoch": 2.770465489566613,
763
+ "grad_norm": 27.67608070373535,
764
+ "learning_rate": 3.161003947219421e-06,
765
+ "loss": 11.071966552734375,
766
+ "step": 1080
767
+ },
768
+ {
769
+ "epoch": 2.796147672552167,
770
+ "grad_norm": 26.361303329467773,
771
+ "learning_rate": 2.5072087818176382e-06,
772
+ "loss": 11.048786926269532,
773
+ "step": 1090
774
+ },
775
+ {
776
+ "epoch": 2.821829855537721,
777
+ "grad_norm": 24.121335983276367,
778
+ "learning_rate": 1.928233730155604e-06,
779
+ "loss": 10.665110015869141,
780
+ "step": 1100
781
+ },
782
+ {
783
+ "epoch": 2.8475120385232744,
784
+ "grad_norm": 26.22313117980957,
785
+ "learning_rate": 1.4245231223081301e-06,
786
+ "loss": 11.044110107421876,
787
+ "step": 1110
788
+ },
789
+ {
790
+ "epoch": 2.8731942215088284,
791
+ "grad_norm": 26.499591827392578,
792
+ "learning_rate": 9.964635272153633e-07,
793
+ "loss": 10.147935485839843,
794
+ "step": 1120
795
+ },
796
+ {
797
+ "epoch": 2.898876404494382,
798
+ "grad_norm": 23.90386390686035,
799
+ "learning_rate": 6.443834560132534e-07,
800
+ "loss": 10.296646881103516,
801
+ "step": 1130
802
+ },
803
+ {
804
+ "epoch": 2.924558587479936,
805
+ "grad_norm": 24.210269927978516,
806
+ "learning_rate": 3.685531099202111e-07,
807
+ "loss": 10.592522430419923,
808
+ "step": 1140
809
+ },
810
+ {
811
+ "epoch": 2.9502407704654896,
812
+ "grad_norm": 28.532909393310547,
813
+ "learning_rate": 1.6918417287318245e-07,
814
+ "loss": 9.853338623046875,
815
+ "step": 1150
816
+ },
817
+ {
818
+ "epoch": 2.975922953451043,
819
+ "grad_norm": 27.218006134033203,
820
+ "learning_rate": 4.642964907235481e-08,
821
+ "loss": 10.533452606201172,
822
+ "step": 1160
823
+ },
824
+ {
825
+ "epoch": 3.0,
826
+ "grad_norm": 14.907074928283691,
827
+ "learning_rate": 3.837455592847761e-10,
828
+ "loss": 9.5769775390625,
829
+ "step": 1170
830
+ },
831
+ {
832
+ "epoch": 3.0,
833
+ "step": 1170,
834
+ "total_flos": 2.709259627140219e+17,
835
+ "train_loss": 14.446504472259782,
836
+ "train_runtime": 5026.5595,
837
+ "train_samples_per_second": 1.859,
838
+ "train_steps_per_second": 0.233
839
+ }
840
+ ],
841
+ "logging_steps": 10,
842
+ "max_steps": 1170,
843
+ "num_input_tokens_seen": 0,
844
+ "num_train_epochs": 3,
845
+ "save_steps": 200,
846
+ "stateful_callbacks": {
847
+ "TrainerControl": {
848
+ "args": {
849
+ "should_epoch_stop": false,
850
+ "should_evaluate": false,
851
+ "should_log": false,
852
+ "should_save": true,
853
+ "should_training_stop": true
854
+ },
855
+ "attributes": {}
856
+ }
857
+ },
858
+ "total_flos": 2.709259627140219e+17,
859
+ "train_batch_size": 1,
860
+ "trial_name": null,
861
+ "trial_params": null
862
+ }
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2be588ff8c8b6fcd8303c648ab8d3bf2aa58bf5fcdc6821e6579f3a61a41feff
3
+ size 6737