davidi123 commited on
Commit
e799299
·
verified ·
1 Parent(s): 1e42b48

Upload manifest_train.yaml with huggingface_hub

Browse files
Files changed (1) hide show
  1. manifest_train.yaml +595 -0
manifest_train.yaml ADDED
@@ -0,0 +1,595 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ combinations:
2
+ cerebras_cerebras-gpt-2.7b_tokyotech_llm_swallow_code:
3
+ model_name: cerebras/Cerebras-GPT-2.7B
4
+ dataset_name: tokyotech-llm/swallow-code
5
+ path: features/train/cerebras_cerebras-gpt-2.7b_tokyotech_llm_swallow_code
6
+ n_members: 10000
7
+ n_nonmembers: 10000
8
+ feature_dim: 154
9
+ sequence_length: 128
10
+ train_size: 8000
11
+ val_size: 1000
12
+ test_size: 1000
13
+ cerebras_cerebras-gpt-2.7b_codeparrot_codeparrot_clean:
14
+ model_name: cerebras/Cerebras-GPT-2.7B
15
+ dataset_name: codeparrot/codeparrot-clean
16
+ path: features/train/cerebras_cerebras-gpt-2.7b_codeparrot_codeparrot_clean
17
+ n_members: 10000
18
+ n_nonmembers: 10000
19
+ feature_dim: 154
20
+ sequence_length: 128
21
+ train_size: 8000
22
+ val_size: 1000
23
+ test_size: 1000
24
+ cerebras_cerebras-gpt-2.7b_jetbrains_kstack:
25
+ model_name: cerebras/Cerebras-GPT-2.7B
26
+ dataset_name: JetBrains/KStack
27
+ path: features/train/cerebras_cerebras-gpt-2.7b_jetbrains_kstack
28
+ n_members: 10000
29
+ n_nonmembers: 10000
30
+ feature_dim: 154
31
+ sequence_length: 128
32
+ train_size: 8000
33
+ val_size: 1000
34
+ test_size: 1000
35
+ cerebras_cerebras-gpt-2.7b_mvasiliniuc_iva_kotlin_codeint_clean:
36
+ model_name: cerebras/Cerebras-GPT-2.7B
37
+ dataset_name: mvasiliniuc/iva-kotlin-codeint-clean
38
+ path: features/train/cerebras_cerebras-gpt-2.7b_mvasiliniuc_iva_kotlin_codeint_clean
39
+ n_members: 10000
40
+ n_nonmembers: 10000
41
+ feature_dim: 154
42
+ sequence_length: 128
43
+ train_size: 8000
44
+ val_size: 1000
45
+ test_size: 1000
46
+ cerebras_cerebras-gpt-2.7b_nan_do_code_search_net_java:
47
+ model_name: cerebras/Cerebras-GPT-2.7B
48
+ dataset_name: Nan-Do/code-search-net-java
49
+ path: features/train/cerebras_cerebras-gpt-2.7b_nan_do_code_search_net_java
50
+ n_members: 10000
51
+ n_nonmembers: 10000
52
+ feature_dim: 154
53
+ sequence_length: 128
54
+ train_size: 8000
55
+ val_size: 1000
56
+ test_size: 1000
57
+ cerebras_cerebras-gpt-2.7b_bigcode_the_stack_smol_xl:
58
+ model_name: cerebras/Cerebras-GPT-2.7B
59
+ dataset_name: bigcode/the-stack-smol-xl
60
+ path: features/train/cerebras_cerebras-gpt-2.7b_bigcode_the_stack_smol_xl
61
+ n_members: 4000
62
+ n_nonmembers: 4000
63
+ feature_dim: 154
64
+ sequence_length: 128
65
+ train_size: 3200
66
+ val_size: 400
67
+ test_size: 400
68
+ eleutherai_pythia-1.4b_tokyotech_llm_swallow_code:
69
+ model_name: EleutherAI/pythia-1.4b
70
+ dataset_name: tokyotech-llm/swallow-code
71
+ path: features/train/eleutherai_pythia-1.4b_tokyotech_llm_swallow_code
72
+ n_members: 10000
73
+ n_nonmembers: 10000
74
+ feature_dim: 154
75
+ sequence_length: 128
76
+ train_size: 8000
77
+ val_size: 1000
78
+ test_size: 1000
79
+ eleutherai_pythia-1.4b_codeparrot_codeparrot_clean:
80
+ model_name: EleutherAI/pythia-1.4b
81
+ dataset_name: codeparrot/codeparrot-clean
82
+ path: features/train/eleutherai_pythia-1.4b_codeparrot_codeparrot_clean
83
+ n_members: 10000
84
+ n_nonmembers: 10000
85
+ feature_dim: 154
86
+ sequence_length: 128
87
+ train_size: 8000
88
+ val_size: 1000
89
+ test_size: 1000
90
+ eleutherai_pythia-1.4b_jetbrains_kstack:
91
+ model_name: EleutherAI/pythia-1.4b
92
+ dataset_name: JetBrains/KStack
93
+ path: features/train/eleutherai_pythia-1.4b_jetbrains_kstack
94
+ n_members: 10000
95
+ n_nonmembers: 10000
96
+ feature_dim: 154
97
+ sequence_length: 128
98
+ train_size: 8000
99
+ val_size: 1000
100
+ test_size: 1000
101
+ eleutherai_pythia-1.4b_mvasiliniuc_iva_kotlin_codeint_clean:
102
+ model_name: EleutherAI/pythia-1.4b
103
+ dataset_name: mvasiliniuc/iva-kotlin-codeint-clean
104
+ path: features/train/eleutherai_pythia-1.4b_mvasiliniuc_iva_kotlin_codeint_clean
105
+ n_members: 10000
106
+ n_nonmembers: 10000
107
+ feature_dim: 154
108
+ sequence_length: 128
109
+ train_size: 8000
110
+ val_size: 1000
111
+ test_size: 1000
112
+ eleutherai_pythia-1.4b_nan_do_code_search_net_java:
113
+ model_name: EleutherAI/pythia-1.4b
114
+ dataset_name: Nan-Do/code-search-net-java
115
+ path: features/train/eleutherai_pythia-1.4b_nan_do_code_search_net_java
116
+ n_members: 10000
117
+ n_nonmembers: 10000
118
+ feature_dim: 154
119
+ sequence_length: 128
120
+ train_size: 8000
121
+ val_size: 1000
122
+ test_size: 1000
123
+ eleutherai_pythia-1.4b_bigcode_the_stack_smol_xl:
124
+ model_name: EleutherAI/pythia-1.4b
125
+ dataset_name: bigcode/the-stack-smol-xl
126
+ path: features/train/eleutherai_pythia-1.4b_bigcode_the_stack_smol_xl
127
+ n_members: 4000
128
+ n_nonmembers: 4000
129
+ feature_dim: 154
130
+ sequence_length: 128
131
+ train_size: 3600
132
+ val_size: 200
133
+ test_size: 200
134
+ eleutherai_gpt-j-6b_tokyotech_llm_swallow_code:
135
+ model_name: EleutherAI/gpt-j-6b
136
+ dataset_name: tokyotech-llm/swallow-code
137
+ path: features/train/eleutherai_gpt-j-6b_tokyotech_llm_swallow_code
138
+ n_members: 10000
139
+ n_nonmembers: 10000
140
+ feature_dim: 154
141
+ sequence_length: 128
142
+ train_size: 9000
143
+ val_size: 500
144
+ test_size: 500
145
+ eleutherai_gpt-j-6b_codeparrot_codeparrot_clean:
146
+ model_name: EleutherAI/gpt-j-6b
147
+ dataset_name: codeparrot/codeparrot-clean
148
+ path: features/train/eleutherai_gpt-j-6b_codeparrot_codeparrot_clean
149
+ n_members: 10000
150
+ n_nonmembers: 10000
151
+ feature_dim: 154
152
+ sequence_length: 128
153
+ train_size: 9000
154
+ val_size: 500
155
+ test_size: 500
156
+ eleutherai_gpt-j-6b_jetbrains_kstack:
157
+ model_name: EleutherAI/gpt-j-6b
158
+ dataset_name: JetBrains/KStack
159
+ path: features/train/eleutherai_gpt-j-6b_jetbrains_kstack
160
+ n_members: 10000
161
+ n_nonmembers: 10000
162
+ feature_dim: 154
163
+ sequence_length: 128
164
+ train_size: 9000
165
+ val_size: 500
166
+ test_size: 500
167
+ eleutherai_gpt-j-6b_mvasiliniuc_iva_kotlin_codeint_clean:
168
+ model_name: EleutherAI/gpt-j-6b
169
+ dataset_name: mvasiliniuc/iva-kotlin-codeint-clean
170
+ path: features/train/eleutherai_gpt-j-6b_mvasiliniuc_iva_kotlin_codeint_clean
171
+ n_members: 10000
172
+ n_nonmembers: 10000
173
+ feature_dim: 154
174
+ sequence_length: 128
175
+ train_size: 9000
176
+ val_size: 500
177
+ test_size: 500
178
+ eleutherai_gpt-j-6b_nan_do_code_search_net_java:
179
+ model_name: EleutherAI/gpt-j-6b
180
+ dataset_name: Nan-Do/code-search-net-java
181
+ path: features/train/eleutherai_gpt-j-6b_nan_do_code_search_net_java
182
+ n_members: 10000
183
+ n_nonmembers: 10000
184
+ feature_dim: 154
185
+ sequence_length: 128
186
+ train_size: 9000
187
+ val_size: 500
188
+ test_size: 500
189
+ eleutherai_gpt-j-6b_bigcode_the_stack_smol_xl:
190
+ model_name: EleutherAI/gpt-j-6b
191
+ dataset_name: bigcode/the-stack-smol-xl
192
+ path: features/train/eleutherai_gpt-j-6b_bigcode_the_stack_smol_xl
193
+ n_members: 4000
194
+ n_nonmembers: 4000
195
+ feature_dim: 154
196
+ sequence_length: 128
197
+ train_size: 3600
198
+ val_size: 200
199
+ test_size: 200
200
+ google_gemma-2b_tokyotech_llm_swallow_code:
201
+ model_name: google/gemma-2b
202
+ dataset_name: tokyotech-llm/swallow-code
203
+ path: features/train/google_gemma-2b_tokyotech_llm_swallow_code
204
+ n_members: 10000
205
+ n_nonmembers: 10000
206
+ feature_dim: 154
207
+ sequence_length: 128
208
+ train_size: 9000
209
+ val_size: 500
210
+ test_size: 500
211
+ google_gemma-2b_codeparrot_codeparrot_clean:
212
+ model_name: google/gemma-2b
213
+ dataset_name: codeparrot/codeparrot-clean
214
+ path: features/train/google_gemma-2b_codeparrot_codeparrot_clean
215
+ n_members: 10000
216
+ n_nonmembers: 10000
217
+ feature_dim: 154
218
+ sequence_length: 128
219
+ train_size: 9000
220
+ val_size: 500
221
+ test_size: 500
222
+ google_gemma-2b_jetbrains_kstack:
223
+ model_name: google/gemma-2b
224
+ dataset_name: JetBrains/KStack
225
+ path: features/train/google_gemma-2b_jetbrains_kstack
226
+ n_members: 10000
227
+ n_nonmembers: 10000
228
+ feature_dim: 154
229
+ sequence_length: 128
230
+ train_size: 9000
231
+ val_size: 500
232
+ test_size: 500
233
+ google_gemma-2b_mvasiliniuc_iva_kotlin_codeint_clean:
234
+ model_name: google/gemma-2b
235
+ dataset_name: mvasiliniuc/iva-kotlin-codeint-clean
236
+ path: features/train/google_gemma-2b_mvasiliniuc_iva_kotlin_codeint_clean
237
+ n_members: 10000
238
+ n_nonmembers: 10000
239
+ feature_dim: 154
240
+ sequence_length: 128
241
+ train_size: 9000
242
+ val_size: 500
243
+ test_size: 500
244
+ google_gemma-2b_nan_do_code_search_net_java:
245
+ model_name: google/gemma-2b
246
+ dataset_name: Nan-Do/code-search-net-java
247
+ path: features/train/google_gemma-2b_nan_do_code_search_net_java
248
+ n_members: 10000
249
+ n_nonmembers: 10000
250
+ feature_dim: 154
251
+ sequence_length: 128
252
+ train_size: 9000
253
+ val_size: 500
254
+ test_size: 500
255
+ google_gemma-2b_bigcode_the_stack_smol_xl:
256
+ model_name: google/gemma-2b
257
+ dataset_name: bigcode/the-stack-smol-xl
258
+ path: features/train/google_gemma-2b_bigcode_the_stack_smol_xl
259
+ n_members: 4000
260
+ n_nonmembers: 4000
261
+ feature_dim: 154
262
+ sequence_length: 128
263
+ train_size: 3600
264
+ val_size: 200
265
+ test_size: 200
266
+ qwen_qwen2-1.5b_tokyotech_llm_swallow_code:
267
+ model_name: Qwen/Qwen2-1.5B
268
+ dataset_name: tokyotech-llm/swallow-code
269
+ path: features/train/qwen_qwen2-1.5b_tokyotech_llm_swallow_code
270
+ n_members: 10000
271
+ n_nonmembers: 10000
272
+ feature_dim: 154
273
+ sequence_length: 128
274
+ train_size: 9000
275
+ val_size: 500
276
+ test_size: 500
277
+ qwen_qwen2-1.5b_codeparrot_codeparrot_clean:
278
+ model_name: Qwen/Qwen2-1.5B
279
+ dataset_name: codeparrot/codeparrot-clean
280
+ path: features/train/qwen_qwen2-1.5b_codeparrot_codeparrot_clean
281
+ n_members: 10000
282
+ n_nonmembers: 10000
283
+ feature_dim: 154
284
+ sequence_length: 128
285
+ train_size: 9000
286
+ val_size: 500
287
+ test_size: 500
288
+ qwen_qwen2-1.5b_jetbrains_kstack:
289
+ model_name: Qwen/Qwen2-1.5B
290
+ dataset_name: JetBrains/KStack
291
+ path: features/train/qwen_qwen2-1.5b_jetbrains_kstack
292
+ n_members: 10000
293
+ n_nonmembers: 10000
294
+ feature_dim: 154
295
+ sequence_length: 128
296
+ train_size: 9000
297
+ val_size: 500
298
+ test_size: 500
299
+ qwen_qwen2-1.5b_mvasiliniuc_iva_kotlin_codeint_clean:
300
+ model_name: Qwen/Qwen2-1.5B
301
+ dataset_name: mvasiliniuc/iva-kotlin-codeint-clean
302
+ path: features/train/qwen_qwen2-1.5b_mvasiliniuc_iva_kotlin_codeint_clean
303
+ n_members: 10000
304
+ n_nonmembers: 10000
305
+ feature_dim: 154
306
+ sequence_length: 128
307
+ train_size: 9000
308
+ val_size: 500
309
+ test_size: 500
310
+ qwen_qwen2-1.5b_nan_do_code_search_net_java:
311
+ model_name: Qwen/Qwen2-1.5B
312
+ dataset_name: Nan-Do/code-search-net-java
313
+ path: features/train/qwen_qwen2-1.5b_nan_do_code_search_net_java
314
+ n_members: 10000
315
+ n_nonmembers: 10000
316
+ feature_dim: 154
317
+ sequence_length: 128
318
+ train_size: 9000
319
+ val_size: 500
320
+ test_size: 500
321
+ qwen_qwen2-1.5b_bigcode_the_stack_smol_xl:
322
+ model_name: Qwen/Qwen2-1.5B
323
+ dataset_name: bigcode/the-stack-smol-xl
324
+ path: features/train/qwen_qwen2-1.5b_bigcode_the_stack_smol_xl
325
+ n_members: 4000
326
+ n_nonmembers: 4000
327
+ feature_dim: 154
328
+ sequence_length: 128
329
+ train_size: 3600
330
+ val_size: 200
331
+ test_size: 200
332
+ tiiuae_falcon-rw-1b_tokyotech_llm_swallow_code:
333
+ model_name: tiiuae/falcon-rw-1b
334
+ dataset_name: tokyotech-llm/swallow-code
335
+ path: features/train/tiiuae_falcon-rw-1b_tokyotech_llm_swallow_code
336
+ n_members: 10000
337
+ n_nonmembers: 10000
338
+ feature_dim: 154
339
+ sequence_length: 128
340
+ train_size: 9000
341
+ val_size: 500
342
+ test_size: 500
343
+ tiiuae_falcon-rw-1b_codeparrot_codeparrot_clean:
344
+ model_name: tiiuae/falcon-rw-1b
345
+ dataset_name: codeparrot/codeparrot-clean
346
+ path: features/train/tiiuae_falcon-rw-1b_codeparrot_codeparrot_clean
347
+ n_members: 10000
348
+ n_nonmembers: 10000
349
+ feature_dim: 154
350
+ sequence_length: 128
351
+ train_size: 9000
352
+ val_size: 500
353
+ test_size: 500
354
+ tiiuae_falcon-rw-1b_jetbrains_kstack:
355
+ model_name: tiiuae/falcon-rw-1b
356
+ dataset_name: JetBrains/KStack
357
+ path: features/train/tiiuae_falcon-rw-1b_jetbrains_kstack
358
+ n_members: 10000
359
+ n_nonmembers: 10000
360
+ feature_dim: 154
361
+ sequence_length: 128
362
+ train_size: 9000
363
+ val_size: 500
364
+ test_size: 500
365
+ tiiuae_falcon-rw-1b_mvasiliniuc_iva_kotlin_codeint_clean:
366
+ model_name: tiiuae/falcon-rw-1b
367
+ dataset_name: mvasiliniuc/iva-kotlin-codeint-clean
368
+ path: features/train/tiiuae_falcon-rw-1b_mvasiliniuc_iva_kotlin_codeint_clean
369
+ n_members: 10000
370
+ n_nonmembers: 10000
371
+ feature_dim: 154
372
+ sequence_length: 128
373
+ train_size: 9000
374
+ val_size: 500
375
+ test_size: 500
376
+ tiiuae_falcon-rw-1b_nan_do_code_search_net_java:
377
+ model_name: tiiuae/falcon-rw-1b
378
+ dataset_name: Nan-Do/code-search-net-java
379
+ path: features/train/tiiuae_falcon-rw-1b_nan_do_code_search_net_java
380
+ n_members: 10000
381
+ n_nonmembers: 10000
382
+ feature_dim: 154
383
+ sequence_length: 128
384
+ train_size: 9000
385
+ val_size: 500
386
+ test_size: 500
387
+ tiiuae_falcon-rw-1b_bigcode_the_stack_smol_xl:
388
+ model_name: tiiuae/falcon-rw-1b
389
+ dataset_name: bigcode/the-stack-smol-xl
390
+ path: features/train/tiiuae_falcon-rw-1b_bigcode_the_stack_smol_xl
391
+ n_members: 4000
392
+ n_nonmembers: 4000
393
+ feature_dim: 154
394
+ sequence_length: 128
395
+ train_size: 3600
396
+ val_size: 200
397
+ test_size: 200
398
+ distilgpt2_bigcode_the_stack_smol_xl:
399
+ model_name: distilgpt2
400
+ dataset_name: bigcode/the-stack-smol-xl
401
+ path: features/train/distilgpt2_bigcode_the_stack_smol_xl
402
+ n_members: 4000
403
+ n_nonmembers: 4000
404
+ feature_dim: 154
405
+ sequence_length: 128
406
+ train_size: 3200
407
+ val_size: 400
408
+ test_size: 400
409
+ distilgpt2_codeparrot_codeparrot_clean:
410
+ model_name: distilgpt2
411
+ dataset_name: codeparrot/codeparrot-clean
412
+ path: features/train/distilgpt2_codeparrot_codeparrot_clean
413
+ n_members: 10000
414
+ n_nonmembers: 10000
415
+ feature_dim: 154
416
+ sequence_length: 128
417
+ train_size: 8000
418
+ val_size: 1000
419
+ test_size: 1000
420
+ distilgpt2_jetbrains_kstack:
421
+ model_name: distilgpt2
422
+ dataset_name: JetBrains/KStack
423
+ path: features/train/distilgpt2_jetbrains_kstack
424
+ n_members: 10000
425
+ n_nonmembers: 10000
426
+ feature_dim: 154
427
+ sequence_length: 128
428
+ train_size: 8000
429
+ val_size: 1000
430
+ test_size: 1000
431
+ distilgpt2_mvasiliniuc_iva_kotlin_codeint_clean:
432
+ model_name: distilgpt2
433
+ dataset_name: mvasiliniuc/iva-kotlin-codeint-clean
434
+ path: features/train/distilgpt2_mvasiliniuc_iva_kotlin_codeint_clean
435
+ n_members: 10000
436
+ n_nonmembers: 10000
437
+ feature_dim: 154
438
+ sequence_length: 128
439
+ train_size: 8000
440
+ val_size: 1000
441
+ test_size: 1000
442
+ distilgpt2_nan_do_code_search_net_java:
443
+ model_name: distilgpt2
444
+ dataset_name: Nan-Do/code-search-net-java
445
+ path: features/train/distilgpt2_nan_do_code_search_net_java
446
+ n_members: 10000
447
+ n_nonmembers: 10000
448
+ feature_dim: 154
449
+ sequence_length: 128
450
+ train_size: 8000
451
+ val_size: 1000
452
+ test_size: 1000
453
+ distilgpt2_tokyotech_llm_swallow_code:
454
+ model_name: distilgpt2
455
+ dataset_name: tokyotech-llm/swallow-code
456
+ path: features/train/distilgpt2_tokyotech_llm_swallow_code
457
+ n_members: 10000
458
+ n_nonmembers: 10000
459
+ feature_dim: 154
460
+ sequence_length: 128
461
+ train_size: 8000
462
+ val_size: 1000
463
+ test_size: 1000
464
+ gpt2-xl_bigcode_the_stack_smol_xl:
465
+ model_name: gpt2-xl
466
+ dataset_name: bigcode/the-stack-smol-xl
467
+ path: features/train/gpt2-xl_bigcode_the_stack_smol_xl
468
+ n_members: 4000
469
+ n_nonmembers: 4000
470
+ feature_dim: 154
471
+ sequence_length: 128
472
+ train_size: 3200
473
+ val_size: 400
474
+ test_size: 400
475
+ gpt2-xl_codeparrot_codeparrot_clean:
476
+ model_name: gpt2-xl
477
+ dataset_name: codeparrot/codeparrot-clean
478
+ path: features/train/gpt2-xl_codeparrot_codeparrot_clean
479
+ n_members: 10000
480
+ n_nonmembers: 10000
481
+ feature_dim: 154
482
+ sequence_length: 128
483
+ train_size: 8000
484
+ val_size: 1000
485
+ test_size: 1000
486
+ gpt2-xl_jetbrains_kstack:
487
+ model_name: gpt2-xl
488
+ dataset_name: JetBrains/KStack
489
+ path: features/train/gpt2-xl_jetbrains_kstack
490
+ n_members: 10000
491
+ n_nonmembers: 10000
492
+ feature_dim: 154
493
+ sequence_length: 128
494
+ train_size: 8000
495
+ val_size: 1000
496
+ test_size: 1000
497
+ gpt2-xl_mvasiliniuc_iva_kotlin_codeint_clean:
498
+ model_name: gpt2-xl
499
+ dataset_name: mvasiliniuc/iva-kotlin-codeint-clean
500
+ path: features/train/gpt2-xl_mvasiliniuc_iva_kotlin_codeint_clean
501
+ n_members: 10000
502
+ n_nonmembers: 10000
503
+ feature_dim: 154
504
+ sequence_length: 128
505
+ train_size: 8000
506
+ val_size: 1000
507
+ test_size: 1000
508
+ gpt2-xl_nan_do_code_search_net_java:
509
+ model_name: gpt2-xl
510
+ dataset_name: Nan-Do/code-search-net-java
511
+ path: features/train/gpt2-xl_nan_do_code_search_net_java
512
+ n_members: 10000
513
+ n_nonmembers: 10000
514
+ feature_dim: 154
515
+ sequence_length: 128
516
+ train_size: 8000
517
+ val_size: 1000
518
+ test_size: 1000
519
+ gpt2-xl_tokyotech_llm_swallow_code:
520
+ model_name: gpt2-xl
521
+ dataset_name: tokyotech-llm/swallow-code
522
+ path: features/train/gpt2-xl_tokyotech_llm_swallow_code
523
+ n_members: 10000
524
+ n_nonmembers: 10000
525
+ feature_dim: 154
526
+ sequence_length: 128
527
+ train_size: 8000
528
+ val_size: 1000
529
+ test_size: 1000
530
+ tiiuae_falcon-7b_tokyotech_llm_swallow_code:
531
+ model_name: tiiuae/falcon-7b
532
+ dataset_name: tokyotech-llm/swallow-code
533
+ path: features/train/tiiuae_falcon-7b_tokyotech_llm_swallow_code
534
+ n_members: 10000
535
+ n_nonmembers: 10000
536
+ feature_dim: 154
537
+ sequence_length: 128
538
+ train_size: 9000
539
+ val_size: 500
540
+ test_size: 500
541
+ tiiuae_falcon-7b_codeparrot_codeparrot_clean:
542
+ model_name: tiiuae/falcon-7b
543
+ dataset_name: codeparrot/codeparrot-clean
544
+ path: features/train/tiiuae_falcon-7b_codeparrot_codeparrot_clean
545
+ n_members: 10000
546
+ n_nonmembers: 10000
547
+ feature_dim: 154
548
+ sequence_length: 128
549
+ train_size: 9000
550
+ val_size: 500
551
+ test_size: 500
552
+ tiiuae_falcon-7b_jetbrains_kstack:
553
+ model_name: tiiuae/falcon-7b
554
+ dataset_name: JetBrains/KStack
555
+ path: features/train/tiiuae_falcon-7b_jetbrains_kstack
556
+ n_members: 10000
557
+ n_nonmembers: 10000
558
+ feature_dim: 154
559
+ sequence_length: 128
560
+ train_size: 9000
561
+ val_size: 500
562
+ test_size: 500
563
+ tiiuae_falcon-7b_mvasiliniuc_iva_kotlin_codeint_clean:
564
+ model_name: tiiuae/falcon-7b
565
+ dataset_name: mvasiliniuc/iva-kotlin-codeint-clean
566
+ path: features/train/tiiuae_falcon-7b_mvasiliniuc_iva_kotlin_codeint_clean
567
+ n_members: 10000
568
+ n_nonmembers: 10000
569
+ feature_dim: 154
570
+ sequence_length: 128
571
+ train_size: 9000
572
+ val_size: 500
573
+ test_size: 500
574
+ tiiuae_falcon-7b_nan_do_code_search_net_java:
575
+ model_name: tiiuae/falcon-7b
576
+ dataset_name: Nan-Do/code-search-net-java
577
+ path: features/train/tiiuae_falcon-7b_nan_do_code_search_net_java
578
+ n_members: 10000
579
+ n_nonmembers: 10000
580
+ feature_dim: 154
581
+ sequence_length: 128
582
+ train_size: 9000
583
+ val_size: 500
584
+ test_size: 500
585
+ tiiuae_falcon-7b_bigcode_the_stack_smol_xl:
586
+ model_name: tiiuae/falcon-7b
587
+ dataset_name: bigcode/the-stack-smol-xl
588
+ path: features/train/tiiuae_falcon-7b_bigcode_the_stack_smol_xl
589
+ n_members: 4000
590
+ n_nonmembers: 4000
591
+ feature_dim: 154
592
+ sequence_length: 128
593
+ train_size: 3600
594
+ val_size: 200
595
+ test_size: 200