Committed by davanstrien (HF Staff)
Commit c779909 · verified · 1 Parent(s): 881d610

Training in progress, step 348

README.md CHANGED
@@ -31,25 +31,25 @@ model-index:
  type: fineweb_c_eval
  metrics:
  - type: accuracy
- value: 0.8837890625
+ value: 0.90234375
  name: Accuracy
  - type: accuracy_threshold
- value: 0.8104759454727173
+ value: 0.7725241184234619
  name: Accuracy Threshold
  - type: f1
- value: 0.6024844720496895
+ value: 0.6426229508196721
  name: F1
  - type: f1_threshold
- value: 0.7146279811859131
+ value: 0.601359486579895
  name: F1 Threshold
  - type: precision
- value: 0.6258064516129033
+ value: 0.7101449275362319
  name: Precision
  - type: recall
- value: 0.5808383233532934
+ value: 0.5868263473053892
  name: Recall
  - type: average_precision
- value: 0.6180374262832308
+ value: 0.696727465964923
  name: Average Precision
  ---
 
@@ -110,7 +110,7 @@ pairs = [
  ]
  scores = model.predict(pairs)
  print(scores)
- # [0.1112 0.9909 0.0237 0.0368 0.3648]
+ # [1.0987e-03 9.9933e-01 3.3535e-04 1.2066e-03 3.2731e-03]
 
  # Or rank different texts based on similarity to a single text
  ranks = model.rank(
@@ -159,15 +159,15 @@ You can finetune this model on your own dataset.
  * Dataset: `fineweb_c_eval`
  * Evaluated with [<code>CrossEncoderClassificationEvaluator</code>](https://sbert.net/docs/package_reference/cross_encoder/evaluation.html#sentence_transformers.cross_encoder.evaluation.CrossEncoderClassificationEvaluator)
 
- | Metric | Value |
- |:----------------------|:----------|
- | accuracy | 0.8838 |
- | accuracy_threshold | 0.8105 |
- | f1 | 0.6025 |
- | f1_threshold | 0.7146 |
- | precision | 0.6258 |
- | recall | 0.5808 |
- | **average_precision** | **0.618** |
+ | Metric | Value |
+ |:----------------------|:-----------|
+ | accuracy | 0.9023 |
+ | accuracy_threshold | 0.7725 |
+ | f1 | 0.6426 |
+ | f1_threshold | 0.6014 |
+ | precision | 0.7101 |
+ | recall | 0.5868 |
+ | **average_precision** | **0.6967** |
 
  <!--
  ## Bias, Risks and Limitations
@@ -412,12 +412,22 @@ You can finetune this model on your own dataset.
  | 0.9758 | 282 | 0.9496 | - | - |
  | 0.9965 | 288 | 0.8691 | - | - |
  | 1.0035 | 290 | - | 0.7659 | 0.6180 |
+ | 1.0173 | 294 | 0.6524 | - | - |
+ | 1.0381 | 300 | 0.7019 | - | - |
+ | 1.0588 | 306 | 0.4723 | - | - |
+ | 1.0796 | 312 | 0.5579 | - | - |
+ | 1.1003 | 318 | 0.5251 | - | - |
+ | 1.1211 | 324 | 0.6861 | - | - |
+ | 1.1419 | 330 | 0.4643 | - | - |
+ | 1.1626 | 336 | 0.5970 | - | - |
+ | 1.1834 | 342 | 0.4205 | - | - |
+ | 1.2042 | 348 | 0.4822 | 0.9538 | 0.6967 |
 
 
  ### Training Time
- - **Training**: 4.7 minutes
- - **Evaluation**: 1.7 minutes
- - **Total**: 6.4 minutes
+ - **Training**: 5.7 minutes
+ - **Evaluation**: 2.1 minutes
+ - **Total**: 7.8 minutes
 
  ### Framework Versions
  - Python: 3.12.12
eval/CrossEncoderClassificationEvaluator_fineweb_c_eval_results.csv CHANGED
@@ -4,3 +4,4 @@ epoch,steps,Accuracy,Accuracy_Threshold,F1,F1_Threshold,Precision,Recall,Average
  0.6020761245674741,174,0.849609375,0.81698465,0.5108433734939759,0.6361271,0.4274193548387097,0.6347305389221557,0.4518233024149694
  0.8027681660899654,232,0.869140625,0.81640637,0.53276955602537,0.5794004,0.4117647058823529,0.7544910179640718,0.5406815506036883
  1.0034602076124568,290,0.8837890625,0.81047595,0.6024844720496895,0.714628,0.6258064516129033,0.5808383233532934,0.6180374262832308
+ 1.2041522491349481,348,0.90234375,0.7725241,0.6426229508196721,0.6013595,0.7101449275362319,0.5868263473053892,0.696727465964923
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:288034c9bbea2f1fa34be4a1b2b589a29b7d0d208b20816b402946412ab02f8a
+ oid sha256:d043136e355945d57bcbf7816fcb8037742c275d5c0f6e572f1d812297d2493d
  size 1230138348
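
Reviewer note: the step-348 metrics recorded in this commit can be sanity-checked against each other. The sketch below recomputes F1 as the harmonic mean of the precision and recall values from the new CSV row, assuming both were measured at the same decision threshold as the reported F1. The helper name `f1_from` is hypothetical and not part of any library.

```python
# Values copied from the step-348 row added to the evaluation CSV in this commit.
precision = 0.7101449275362319
recall = 0.5868263473053892
reported_f1 = 0.6426229508196721


def f1_from(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (the standard F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


computed_f1 = f1_from(precision, recall)
print(f"computed F1 = {computed_f1:.6f}, reported F1 = {reported_f1:.6f}")
```

If the two values agree, the precision, recall, and F1 entries in the README metadata and the CSV are at least internally consistent for this checkpoint.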