Text Ranking
sentence-transformers
Safetensors
multilingual
modernbert
cross-encoder
reranker
Generated from Trainer
dataset_size:9220
loss:BinaryCrossEntropyLoss
Eval Results (legacy)
text-embeddings-inference
Instructions to use davanstrien/fineweb-c-quality-reranker-v2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use davanstrien/fineweb-c-quality-reranker-v2 with sentence-transformers:
from sentence_transformers import CrossEncoder model = CrossEncoder("davanstrien/fineweb-c-quality-reranker-v2") query = "Which planet is known as the Red Planet?" passages = [ "Venus is often called Earth's twin because of its similar size and proximity.", "Mars, known for its reddish appearance, is often referred to as the Red Planet.", "Jupiter, the largest planet in our solar system, has a prominent red spot.", "Saturn, famous for its rings, is sometimes mistaken for the Red Planet." ] scores = model.predict([(query, passage) for passage in passages]) print(scores) - Notebooks
- Google Colab
- Kaggle
Training in progress, step 232
Browse files
README.md
CHANGED
|
@@ -31,25 +31,25 @@ model-index:
|
|
| 31 |
type: fineweb_c_eval
|
| 32 |
metrics:
|
| 33 |
- type: accuracy
|
| 34 |
-
value: 0.
|
| 35 |
name: Accuracy
|
| 36 |
- type: accuracy_threshold
|
| 37 |
-
value: 0.
|
| 38 |
name: Accuracy Threshold
|
| 39 |
- type: f1
|
| 40 |
-
value: 0.
|
| 41 |
name: F1
|
| 42 |
- type: f1_threshold
|
| 43 |
-
value: 0.
|
| 44 |
name: F1 Threshold
|
| 45 |
- type: precision
|
| 46 |
-
value: 0.
|
| 47 |
name: Precision
|
| 48 |
- type: recall
|
| 49 |
-
value: 0.
|
| 50 |
name: Recall
|
| 51 |
- type: average_precision
|
| 52 |
-
value: 0.
|
| 53 |
name: Average Precision
|
| 54 |
---
|
| 55 |
|
|
@@ -110,7 +110,7 @@ pairs = [
|
|
| 110 |
]
|
| 111 |
scores = model.predict(pairs)
|
| 112 |
print(scores)
|
| 113 |
-
# [0.
|
| 114 |
|
| 115 |
# Or rank different texts based on similarity to a single text
|
| 116 |
ranks = model.rank(
|
|
@@ -161,13 +161,13 @@ You can finetune this model on your own dataset.
|
|
| 161 |
|
| 162 |
| Metric | Value |
|
| 163 |
|:----------------------|:-----------|
|
| 164 |
-
| accuracy | 0.
|
| 165 |
-
| accuracy_threshold | 0.
|
| 166 |
-
| f1 | 0.
|
| 167 |
-
| f1_threshold | 0.
|
| 168 |
-
| precision | 0.
|
| 169 |
-
| recall | 0.
|
| 170 |
-
| **average_precision** | **0.
|
| 171 |
|
| 172 |
<!--
|
| 173 |
## Bias, Risks and Limitations
|
|
@@ -391,12 +391,22 @@ You can finetune this model on your own dataset.
|
|
| 391 |
| 0.5606 | 162 | 1.0361 | - | - |
|
| 392 |
| 0.5813 | 168 | 0.9421 | - | - |
|
| 393 |
| 0.6021 | 174 | 0.9503 | 0.8597 | 0.4518 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 394 |
|
| 395 |
|
| 396 |
### Training Time
|
| 397 |
-
- **Training**:
|
| 398 |
-
- **Evaluation**: 1.
|
| 399 |
-
- **Total**:
|
| 400 |
|
| 401 |
### Framework Versions
|
| 402 |
- Python: 3.12.12
|
|
|
|
| 31 |
type: fineweb_c_eval
|
| 32 |
metrics:
|
| 33 |
- type: accuracy
|
| 34 |
+
value: 0.869140625
|
| 35 |
name: Accuracy
|
| 36 |
- type: accuracy_threshold
|
| 37 |
+
value: 0.8164063692092896
|
| 38 |
name: Accuracy Threshold
|
| 39 |
- type: f1
|
| 40 |
+
value: 0.53276955602537
|
| 41 |
name: F1
|
| 42 |
- type: f1_threshold
|
| 43 |
+
value: 0.5794004201889038
|
| 44 |
name: F1 Threshold
|
| 45 |
- type: precision
|
| 46 |
+
value: 0.4117647058823529
|
| 47 |
name: Precision
|
| 48 |
- type: recall
|
| 49 |
+
value: 0.7544910179640718
|
| 50 |
name: Recall
|
| 51 |
- type: average_precision
|
| 52 |
+
value: 0.5406815506036883
|
| 53 |
name: Average Precision
|
| 54 |
---
|
| 55 |
|
|
|
|
| 110 |
]
|
| 111 |
scores = model.predict(pairs)
|
| 112 |
print(scores)
|
| 113 |
+
# [0.0706 0.9546 0.1859 0.1413 0.1883]
|
| 114 |
|
| 115 |
# Or rank different texts based on similarity to a single text
|
| 116 |
ranks = model.rank(
|
|
|
|
| 161 |
|
| 162 |
| Metric | Value |
|
| 163 |
|:----------------------|:-----------|
|
| 164 |
+
| accuracy | 0.8691 |
|
| 165 |
+
| accuracy_threshold | 0.8164 |
|
| 166 |
+
| f1 | 0.5328 |
|
| 167 |
+
| f1_threshold | 0.5794 |
|
| 168 |
+
| precision | 0.4118 |
|
| 169 |
+
| recall | 0.7545 |
|
| 170 |
+
| **average_precision** | **0.5407** |
|
| 171 |
|
| 172 |
<!--
|
| 173 |
## Bias, Risks and Limitations
|
|
|
|
| 391 |
| 0.5606 | 162 | 1.0361 | - | - |
|
| 392 |
| 0.5813 | 168 | 0.9421 | - | - |
|
| 393 |
| 0.6021 | 174 | 0.9503 | 0.8597 | 0.4518 |
|
| 394 |
+
| 0.6228 | 180 | 0.9766 | - | - |
|
| 395 |
+
| 0.6436 | 186 | 1.1067 | - | - |
|
| 396 |
+
| 0.6644 | 192 | 1.0229 | - | - |
|
| 397 |
+
| 0.6851 | 198 | 0.9341 | - | - |
|
| 398 |
+
| 0.7059 | 204 | 0.7538 | - | - |
|
| 399 |
+
| 0.7266 | 210 | 1.1375 | - | - |
|
| 400 |
+
| 0.7474 | 216 | 1.0365 | - | - |
|
| 401 |
+
| 0.7682 | 222 | 0.9019 | - | - |
|
| 402 |
+
| 0.7889 | 228 | 1.0598 | - | - |
|
| 403 |
+
| 0.8028 | 232 | - | 0.8322 | 0.5407 |
|
| 404 |
|
| 405 |
|
| 406 |
### Training Time
|
| 407 |
+
- **Training**: 3.8 minutes
|
| 408 |
+
- **Evaluation**: 1.4 minutes
|
| 409 |
+
- **Total**: 5.2 minutes
|
| 410 |
|
| 411 |
### Framework Versions
|
| 412 |
- Python: 3.12.12
|
eval/CrossEncoderClassificationEvaluator_fineweb_c_eval_results.csv
CHANGED
|
@@ -2,3 +2,4 @@ epoch,steps,Accuracy,Accuracy_Threshold,F1,F1_Threshold,Precision,Recall,Average
|
|
| 2 |
0.20069204152249134,58,0.8369140625,0.8085842,0.29916897506925205,0.39676917,0.1945945945945946,0.6467065868263473,0.22527425150874691
|
| 3 |
0.4013840830449827,116,0.8388671875,0.75488913,0.37500000000000006,0.4951936,0.3132530120481928,0.46706586826347307,0.296686339444048
|
| 4 |
0.6020761245674741,174,0.849609375,0.81698465,0.5108433734939759,0.6361271,0.4274193548387097,0.6347305389221557,0.4518233024149694
|
|
|
|
|
|
| 2 |
0.20069204152249134,58,0.8369140625,0.8085842,0.29916897506925205,0.39676917,0.1945945945945946,0.6467065868263473,0.22527425150874691
|
| 3 |
0.4013840830449827,116,0.8388671875,0.75488913,0.37500000000000006,0.4951936,0.3132530120481928,0.46706586826347307,0.296686339444048
|
| 4 |
0.6020761245674741,174,0.849609375,0.81698465,0.5108433734939759,0.6361271,0.4274193548387097,0.6347305389221557,0.4518233024149694
|
| 5 |
+
0.8027681660899654,232,0.869140625,0.81640637,0.53276955602537,0.5794004,0.4117647058823529,0.7544910179640718,0.5406815506036883
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 1230138348
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ee20c26b299d14b87a0fd1769b8aeae4760ee528ebe2a96f2cf8c2d2e1eba1ec
|
| 3 |
size 1230138348
|