dariofinardi commited on
Commit
df3d8cf
·
verified ·
1 Parent(s): 3582c60

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +9 -9
README.md CHANGED
@@ -22,15 +22,15 @@ The model is specifically exported in a fragmented format (encoder, span_rep, co
22
 
23
  The ONNX conversion, combined with the Rust native engine (`ort` binding), allows this model to run extremely fast on both GPUs and edge devices like NPUs.
24
 
25
- **Benchmark Task:** Extracting 14 targeted entities spanning 62 classes from a complex 4-sentence text.
26
-
27
- | Hardware | Execution Provider | Avg Time / Entity | Avg Total Time |
28
- | :--- | :--- | :--- | :--- |
29
- | **NVIDIA RTX 4090** | CUDA (FP16) | **~12.0 ms** 🚀 | ~168.11 ms |
30
- | **NVIDIA RTX 3090** | CUDA (FP16) | **~11.6 ms** 🚀 | ~162.46 ms |
31
- | **Qualcomm Snapdragon X Elite** | QNN (NPU Native) | **~22.78 ms** ✨ | ~1.16 s (51 entities) |
32
- | **Qualcomm Snapdragon X Elite** | CPU (ARM NEON) | **~28.62 ms** | ~1.45 s (51 entities) |
33
- | **AMD Ryzen 9 5900XT** (16-Core) | CPU (x86 AVX2) | **~30.16 ms** 💻 | ~422.37 ms |
34
 
35
  *Note: The NPU matches high-end Desktop CPUs while consuming a fraction of the power!*
36
 
 
22
 
23
  The ONNX conversion, combined with the Rust native engine (`ort` binding), allows this model to run extremely fast on both GPUs and edge devices like NPUs.
24
 
25
+ **Benchmark Task:** Tested on complex text extraction tasks spanning up to 62 classes (metrics normalized per extracted entity to allow cross-device comparison).
26
+
27
+ | Hardware | Execution Provider | Avg Time / Entity |
28
+ | :--- | :--- | :--- |
29
+ | **NVIDIA RTX 4090** | CUDA (FP16) | **~12.0 ms** 🚀 |
30
+ | **NVIDIA RTX 3090** | CUDA (FP16) | **~11.6 ms** 🚀 |
31
+ | **Qualcomm Snapdragon X Elite** | QNN (NPU Native) | **~22.78 ms** ✨ |
32
+ | **Qualcomm Snapdragon X Elite** | CPU (ARM NEON) | **~28.62 ms** |
33
+ | **AMD Ryzen 9 5900XT** (16-Core) | CPU (x86 AVX2) | **~30.16 ms** 💻 |
34
 
35
  *Note: The NPU matches high-end Desktop CPUs while consuming a fraction of the power!*
36