Instructions to use SemplificaAI/gliner2-multi-v1-onnx with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- GLiNER
How to use SemplificaAI/gliner2-multi-v1-onnx with GLiNER:
from gliner import GLiNER model = GLiNER.from_pretrained("SemplificaAI/gliner2-multi-v1-onnx") - Notebooks
- Google Colab
- Kaggle
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -22,15 +22,15 @@ The model is specifically exported in a fragmented format (encoder, span_rep, co
|
|
| 22 |
|
| 23 |
The ONNX conversion, combined with the Rust native engine (`ort` binding), allows this model to run extremely fast on both GPUs and edge devices like NPUs.
|
| 24 |
|
| 25 |
-
**Benchmark Task:**
|
| 26 |
-
|
| 27 |
-
| Hardware | Execution Provider | Avg Time / Entity |
|
| 28 |
-
| :--- | :--- | :--- |
|
| 29 |
-
| **NVIDIA RTX 4090** | CUDA (FP16) | **~12.0 ms** 🚀 |
|
| 30 |
-
| **NVIDIA RTX 3090** | CUDA (FP16) | **~11.6 ms** 🚀 |
|
| 31 |
-
| **Qualcomm Snapdragon X Elite** | QNN (NPU Native) | **~22.78 ms** ✨ |
|
| 32 |
-
| **Qualcomm Snapdragon X Elite** | CPU (ARM NEON) | **~28.62 ms** |
|
| 33 |
-
| **AMD Ryzen 9 5900XT** (16-Core) | CPU (x86 AVX2) | **~30.16 ms** 💻 |
|
| 34 |
|
| 35 |
*Note: The NPU matches high-end Desktop CPUs while consuming a fraction of the power!*
|
| 36 |
|
|
|
|
| 22 |
|
| 23 |
The ONNX conversion, combined with the Rust native engine (`ort` binding), allows this model to run extremely fast on both GPUs and edge devices like NPUs.
|
| 24 |
|
| 25 |
+
**Benchmark Task:** Tested on complex text extraction tasks spanning up to 62 classes (metrics normalized per extracted entity to allow cross-device comparison).
|
| 26 |
+
|
| 27 |
+
| Hardware | Execution Provider | Avg Time / Entity |
|
| 28 |
+
| :--- | :--- | :--- |
|
| 29 |
+
| **NVIDIA RTX 4090** | CUDA (FP16) | **~12.0 ms** 🚀 |
|
| 30 |
+
| **NVIDIA RTX 3090** | CUDA (FP16) | **~11.6 ms** 🚀 |
|
| 31 |
+
| **Qualcomm Snapdragon X Elite** | QNN (NPU Native) | **~22.78 ms** ✨ |
|
| 32 |
+
| **Qualcomm Snapdragon X Elite** | CPU (ARM NEON) | **~28.62 ms** |
|
| 33 |
+
| **AMD Ryzen 9 5900XT** (16-Core) | CPU (x86 AVX2) | **~30.16 ms** 💻 |
|
| 34 |
|
| 35 |
*Note: The NPU matches high-end Desktop CPUs while consuming a fraction of the power!*
|
| 36 |
|