File size: 737 Bytes
18fb703 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 | Qwen3-Reranker-4B (OpenVINO IR conversion)
This OpenVINO IR conversion is a derivative of Qwen/Qwen3-Reranker-4B
(https://huggingface.co/Qwen/Qwen3-Reranker-4B), redistributed under the
Apache License, Version 2.0.
Original work:
Copyright 2024-2025 Alibaba Cloud (Qwen Team).
Licensed under the Apache License, Version 2.0.
Modifications by the redistributor of this OpenVINO conversion:
- Exported model weights to OpenVINO IR via optimum-intel.
- Applied NNCF INT8 (asymmetric, weight-only) quantization.
- Generated OpenVINO Tokenizer / Detokenizer IR via openvino-tokenizers.
No model weights were retrained. Tokenizer vocabulary, chat template, and
generation config are preserved verbatim from the upstream release.
|