Jina Reranker v2 Base Multilingual

Run Jina Reranker v2 Base Multilingual, optimized for Qualcomm NPUs, with NexaSDK.

Quickstart

  1. Install NexaSDK and create a free account at sdk.nexa.ai

  2. Activate your device with your access token:

    nexa config set license '<access_token>'
    
  3. Run the model on Qualcomm NPU in one line:

    nexa infer NexaAI/jina-v2-rerank-npu
    

Description

Jina Reranker v2 Base Multilingual is a multilingual cross-encoder model for document reranking. Given a query–document pair, it outputs a relevance score to improve ranking in retrieval systems.

Features

  • Cross-encoder architecture for fine-grained relevance scoring
  • Supports multilingual inputs
  • Context length of up to 1024 tokens; longer inputs are handled with sliding-window chunking
  • Employs flash attention optimizations
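
To illustrate the sliding-window idea from the feature list, here is a minimal Python sketch. It is not the model's actual implementation: the function names, the stride of 512, and the choice to keep the maximum chunk score are all assumptions made for illustration, and `score_fn` stands in for whatever call returns the reranker's relevance score.

```python
from typing import Callable, List, Sequence


def sliding_windows(
    tokens: Sequence[str], max_len: int = 1024, stride: int = 512
) -> List[Sequence[str]]:
    """Split a token sequence into overlapping windows of at most max_len tokens."""
    if max_len <= 0 or stride <= 0:
        raise ValueError("max_len and stride must be positive")
    if stride > max_len:
        raise ValueError("stride must not exceed max_len, or tokens would be skipped")
    if len(tokens) <= max_len:
        return [tokens]  # short input: a single window is enough

    windows = []
    start = 0
    while True:
        windows.append(tokens[start:start + max_len])
        if start + max_len >= len(tokens):
            break  # this window already reaches the end of the input
        start += stride
    return windows


def score_long_document(
    query: str,
    doc_tokens: Sequence[str],
    score_fn: Callable[[str, Sequence[str]], float],
    max_len: int = 1024,
    stride: int = 512,
) -> float:
    """Score each window separately and keep the best score (assumed aggregation)."""
    return max(score_fn(query, w) for w in sliding_windows(doc_tokens, max_len, stride))
```

In a real cross-encoder the query shares the token budget with the document, so the per-window document length would be somewhat smaller than the full context length.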

Use Cases

  • Reranking candidate passages in multilingual search
  • Enhancing retrieval in QA / RAG pipelines
  • Improving semantic relevance in recommendation systems

Inputs & Outputs

  • Input: Query & document (text pair)
  • Output: Scalar relevance score (for ranking)
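
As a rough sketch of how the scalar score is used for ranking, the Python snippet below scores every query-document pair and sorts the candidates by score. The `rerank` helper and `toy_overlap_score` are hypothetical names introduced here; the toy scorer only counts shared words and is a placeholder for the model's score, not a description of how the model works.

```python
from typing import Callable, List, Optional, Tuple


def rerank(
    query: str,
    documents: List[str],
    score_fn: Callable[[str, str], float],
    top_k: Optional[int] = None,
) -> List[Tuple[str, float]]:
    """Score each (query, document) pair and return documents sorted best-first."""
    scored = [(doc, score_fn(query, doc)) for doc in documents]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored if top_k is None else scored[:top_k]


def toy_overlap_score(query: str, document: str) -> float:
    """Placeholder scorer (NOT the model): fraction of query words found in the document."""
    q_words = set(query.lower().split())
    d_words = set(document.lower().split())
    return len(q_words & d_words) / len(q_words) if q_words else 0.0


if __name__ == "__main__":
    candidates = [
        "the cat sat",
        "organic skincare for sensitive skin",
        "skin care tips",
    ]
    for doc, score in rerank("organic skincare", candidates, toy_overlap_score):
        print(f"{score:.2f}  {doc}")
```

In a retrieval or RAG pipeline, `score_fn` would be replaced by a call to the reranker, and `top_k` would limit how many passages are passed on to the next stage.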

License

This model is licensed under CC BY-NC 4.0 and is intended for research and evaluation use. Commercial use requires a separate arrangement.
