bsisduck commited on
Commit
f550822
·
verified ·
1 Parent(s): 71f07c4

Add model card with tags

Browse files
Files changed (1) hide show
  1. README.md +69 -0
README.md ADDED
@@ -0,0 +1,69 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: Qwen/Qwen3-Reranker-8B
3
+ library_name: mlx-embeddings
4
+ tags:
5
+ - mlx
6
+ - reranker
7
+ - text-classification
8
+ - qwen3
9
+ - apple-silicon
10
+ - fp16
11
+ - cross-encoder
12
+ language:
13
+ - multilingual
14
+ license: apache-2.0
15
+ pipeline_tag: text-classification
16
+ datasets:
17
+ - Qwen/Reranker-Multilingual-General-Instruct
18
+ ---
19
+
20
+ # Qwen3-Reranker-8B — MLX fp16
21
+
22
+ [Qwen/Qwen3-Reranker-8B](https://huggingface.co/Qwen/Qwen3-Reranker-8B) converted to MLX format in **float16** precision for Apple Silicon.
23
+
24
+ ## Model Details
25
+
26
+ | Property | Value |
27
+ |---|---|
28
+ | Base model | [Qwen/Qwen3-Reranker-8B](https://huggingface.co/Qwen/Qwen3-Reranker-8B) |
29
+ | Parameters | 8B |
30
+ | Architecture | Qwen3 (decoder-based, cross-encoder) |
31
+ | Precision | float16 |
32
+ | Max context length | 32,768 tokens |
33
+ | Languages | 100+ |
34
+ | Scoring | "yes"/"no" logit comparison |
35
+ | Converted with | [mlx-embeddings](https://github.com/Blaizzy/mlx-embeddings) v0.1.0 |
36
+
37
+ ## Usage
38
+
39
+ ```bash
40
+ pip install mlx-embeddings
41
+ ```
42
+
43
+ ```python
44
+ from mlx_embeddings import load
45
+ import mlx.core as mx
46
+
47
+ model, tokenizer = load("bsisduck/Qwen3-Reranker-8B-fp16-mlx")
48
+
49
+ scores = model.process({
50
+ "instruction": "Given a web search query, retrieve relevant passages that answer the query",
51
+ "query": {"text": "What is MLX?"},
52
+ "documents": [
53
+ {"text": "MLX is Apple's array framework for machine learning on Apple Silicon."},
54
+ {"text": "Python is a programming language."},
55
+ ],
56
+ }, processor=tokenizer)
57
+
58
+ # Higher score = more relevant
59
+ print(scores)
60
+ ```
61
+
62
+ ## Hardware Requirements
63
+
64
+ - Apple Silicon Mac (M1/M2/M3/M4)
65
+ - ~16 GB unified memory
66
+
67
+ ## Original Model
68
+
69
+ See [Qwen/Qwen3-Reranker-8B](https://huggingface.co/Qwen/Qwen3-Reranker-8B) for benchmarks, training details, and full documentation.