Instructions to use mku64/Qwen3-Reranker-0.6B-mlx-8Bit with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use mku64/Qwen3-Reranker-0.6B-mlx-8Bit with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("mku64/Qwen3-Reranker-0.6B-mlx-8Bit")
model = AutoModelForCausalLM.from_pretrained("mku64/Qwen3-Reranker-0.6B-mlx-8Bit")
- MLX
How to use mku64/Qwen3-Reranker-0.6B-mlx-8Bit with MLX:
# Download the model from the Hub
pip install "huggingface_hub[hf_xet]"
huggingface-cli download --local-dir Qwen3-Reranker-0.6B-mlx-8Bit mku64/Qwen3-Reranker-0.6B-mlx-8Bit
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
Upload model.safetensors with huggingface_hub
- model.safetensors +3 -0
model.safetensors ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:95560f503fb3bd1622839727377f172ddda8ccdce4b37a3cf74c9669407b0ce5
+size 633152498
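The three added lines are a Git LFS pointer: the repository stores only the spec version, the SHA-256 digest (`oid`), and the byte size of the real `model.safetensors`. A downloaded copy can be checked against those two values. The following is a minimal sketch, not part of the original commit; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local path in the last lines assumes the `--local-dir` used in the download command above.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    # Cheap check first: a size mismatch means the file cannot match.
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    # Hash in chunks so a multi-hundred-megabyte file is never fully in memory.
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:95560f503fb3bd1622839727377f172ddda8ccdce4b37a3cf74c9669407b0ce5
size 633152498
"""

pointer = parse_lfs_pointer(POINTER_TEXT)

# Assumed local path; adjust to wherever the file was downloaded.
local_file = os.path.join("Qwen3-Reranker-0.6B-mlx-8Bit", "model.safetensors")
if os.path.exists(local_file):
    print("file matches pointer:", matches_pointer(local_file, pointer))
```

Comparing the size before hashing avoids reading roughly 633 MB when the download was obviously truncated.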