Qianfan-OCR โ GGUF Quantizations
GGUF quantizations of baidu/Qianfan-OCR.
Original model: InternVL Chat architecture with Qwen3 LLM backbone (~4.7B params). Quantized by bssrdf using llama.cpp.
Files
| Filename | Quant | Size |
|---|---|---|
Qianfan-OCR-mmproj-bf16.gguf |
BF16 | ~0.65 GB |
Usage (llama.cpp)
llama-cli -m Qianfan-OCR-q4_k_m.gguf --mmproj Qianfan-OCR-mmproj-bf16.gguf \
--image document.jpg -p "Please OCR this document."
Original Model
See baidu/Qianfan-OCR for full documentation, benchmarks (OmniDocBench 93.12, OCRBench 880), and usage examples.
- Downloads last month
- 127
Hardware compatibility
Log In to add your hardware
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support
Model tree for bssrdf/Qianfan-OCR-gguf
Base model
baidu/Qianfan-OCR