Qianfan-OCR — GGUF Quantizations

GGUF quantizations of baidu/Qianfan-OCR.

Original model: InternVL Chat architecture with Qwen3 LLM backbone (~4.7B params). Quantized by bssrdf using llama.cpp.

Files

Filename	Quant	Size
`Qianfan-OCR-mmproj-bf16.gguf`	BF16	~0.65 GB

llama-cli -m Qianfan-OCR-q4_k_m.gguf --mmproj Qianfan-OCR-mmproj-bf16.gguf \
  --image document.jpg -p "Please OCR this document."

See baidu/Qianfan-OCR for full documentation, benchmarks (OmniDocBench 93.12, OCRBench 880), and usage examples.

GGUF

Hardware compatibility

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Quantized

(5)

this model