This repository contains the Q8_0 GGUF deployment artifact for the NER-to-JSON extraction project.
This artifact is intended for deployment and demo serving.
Chat template
8-bit
Base model