Nexora-Vector-v0.1 · GGUF
Nexora-Vector-v0.1 GGUF is the official GGUF quantized release of Nexora-Vector-v0.1, published by Open4bits — the official quantization project under ArkAiLabs. Nexora-Vector is an experimental text-to-vector model that generates structured SVG graphics from natural language prompts. This GGUF release enables efficient local inference across a wide range of hardware via tools such as
llama.cpp,Ollama,LM Studio, andJan.
Table of Contents
- Overview
- Model Details
- Available Quantizations
- Capabilities
- Limitations
- Intended Use
- Architecture & Quantization
- Usage Recommendations
- Installation & Inference
- Original Model & Other Versions
- Evaluation
- Risks & Considerations
- Future Work
- Community & Support
- License
- Acknowledgements
Overview
This is the official GGUF quantized release of Nexora-Vector-v0.1, published by Open4bits — the official quantization project under ArkAiLabs. Multiple quantization levels are provided to suit different hardware configurations, from low-VRAM and CPU-only setups to higher-fidelity deployments.
The base model is a supervised fine-tuned variant of Qwen3-4B-Instruct, adapted specifically to generate structured vector graphics in SVG format from natural language instructions.
This release is in beta and is intended for research, experimentation, and early-stage design tooling. All outputs should be validated before use in any downstream pipeline.
Model Details
| Property | Details |
|---|---|
| Model Type | GGUF Quantized |
| Base Model | Nexora-Vector-v0.1 |
| Original Base | Qwen3-4B-Instruct |
| Architecture | Qwen3 |
| Parameters | 4B |
| Fine-tuning Method | Supervised Fine-Tuning (SFT) |
| Format | GGUF |
| Compatible Runtimes | llama.cpp, Ollama, LM Studio, Jan, kobold.cpp, and others |
| Output Format | SVG |
| License | Apache 2.0 |
Available Quantizations
| Quantization | Bits | File Size | Recommended For |
|---|---|---|---|
| Q2_K | 2-bit | 1.67 GB | Minimal memory, CPU-only, lowest quality |
| IQ3_XXS | 3-bit | 1.67 GB | Very low memory, better than Q2_K |
| IQ3_M | 3-bit | 1.96 GB | Low memory with improved quality over IQ3_XXS |
| IQ4_XS | 4-bit | 2.27 GB | Compact 4-bit with good quality |
| IQ4_NL | 4-bit | 2.38 GB | 4-bit with slightly higher fidelity |
| Q4_K_M | 4-bit | 2.50 GB | Best balance of size and quality ✅ Recommended |
| Q5_K_S | 5-bit | 2.82 GB | Higher quality with moderate memory |
| Q6_K | 6-bit | 3.31 GB | High quality with moderate memory |
| Q8_0 | 8-bit | 4.28 GB | Near full-precision quality, higher memory |
Recommendation:
Q4_K_Mis the best starting point for most users — it offers a strong balance between output quality and memory footprint.
Capabilities
Nexora-Vector-v0.1 is designed to translate textual instructions into structured SVG code. This GGUF release retains all capabilities of the original model while enabling flexible local inference across a wide range of hardware. The model is best suited for:
- Generating SVG markup for simple vector graphics
- Producing geometric shapes and basic illustrations
- Creating lightweight icons and minimal design assets
- Supporting rapid prototyping in vector-based design workflows
- Running fully locally with no internet connection required
Tip: The model performs best with concise, clearly scoped prompts focused on simple visual compositions.
Limitations
This is an early-stage beta release. Users should be aware of the following constraints:
- High hallucination rate — outputs may be invalid or non-renderable SVG
- Limited generalization — the small training dataset (~1,500 samples) affects output consistency
- Weak complex scene handling — highly detailed or multi-element prompts may produce poor results
- Manual correction required — outputs should be validated and post-processed before use
- Not production-ready — not suitable for safety-critical or automated pipelines
- Quantization quality trade-off — lower-bit quants (e.g. Q2_K) will show more degradation in output quality
Intended Use
✅ Supported Use Cases
- Academic and applied research in text-to-vector generation
- Experimental AI-assisted design systems running locally
- Educational exploration of structured output generation
- Lightweight SVG prototyping and ideation with low memory overhead
- Offline and air-gapped environments requiring fully local inference
❌ Out-of-Scope Use Cases
- Production-grade or commercial vector asset pipelines
- High-precision design deliverables without human validation
- Automated systems where SVG correctness is required without manual review
Architecture & Quantization
This release provides GGUF quantizations of the original Nexora-Vector-v0.1 weights, which are a supervised fine-tune of Qwen3-4B-Instruct.
Quantization Details
| Parameter | Details |
|---|---|
| Format | GGUF |
| Source Model | ArkAiLab-Adl/nexora-vector-v0.1 |
| Published By | Open4bits (official ArkAiLabs quantization project) |
| Available Levels | Q2_K, IQ3_XXS, IQ3_M, IQ4_XS, IQ4_NL, Q4_K_M, Q5_K_S, Q6_K, Q8_0 |
| Compatible Runtimes | llama.cpp, Ollama, LM Studio, Jan, kobold.cpp |
Original Training Configuration
| Parameter | Details |
|---|---|
| Fine-tuning Method | Supervised Fine-Tuning (SFT) |
| Dataset Composition | Curated prompt–SVG pairs |
| Dataset Size | ~1,500 samples |
| Training Objective | Structured output generation for SVG formats |
Note: The relatively small dataset size may result in instability and limited generalization across diverse prompts. Improved dataset coverage is planned for future versions.
Usage Recommendations
To get the best results from this model:
- Keep prompts simple and specific — avoid multi-scene or highly complex compositions
- Validate all SVG outputs before rendering or integrating into any pipeline
- Post-process outputs to correct syntax or structural issues
- Use iterative prompting — refining prompts across multiple turns often yields better results
- Expect imperfections — this is a beta model; treat outputs as drafts, not finals
- Choose the right quant — use
Q4_K_Mfor the best quality-to-size ratio; only useQ2_Kif memory is severely constrained
Installation & Inference
llama.cpp
# Clone and build llama.cpp
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp && make
# Download a quantized file, e.g. Q4_K_M
huggingface-cli download Open4bits/nexora-vector-v0.1-GGUF \
--include "*Q4_K_M*" --local-dir ./nexora-vector-gguf
# Run inference
./llama-cli \
-m ./nexora-vector-gguf/nexora-vector-v0.1-Q4_K_M.gguf \
-p "Generate an SVG of a simple red circle centered on a white background." \
-n 1024
Ollama
# Create a Modelfile
cat > Modelfile <<EOF
FROM ./nexora-vector-v0.1-Q4_K_M.gguf
EOF
ollama create nexora-vector -f Modelfile
ollama run nexora-vector "Generate an SVG of a blue star."
LM Studio
- Open LM Studio and go to the Search tab
- Search for
Open4bits/nexora-vector-v0.1-GGUF - Select your preferred quantization (recommended:
Q4_K_M) - Click Download and load the model
- Use the Chat or API tab to run inference
Python (via llama-cpp-python)
pip install llama-cpp-python
from llama_cpp import Llama
llm = Llama.from_pretrained(
repo_id="Open4bits/nexora-vector-v0.1-GGUF",
filename="*Q4_K_M*",
n_ctx=2048,
)
output = llm(
"Generate an SVG of a simple red circle centered on a white background.",
max_tokens=1024,
)
print(output["choices"][0]["text"])
Original Model & Other Versions
| Version | Link |
|---|---|
| Original (full precision) | ArkAiLab-Adl/nexora-vector-v0.1 |
| GGUF (this model) | Open4bits/nexora-vector-v0.1-GGUF |
| MLX 4-Bit | Open4bits/nexora-vector-v0.1-mlx-4Bit |
Evaluation
Nexora-Vector-v0.1 has not yet undergone formal benchmark evaluation. Current assessment is qualitative, based on manual testing of SVG generation tasks.
Planned evaluation metrics for future releases include:
| Metric | Description |
|---|---|
| SVG Validity Rate | Percentage of outputs that are parseable, valid SVG |
| Structural Correctness | Adherence to SVG schema and element hierarchy |
| Prompt Adherence | Alignment between user intent and generated output |
| Visual Consistency | Stability of outputs across similar prompts |
Risks & Considerations
Developers integrating this model should account for the following risks:
- Generation of malformed or non-functional SVG code
- Inconsistent instruction following across prompt variations
- Unpredictable outputs due to limited training data coverage
- Quality degradation at lower quantization levels (particularly Q2_K)
Recommendation: Implement downstream validation layers and SVG syntax checking before any rendering or integration.
Future Work
The following improvements are planned for upcoming versions of the Nexora Vector series:
- Expanded and more diverse training dataset
- Improved SVG syntax correctness and validity rates
- Reduced hallucination rates
- Enhanced natural language understanding for complex prompts
- Support for richer vector compositions and multi-element scenes
- Formal benchmark evaluation suite
- Updated GGUF releases aligned with future model versions
Community & Support
Join the community for updates and discussion:
License
This model is released under the Apache License 2.0.
You may use, modify, and distribute this model in accordance with the terms of the Apache 2.0 license. See the LICENSE file for full details, or refer to the official Apache 2.0 license text.
Acknowledgements
This is an official ArkAiLabs release, published under the Open4bits project — ArkAiLabs' dedicated initiative for quantized model releases. The GGUF weights are derived from Nexora-Vector-v0.1, which is itself built upon Qwen3-4B by the Qwen team. We thank the llama.cpp team and the open-source AI community for their continued contributions that make projects like this possible.
About Nexora & Open4bits
Nexora is an experimental AI initiative under ArkAiLabs, focused on building lightweight, practical, and creative AI systems for real-world applications. The Nexora Vector series represents our exploration into AI-assisted vector graphics generation.
Open4bits is ArkAiLabs' official project for quantized model releases, providing optimized variants of Nexora models for efficient local inference across different hardware platforms.
- Downloads last month
- 955
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit