Aligned Multi-View Scripts for Universal Chart-to-Code Generation
CharLuMA-1.3B generates a plotting script in Python, R, or LaTeX from a chart image. This 1.3B variant is from the paper "Aligned Multi-View Scripts for Universal Chart-to-Code Generation" (ACL 2026 Main Conference).
| Backbone | Vision encoder | Output languages | dtype |
|---|---|---|---|
| DeepSeek-Coder-1.3B-Instruct | SigLIP-SO400M-patch14-384 | Python, R, LaTeX | bfloat16 |
This is a custom architecture, so AutoModel.from_pretrained will not load it.
Clone the codebase and use its loader:
git clone https://github.com/Zhihan72/CharLuMA
Before loading, edit config.json and replace each placeholder path (/your_local_path/...) with the matching model: deepseek-ai/deepseek-coder-1.3b-instruct for the backbone and google/siglip-so400m-patch14-384 for the vision encoder. A local copy of either model also works.
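The placeholder replacement can also be scripted. The following is a minimal sketch, not part of the CharLuMA codebase: it assumes config.json is plain JSON and that each placeholder string contains a hint such as "deepseek" or "siglip" naming the component it stands for. The function names (patch_placeholders, patch_config_file) and the hint-based matching are hypothetical; any placeholder that matches no hint is left untouched and reported so it can be edited by hand.

```python
import json
from pathlib import Path

PLACEHOLDER_PREFIX = "/your_local_path/"

# Assumption: each placeholder path mentions the component it stands for.
# Check config.json yourself; the real key names and paths may differ.
REPLACEMENTS = {
    "deepseek": "deepseek-ai/deepseek-coder-1.3b-instruct",
    "siglip": "google/siglip-so400m-patch14-384",
}


def patch_placeholders(node, replacements=REPLACEMENTS, unresolved=None):
    """Return (patched copy of node, list of placeholders left unchanged)."""
    if unresolved is None:
        unresolved = []
    if isinstance(node, dict):
        patched = {
            key: patch_placeholders(value, replacements, unresolved)[0]
            for key, value in node.items()
        }
    elif isinstance(node, list):
        patched = [
            patch_placeholders(value, replacements, unresolved)[0]
            for value in node
        ]
    elif isinstance(node, str) and node.startswith(PLACEHOLDER_PREFIX):
        for hint, target in replacements.items():
            if hint in node.lower():
                patched = target
                break
        else:
            # No hint matched: keep the placeholder and report it.
            unresolved.append(node)
            patched = node
    else:
        patched = node
    return patched, unresolved


def patch_config_file(config_path):
    """Rewrite config.json in place and return any unresolved placeholders."""
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))
    patched, unresolved = patch_placeholders(config)
    path.write_text(json.dumps(patched, indent=2) + "\n", encoding="utf-8")
    return unresolved
```

Calling patch_config_file with the path to the downloaded config.json returns a list of placeholders that still need manual editing. To point at local copies instead, pass a replacements dict whose values are local directories.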
For an inference example, see scripts/inference_charluma.py in the repo.
If you find our work useful, consider citing our paper as follows:
@misc{zhang2026aligned,
title = {Aligned Multi-View Scripts for Universal Chart-to-Code Generation},
author = {Zhihan Zhang and Lizi Liao},
year = {2026},
eprint = {2604.24559},
archivePrefix = {arXiv},
primaryClass = {cs.CL},
url = {https://arxiv.org/abs/2604.24559}
}
Base model
deepseek-ai/deepseek-coder-1.3b-instruct