Aligned Multi-View Scripts for Universal Chart-to-Code Generation

CharLuMA-1.3B generates a plotting script in Python, R, or LaTeX from a chart image. This 1.3B variant is from the paper "Aligned Multi-View Scripts for Universal Chart-to-Code Generation" (ACL 2026 Main Conference).

Backbone Vision encoder Output languages dtype
DeepSeek-Coder-1.3B-Instruct SigLIP-SO400M-patch14-384 Python, R, LaTeX bfloat16

Usage

This is a custom architecture, so AutoModel.from_pretrained will not load it.

Clone the codebase and use its loader:

git clone https://github.com/Zhihan72/CharLuMA

Before loading, replace the placeholder paths in config.json (/your_local_path/...) with deepseek-ai/deepseek-coder-1.3b-instruct (or a local copy) and google/siglip-so400m-patch14-384 (or a local copy).

Inference example: scripts/inference_charluma.py in the repo.

Citation

If you find our work useful, consider citing our paper as follows:

@misc{zhang2026aligned,
  title         = {Aligned Multi-View Scripts for Universal Chart-to-Code Generation},
  author        = {Zhihan Zhang and Lizi Liao},
  year          = {2026},
  eprint        = {2604.24559},
  archivePrefix = {arXiv},
  primaryClass  = {cs.CL},
  url           = {https://arxiv.org/abs/2604.24559}
}
Downloads last month
51
Safetensors
Model size
2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Zhihan/CharLuMA-1.3B

Finetuned
(63)
this model

Paper for Zhihan/CharLuMA-1.3B