ScanIndex
Collection
Models loaded by https://github.com/welcomyou/scanindex โ OCR, KIE, layout, tables, embedder for Vietnamese admin docs. โข 8 items โข Updated
Re-export of Microsoft's Table Transformer to ONNX, packaged the way GMFT consumes it. Used by ScanIndex for table detection + structure recognition during DOCX export.
| Subdir | Upstream | Task |
|---|---|---|
gmft_onnx/detection/model.onnx |
microsoft/table-transformer-detection |
Detect table bounding boxes on a page |
gmft_onnx/structure/model.onnx |
microsoft/table-transformer-structure-recognition-v1.1-all |
Detect rows / columns / cells inside a cropped table |
Each subdir also contains the HF config.json + preprocessor metadata so transformers / optimum can wrap the ONNX directly.
from huggingface_hub import snapshot_download
local = snapshot_download("welcomyou/gmft-tatr-onnx", local_dir="models")
# Detection: f"{local}/gmft_onnx/detection/model.onnx"
# Structure: f"{local}/gmft_onnx/structure/model.onnx"
See train-convert/gmft/convert/export_gmft_tatr_to_onnx.py.
MIT, inherited from Microsoft Table Transformer.
Base model
microsoft/table-transformer-detection