Faria ONNX Models

Pre-exported ONNX models used by Faria, a document processing library with ML-powered layout detection and table extraction. These files are ready for direct use with ONNX Runtime โ€” no Python or conversion step required.

Models

detr_layout_detection.onnx (~350 MB)

Document layout detection. Identifies structural elements across a page.

Input

Name Shape Type
pixel_values [batch, 3, 800, 800] float32

Outputs

Name Shape Type Description
logits [batch, 100, 12] float32 Class scores (11 classes + no-object)
pred_boxes [batch, 100, 4] float32 Normalized boxes (cx, cy, w, h)

Class labels (DocLayNet)

Index Label
0 Caption
1 Footnote
2 Formula
3 List-item
4 Page-footer
5 Page-header
6 Picture
7 Section-header
8 Table
9 Text
10 Title
11 (no object)

Post-processing

  1. Apply softmax to logits
  2. Filter by confidence threshold
  3. Convert (cx, cy, w, h) โ†’ (x1, y1, x2, y2)
  4. Scale boxes to image size

nemotron_table_structure.onnx (~200 MB)

Table structure recognition.

Inputs

Name Shape Type Description
input [1, 3, 1024, 1024] float32 RGB image
orig_sizes [1, 2] int64 [height, width]

Outputs

Name Shape Type
labels [N] float32
boxes [N, 4] float32
scores [N] float32

Class labels

Index Label
1 cell
2 row
3 column
4 header

Installation

curl -fsSL https://raw.githubusercontent.com/exto360-inc/faria-install/main/install.sh | bash -s -- --features idp

Or download manually:

# Layout detection
curl -fsSL https://huggingface.co/pavan-synkrato360/faria-models/resolve/main/detr_layout_detection.onnx -o detr_layout_detection.onnx

# Table structure
curl -fsSL https://huggingface.co/pavan-synkrato360/faria-models/resolve/main/nemotron_table_structure.onnx -o nemotron_table_structure.onnx

config.json

{
  "models": {
    "detr_layout_detection": {
      "filename": "detr_layout_detection.onnx",
      "task": "document-layout-detection",
      "source": "cmarkea/detr-layout-detection",
      "onnx_opset": 14,
      "input": {
        "pixel_values": [1, 3, 800, 800]
      },
      "outputs": {
        "logits": [1, 100, 12],
        "pred_boxes": [1, 100, 4]
      },
      "classes": [
        "Caption", "Footnote", "Formula", "List-item",
        "Page-footer", "Page-header", "Picture", "Section-header",
        "Table", "Text", "Title"
      ]
    },
    "nemotron_table_structure": {
      "filename": "nemotron_table_structure.onnx",
      "task": "table-structure-recognition",
      "source": "nvidia/nemotron-table-structure-v1",
      "onnx_opset": 18,
      "inputs": {
        "input": [1, 3, 1024, 1024],
        "orig_sizes": [1, 2]
      },
      "outputs": {
        "labels": ["N"],
        "boxes": ["N", 4],
        "scores": ["N"]
      },
      "classes": {
        "1": "cell",
        "2": "row",
        "3": "column",
        "4": "header"
      }
    }
  }
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support