craft-mlt-25k-onnx

This repository contains an optimized ONNX version of the CRAFT (Character Region Awareness for Text) model.

Model Details

  • Architecture: CRAFT (VGG16-based)
  • Format: ONNX (Single File)
  • Opset: 18
  • Task: Scene Text Detection

Input Requirements

  • Shape: [batch_size, 3, height, width]
  • Constraints: height and width must be multiples of 32.
  • Normalization: Scale pixels to [0, 1] (divide by 255).
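
The requirements above can be sketched as a small preprocessing helper. This is a minimal sketch, not the author's pipeline: the function name `preprocess` is hypothetical, and it assumes an RGB image stored as an H x W x 3 uint8 NumPy array, float32 model input, and zero-padding on the bottom and right edges (rather than resizing) to reach the next multiple of 32.

```python
import numpy as np


def preprocess(image: np.ndarray, multiple: int = 32) -> np.ndarray:
    """Convert an H x W x 3 uint8 image to a (1, 3, H', W') float32 tensor.

    H' and W' are H and W rounded up to the next multiple of `multiple`.
    Zero-padding, RGB order, and float32 are assumptions, not documented behavior.
    """
    if image.ndim != 3 or image.shape[2] != 3:
        raise ValueError("expected an H x W x 3 image")

    h, w = image.shape[:2]
    # Number of rows/columns needed to reach the next multiple of 32.
    pad_h = (-h) % multiple
    pad_w = (-w) % multiple
    padded = np.pad(image, ((0, pad_h), (0, pad_w), (0, 0)), mode="constant")

    # Scale pixels to [0, 1] as stated in the input requirements.
    scaled = padded.astype(np.float32) / 255.0

    # HWC -> CHW, then add the batch dimension.
    chw = np.transpose(scaled, (2, 0, 1))
    return chw[np.newaxis, ...]
```

Padding keeps the original aspect ratio, so detected coordinates map back to the source image without rescaling; resizing to a multiple of 32 is an equally valid choice if distortion is acceptable.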

Usage

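import numpy as np
import onnxruntime as ort
from huggingface_hub import hf_hub_download

# Download and load the model (CUDA first, then CPU)
model_path = hf_hub_download(repo_id="KvaytG/craft-mlt-25k-onnx", filename="craft.onnx")
session = ort.InferenceSession(model_path, providers=['CUDAExecutionProvider', 'CPUExecutionProvider'])

# Prepare input (example): placeholder tensor of shape (1, 3, 768, 768), values in [0, 1].
# float32 is assumed here; replace the random data with a real preprocessed image.
img_tensor = np.random.rand(1, 3, 768, 768).astype(np.float32)
outputs = session.run(None, {'input': img_tensor})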
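
This card does not list the names or shapes of the output tensors, so it can help to inspect the session before interpreting `outputs`. The sketch below uses onnxruntime's `get_inputs()` and `get_outputs()` methods; the helper name `describe_io` is hypothetical, and it takes the `session` object created in the snippet above.

```python
def describe_io(session):
    """Return the name, shape, and element type of each model input and output.

    `session` is expected to be an onnxruntime.InferenceSession (or any object
    exposing get_inputs() / get_outputs() with the same attributes).
    """
    def to_dict(node):
        return {"name": node.name, "shape": list(node.shape), "type": node.type}

    return {
        "inputs": [to_dict(n) for n in session.get_inputs()],
        "outputs": [to_dict(n) for n in session.get_outputs()],
    }


# Usage, assuming `session` from the snippet above:
# for kind, nodes in describe_io(session).items():
#     for node in nodes:
#         print(kind, node["name"], node["shape"], node["type"])
```

Dynamic dimensions such as batch size, height, and width typically appear as strings or None in the reported shapes rather than as integers.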

Credits

Original model by Clova AI Research.

License

Licensed under the MIT license.