Model 6: DeiT RGB + raw depth carbohydrate regression

This model predicts total carbohydrate from overhead RGB images and overhead raw depth images.

Architecture

  • RGB backbone: deit_base_distilled_patch16_384_imagenet
  • Input size: 384x384
  • RGB branch uses pretrained DeiT
  • Raw depth branch uses a lightweight CNN
  • Depth branch also creates a soft guidance mask for the RGB input
  • Fusion head combines RGB and depth features
  • 2-stage fine-tuning

Test results

Stage 1

  • Loss: 89.8303
  • MAE: 6.3148
  • MSE: 89.8303

Stage 2

  • Loss: 63.8722
  • MAE: 5.2992
  • MSE: 63.8722
Downloads last month
18
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support