siftformer-1.5 / README.md
metadata
license: apache-2.0
tags:
  - video-classification
  - driver-behavior
pipeline_tag: video-classification

SiftFormer 1.5

Classes

| Index | Class | Korean label |
|-------|-------|--------------|
| 0 | normal | 정상 |
| 1 | drowsy | 졸음 |
| 2 | distracted | 주의분산 |
| 3 | violence | 폭행 |
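
The index-to-class mapping above can be applied to a model output as in the following minimal sketch. It assumes the model returns one score (logit) per class for each clip, which this README does not state; `ID2LABEL` and `decode_prediction` are hypothetical names introduced here for illustration.

```python
# Class indices and English names taken from the table above.
ID2LABEL = {0: "normal", 1: "drowsy", 2: "distracted", 3: "violence"}


def decode_prediction(logits):
    """Return (index, label) for the highest-scoring class.

    Assumption: `logits` is a sequence of 4 scores for a single clip,
    ordered by class index.
    """
    if len(logits) != len(ID2LABEL):
        raise ValueError(f"expected {len(ID2LABEL)} scores, got {len(logits)}")
    # Argmax over the class scores.
    index = max(range(len(logits)), key=lambda i: logits[i])
    return index, ID2LABEL[index]


if __name__ == "__main__":
    print(decode_prediction([0.1, 2.3, -0.5, 0.0]))
```

For a batched output, the same function can be called once per row.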

Input

  • Shape: [B, T, C, H, W] (batch, frames, channels, height, width)
  • T = 30 frames
  • H = W = 224
  • C = 3 (RGB)
  • ImageNet normalization (mean [0.485, 0.456, 0.406], std [0.229, 0.224, 0.225])
  • dtype: float32 or bfloat16
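
As a rough illustration of the input specification above, the sketch below turns one clip of raw frames into a `[1, 30, 3, 224, 224]` float32 array using NumPy. Several things here are assumptions rather than documented behavior: that frames arrive as uint8 RGB already resized to 224×224, that pixel values are scaled to [0, 1] before the ImageNet mean and std are applied (the usual convention for these statistics), and the function name `preprocess_clip`. How the 30 frames should be sampled from a video is not specified in this README.

```python
import numpy as np

# Values from the input specification above.
IMAGENET_MEAN = np.array([0.485, 0.456, 0.406], dtype=np.float32)
IMAGENET_STD = np.array([0.229, 0.224, 0.225], dtype=np.float32)
NUM_FRAMES = 30
FRAME_SIZE = 224


def preprocess_clip(frames: np.ndarray) -> np.ndarray:
    """Convert uint8 RGB frames [T, H, W, C] to a float32 batch [1, T, C, H, W].

    Assumption: frames are already resized to 224x224 and in RGB order.
    """
    expected = (NUM_FRAMES, FRAME_SIZE, FRAME_SIZE, 3)
    if frames.shape != expected:
        raise ValueError(f"expected shape {expected}, got {frames.shape}")
    # Assumption: scale to [0, 1] before normalizing.
    x = frames.astype(np.float32) / 255.0
    # Per-channel normalization; broadcasts over the last (channel) axis.
    x = (x - IMAGENET_MEAN) / IMAGENET_STD
    # Channels-last to channels-first: [T, H, W, C] -> [T, C, H, W].
    x = np.transpose(x, (0, 3, 1, 2))
    # Add the batch dimension: [1, T, C, H, W].
    return np.ascontiguousarray(x[np.newaxis])


if __name__ == "__main__":
    dummy = np.zeros((NUM_FRAMES, FRAME_SIZE, FRAME_SIZE, 3), dtype=np.uint8)
    print(preprocess_clip(dummy).shape)
```

If the model is run with PyTorch, the result can be wrapped with `torch.from_numpy(...)` and, if desired, cast to bfloat16 afterwards.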