vectorized-dev commited on
Commit
b93c971
·
verified ·
1 Parent(s): a707b55

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +104 -0
README.md ADDED
@@ -0,0 +1,104 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: ultralytics
3
+ tags:
4
+ - yolo
5
+ - yolo11
6
+ - object-detection
7
+ - logo-detection
8
+ - logodet-3k
9
+ - brandspotter
10
+ license: mit
11
+ datasets:
12
+ - LogoDet-3K
13
+ metrics:
14
+ - mAP50
15
+ - mAP50-95
16
+ - precision
17
+ - recall
18
+ pipeline_tag: object-detection
19
+ ---
20
+
21
+ # BrandSpotter — Logo Detection & Brand Identification
22
+
23
+ A three-stage pipeline for detecting and identifying brand logos in images: **YOLO11 detection**, **ResNet50 classification**, and **open-set rejection** for unknown brands.
24
+
25
+ This repo contains the trained model weights. Source code: [github.com/daa2618/brandspotter](https://github.com/daa2618/brandspotter)
26
+
27
+ ## Models
28
+
29
+ ### YOLO11m — Logo Detection (`yolo/`)
30
+
31
+ Fine-tuned YOLO11m for single-class logo detection on [LogoDet-3K](https://github.com/Wangjing1551/LogoDet-3K-Dataset).
32
+
33
+ | Metric | Value |
34
+ |--------|-------|
35
+ | mAP@0.5 | **0.894** |
36
+ | mAP@0.5:0.95 | **0.639** |
37
+ | Precision | 0.829 |
38
+ | Recall | 0.863 |
39
+
40
+ **Training details:**
41
+ - Base model: `yolo11m.pt` (pretrained)
42
+ - Epochs: 50 (best checkpoint at epoch 47)
43
+ - Image size: 640x640
44
+ - Optimizer: auto (AdamW)
45
+ - Learning rate: 0.001
46
+ - Batch size: auto
47
+ - Hardware: Google Colab T4 GPU (~2 hours)
48
+ - Dataset: LogoDet-3K (single-class: "logo")
49
+ - Augmentation: mosaic, randaugment, erasing (0.4), horizontal flip (0.5)
50
+
51
+ ### ResNet50 — Brand Classification (`resnet/`)
52
+
53
+ _Coming soon._
54
+
55
+ ## Usage
56
+
57
+ ```python
58
+ from ultralytics import YOLO
59
+
60
+ # Download from HuggingFace
61
+ model = YOLO("hf://vectorized-dev/brandspotter/yolo/best.pt")
62
+
63
+ # Run inference
64
+ results = model("path/to/image.jpg")
65
+ results[0].show()
66
+ ```
67
+
68
+ Or download manually and load from a local path:
69
+
70
+ ```python
71
+ model = YOLO("path/to/best.pt")
72
+ results = model("path/to/image.jpg")
73
+ ```
74
+
75
+ ## Files
76
+
77
+ ```
78
+ yolo/
79
+ best.pt — Trained weights (best checkpoint, ~39 MB)
80
+ args.yaml — Full training arguments
81
+ results.csv — Per-epoch training metrics
82
+ ```
83
+
84
+ ## Dataset
85
+
86
+ [LogoDet-3K](https://github.com/Wangjing1551/LogoDet-3K-Dataset) (Wang et al., ACM TOMM 2022). 158,652 images across 3,000 logo classes. The detection model treats all logos as a single class for region proposal; brand identification is handled by the downstream classifier.
87
+
88
+ ## Citation
89
+
90
+ ```bibtex
91
+ @article{wang2022logodet3k,
92
+ title={LogoDet-3K: A Large-scale Image Dataset for Logo Detection},
93
+ author={Wang, Jing and Min, Weiqing and Hou, Sujuan and Ma, Shengnan and Zheng, Yuanjie and Jiang, Shuqiang},
94
+ journal={ACM Transactions on Multimedia Computing, Communications, and Applications},
95
+ volume={18},
96
+ number={3},
97
+ year={2022},
98
+ publisher={ACM}
99
+ }
100
+ ```
101
+
102
+ ## License
103
+
104
+ MIT