File size: 5,829 Bytes
08ec965 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 | # Prepare Datasets for Training and Evaluating CutLER
A dataset can be used by accessing [DatasetCatalog](https://detectron2.readthedocs.io/modules/data.html#detectron2.data.DatasetCatalog)
for its data, or [MetadataCatalog](https://detectron2.readthedocs.io/modules/data.html#detectron2.data.MetadataCatalog) for its metadata (class names, etc).
This document explains how to setup the builtin datasets so they can be used by the above APIs. [Use Custom Datasets](https://detectron2.readthedocs.io/tutorials/datasets.html) gives a deeper dive on how to use `DatasetCatalog` and `MetadataCatalog`,
and how to add new datasets to them.
CutLER has builtin support for a few datasets. The datasets are assumed to exist in a directory specified by the environment variable `DETECTRON2_DATASETS`. Under this directory, detectron2 will look for datasets in the structure described below, if needed.
```
$DETECTRON2_DATASETS/
imagenet/
coco/
voc/
kitti/
...
```
You can set the location for builtin datasets by `export DETECTRON2_DATASETS=/path/to/datasets`. If left unset, the default is `./datasets` relative to your current working directory.
All dataset annotation files should be converted to the COCO format.
## Expected dataset structure for [ImageNet](https://image-net.org/download.php):
```
imagenet/
train/
n01440764/*.JPEG
n01443537/*.JPEG
...
annotations/
imagenet_train_fixsize480_tau0.15_N3.json # generated by MaskCut
```
The ImageNet-1K samples should be downloaded from [here](https://image-net.org/download.php) and the pre-generated json file can be directly download from [here](http://dl.fbaipublicfiles.com/cutler/maskcut/imagenet_train_fixsize480_tau0.15_N3.json).
Or if you want to generate your own pseudo-masks, you can follow the instructions on [MaskCut](../README.md#1-maskcut).
## Expected dataset structure for [COCO, COCO20K, LVIS](https://cocodataset.org/#download):
```
coco/
annotations/
instances_{train,val}2017.json
coco20k_trainval_gt.json
coco_cls_agnostic_instances_val2017.json
{1,2,5,10,20,30,40,50,60,80}perc_instances_train2017.json
lvis1.0_cocofied_val_cls_agnostic.json
{train,val}2017/
000000000139.jpg
000000000285.jpg
...
train2014/
COCO_train2014_000000581921.jpg
COCO_train2014_000000581909.jpg
...
```
Following previous work, we prepare annotations for semi-supervised learning by randomly sampling a subset of labeled images.
You can download these annotation filess needed for COCO semi-supervised learning experiments and for evaluating the performance of the models on COCO, COCO20K and LVIS: [coco-semi](http://dl.fbaipublicfiles.com/cutler/coco-smi/annotations.zip), [coco](http://dl.fbaipublicfiles.com/cutler/coco/coco_cls_agnostic_instances_val2017.json), [lvis](http://dl.fbaipublicfiles.com/cutler/coco/lvis1.0_cocofied_val_cls_agnostic.json) and [coco20k](http://dl.fbaipublicfiles.com/cutler/coco/coco20k_trainval_gt.json).
## Expected dataset structure for [VOC2007](http://host.robots.ox.ac.uk/pascal/VOC/voc2012/index.html#devkit):
```
voc/
annotations/
trainvaltest_2007_cls_agnostic.json
VOC2007/
JPEGImages/
000001.jpg
...
```
We provide pre-converted COCO-style annotation files [here](http://dl.fbaipublicfiles.com/cutler/voc/trainvaltest_2007_cls_agnostic.json)
## Expected dataset structure for [Objects365-V2](https://www.objects365.org/download.html):
```
objects365/
annotations/
zhiyuan_objv2_val_cls_agnostic.json
val/
000000000139.jpg
000000000285.jpg
...
```
You only need to download the validation set of Objects365 for evaluation.
We provide pre-converted COCO-style annotation files [here](http://dl.fbaipublicfiles.com/cutler/objects365/zhiyuan_objv2_val_cls_agnostic.json)
## Expected dataset structure for [OpenImages-V6](https://storage.googleapis.com/openimages/web/download_v6.html):
```
openImages/
annotations/
openimages_val_cls_agnostic.json
validation/
47947b97662dc962.jpg
...
```
You only need to download the validation set of OpenImages for evaluation.
We provide pre-converted COCO-style annotation files [here](http://dl.fbaipublicfiles.com/cutler/openImages/openimages_val_cls_agnostic.json)
## Expected dataset structure for [UVO](https://sites.google.com/view/unidentified-video-object/dataset):
```
uvo/
annotations/
val_sparse_cleaned_cls_agnostic.json
all_UVO_frames/
000000000139.jpg
000000000285.jpg
...
```
You only need to download the validation set of UVO for evaluation.
We provide pre-converted COCO-style annotation files [here](http://dl.fbaipublicfiles.com/cutler/uvo/val_sparse_cleaned_cls_agnostic.json)
## Expected dataset structure for [KITTI](https://www.cvlibs.net/datasets/kitti/eval_object.php?obj_benchmark=2d):
```
kitti/
annotations/
trainval_cls_agnostic.json
JPEGImages/
001717.jpg
...
```
We provide pre-converted COCO-style annotation files [here](https://dl.fbaipublicfiles.com/cutler/kitti/trainval_cls_agnostic.json)
## Expected dataset structure for [Watercolor, Comic, Clipart](https://github.com/naoto0804/cross-domain-detection):
```
watercolor/
annotations/
traintest_cls_agnostic.json
JPEGImages/
163330523.jpg
...
comic/
annotations/
traintest_cls_agnostic.json
JPEGImages/
161067391.jpg
...
clipart/
annotations/
traintest_cls_agnostic.json
JPEGImages/
375390294.jpg
...
```
We provide pre-converted COCO-style annotation files here: [watercolor](http://dl.fbaipublicfiles.com/cutler/watercolor/traintest_cls_agnostic.json), [comic](http://dl.fbaipublicfiles.com/cutler/comic/traintest_cls_agnostic.json) and [clipart](http://dl.fbaipublicfiles.com/cutler/clipart/traintest_cls_agnostic.json).
NOTE: ALL DATASETS FOLLOW THEIR ORIGINAL LICENSES. |