pytorch YOLO-v1

Chinese blog post

This is an experimental repository and is not exactly the same as the original paper. Our performance on the VOC07 test set is 0.665 mAP.

I wrote this code for learning purposes. I implemented only the forward pass; with PyTorch's autograd mechanism, the backward pass is done automatically.
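As a minimal sketch of what "forward only" means in PyTorch: the toy loss below defines just a `forward` method, and `loss.backward()` fills in the gradients. `SquaredErrorLoss` is a hypothetical stand-in for illustration, not the loss used in this repository.

```python
import torch
import torch.nn as nn


class SquaredErrorLoss(nn.Module):
    """Toy loss (hypothetical): only forward is written by hand."""

    def forward(self, pred, target):
        # Sum of squared differences; autograd records these operations.
        return ((pred - target) ** 2).sum()


pred = torch.tensor([1.0, 2.0, 3.0], requires_grad=True)
target = torch.tensor([0.0, 2.0, 5.0])

loss = SquaredErrorLoss()(pred, target)
loss.backward()  # no hand-written backward needed

print(loss.item())  # 1 + 0 + 4 = 5.0
print(pred.grad)    # d(loss)/d(pred) = 2 * (pred - target)
```

The same pattern applies to a more complex detection loss, as long as it is built from differentiable tensor operations.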

To make it easy to use PyTorch's pretrained models, our backbone network is ResNet50, with an extra block added to increase the receptive field. In addition, we drop the fully connected layer.

Efficiency has not been optimized. It could probably be faster, but I don't know.

Trained on VOC2012 + VOC2007.

model            backbone     mAP (VOC2007 test)   FPS
our ResNet_YOLO  ResNet50     66.5%                57
YOLO             darknet19?   63.4%                45
YOLO VGG-16      VGG-16       66.4%                21

1. Dependency

  • pytorch 0.2.0_2
  • opencv
  • visdom
  • tqdm

2. Prepare

  1. Download the VOC2012 train dataset.
  2. Download the VOC2007 test dataset.
  3. Put all images in one folder. I have provided a txt annotation file.
  4. Convert the XML annotations to a txt file. To use the conversion script, put it in the same folder as the VOC dataset, or change the Annotations path in the script.
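For readers who want to see what such a conversion involves, here is a minimal sketch using only the standard library. The output line format (`image_name x1 y1 x2 y2 class_index ...`), the function name `xml_to_line`, and the alphabetical class ordering are assumptions for illustration and may not match the provided txt file.

```python
import xml.etree.ElementTree as ET

# The 20 Pascal VOC classes, in alphabetical order (index = class id, assumed).
VOC_CLASSES = [
    "aeroplane", "bicycle", "bird", "boat", "bottle", "bus", "car", "cat",
    "chair", "cow", "diningtable", "dog", "horse", "motorbike", "person",
    "pottedplant", "sheep", "sofa", "train", "tvmonitor",
]


def xml_to_line(xml_text, classes=VOC_CLASSES):
    """Turn one VOC annotation into: name x1 y1 x2 y2 cls [x1 y1 x2 y2 cls ...]."""
    root = ET.fromstring(xml_text)
    parts = [root.findtext("filename")]
    for obj in root.iter("object"):
        name = obj.findtext("name")
        if name not in classes:
            continue  # ignore labels outside the class list
        box = obj.find("bndbox")  # direct child only, so nested part boxes are skipped
        coords = [str(int(float(box.findtext(tag))))
                  for tag in ("xmin", "ymin", "xmax", "ymax")]
        parts.extend(coords + [str(classes.index(name))])
    return " ".join(parts)


sample = """
<annotation>
  <filename>000001.jpg</filename>
  <object><name>dog</name>
    <bndbox><xmin>48</xmin><ymin>240</ymin><xmax>195</xmax><ymax>371</ymax></bndbox>
  </object>
  <object><name>person</name>
    <bndbox><xmin>8</xmin><ymin>12</ymin><xmax>352</xmax><ymax>498</ymax></bndbox>
  </object>
</annotation>
"""

line = xml_to_line(sample)
print(line)  # 000001.jpg 48 240 195 371 11 8 12 352 498 14
```

In practice you would loop over every file in the Annotations folder and write one such line per image.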

3. Train

Run python

Be careful:

  1. Change the image file path.
  2. I recommend that you install visdom and run it.

4. Evaluation

Run python

Be careful:

  1. Change the image file path.
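To give a feel for how a per-class AP (and from it, mAP) can be computed, here is a small sketch of the 11-point interpolated AP used by the VOC2007 protocol. Whether the evaluation script in this repository uses exactly this variant is an assumption; the helper names `precision_recall` and `voc07_ap` are hypothetical.

```python
def precision_recall(tp_flags, num_gt):
    """tp_flags: 1/0 per detection, already sorted by descending confidence."""
    recall, precision = [], []
    tp = fp = 0
    for flag in tp_flags:
        tp += flag
        fp += 1 - flag
        recall.append(tp / num_gt)
        precision.append(tp / (tp + fp))
    return recall, precision


def voc07_ap(recall, precision):
    """11-point interpolation: average the best precision at recall >= 0.0, 0.1, ..., 1.0."""
    ap = 0.0
    for i in range(11):
        threshold = i / 10
        best = max((p for r, p in zip(recall, precision) if r >= threshold),
                   default=0.0)
        ap += best / 11
    return ap


# Three detections for one class: hit, miss, hit; two ground-truth boxes.
rec, prec = precision_recall([1, 0, 1], num_gt=2)
ap = voc07_ap(rec, prec)
print(round(ap, 3))  # (6 * 1.0 + 5 * 2/3) / 11 = 0.848
```

mAP is then the mean of this AP over all classes.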

5. Result

Our mAP on the VOC2007 test set is about 0.665. Some results are shown below; you can see more in the testimg folder.

