Elephant Detection

Using Keras-RetinaNet to detect elephants from aerial images

Flatiron School DC-DS-08262019 Capstone Project

In this project I wanted to explore computer vision, so I jumped straight into the deep end: object detection.

Data

Source: Naudé, Johannes J., & Joubert, Deon. (2019). The Aerial Elephant Dataset. Zenodo. http://doi.org/10.5281/zenodo.3234780

From the paper:

(A) challenging dataset to enable research on game detection under real-world conditions. The dataset consists of 2101 images containing a total of 15 511 African bush elephants in their natural habitats, imaged with a consistent methodology over a range of background types, resolutions and times-of-day.

Methods and Libraries

I used the Keras implementation of RetinaNet by fizyr, which is based on the paper Focal Loss for Dense Object Detection by Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár.

As a supplement, I also used this anchor optimization tool by @martinzlocha to choose an anchor configuration (anchor ratios and scales) that fits the data. This is especially important because the objects to be detected are extremely small.
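
To illustrate why anchor ratios and scales matter for small objects, here is a minimal sketch of how RetinaNet-style anchor shapes follow from a base size, a set of ratios, and a set of scales. The function name `anchor_shapes` and the "smaller" scale values are hypothetical and chosen only for illustration; they are not the configuration used in this project. The first call uses the ratios and scales described in the RetinaNet paper for its finest pyramid level.

```python
import math

def anchor_shapes(base_size, ratios, scales):
    """Return (width, height) for every ratio/scale pair at one pyramid level."""
    shapes = []
    for ratio in ratios:
        for scale in scales:
            area = (base_size * scale) ** 2
            width = math.sqrt(area / ratio)  # ratio is height / width
            height = width * ratio
            shapes.append((round(width, 1), round(height, 1)))
    return shapes

ratios = [0.5, 1.0, 2.0]

# Ratios and scales from the RetinaNet paper, base size 32 (finest level).
default_shapes = anchor_shapes(32, ratios, [2 ** 0, 2 ** (1 / 3), 2 ** (2 / 3)])

# Hypothetical smaller scales, of the kind an optimizer might suggest
# when most objects are only a few dozen pixels across.
small_shapes = anchor_shapes(32, ratios, [0.4, 0.5, 0.63])

print("smallest default anchor:", min(default_shapes))
print("smallest adjusted anchor:", min(small_shapes))
```

With the default scales, no anchor at the finest level is smaller than the base size in area, so objects much smaller than that overlap poorly with every anchor. Shrinking the scales is one way to close that gap.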

This is a single-class object detection task (class 0: elephant).

Work Environment

Google Cloud Platform instance with Nvidia Tesla T4 GPU
CUDA 10
TensorFlow 1.15
Keras 2.3.1

Project Steps and Results

Create a bounding box around each annotation in the dataset (each annotation is the x, y pixel coordinate of an elephant's midpoint)
Divide the dataset's test set into test and validation sets
Train without augmentation, starting from a pre-trained COCO model (downloaded here), and continue training for 60 epochs
Train with visual-effect augmentation, starting from a pre-trained COCO model, and continue training for 38 epochs
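
The first two steps can be sketched as follows. This is a minimal illustration, not the code from the notebooks: the function names, the 40-pixel box size, the 50/50 split, and the example image name are all assumptions. The row layout (path, x1, y1, x2, y2, class name) follows the CSV annotation format that keras-retinanet documents for its CSV generator.

```python
import random

def point_to_box(x, y, img_w, img_h, box_size=40):
    """Centre a square box on (x, y) and clip it to the image bounds.

    box_size is a hypothetical value; the right size depends on image resolution.
    """
    half = box_size // 2
    x1, y1 = max(0, x - half), max(0, y - half)
    x2, y2 = min(img_w, x + half), min(img_h, y + half)
    return x1, y1, x2, y2

def to_csv_row(image_path, box, class_name="elephant"):
    """Format one annotation as path,x1,y1,x2,y2,class_name."""
    return ",".join([image_path, *map(str, box), class_name])

def split_images(image_names, val_fraction=0.5, seed=0):
    """Split whole images (not individual boxes) into test and validation lists."""
    names = sorted(set(image_names))
    random.Random(seed).shuffle(names)
    cut = int(len(names) * val_fraction)
    return names[cut:], names[:cut]  # (test, validation)

# A midpoint near the left edge gets a box clipped at x = 0.
row = to_csv_row("img_001.jpg", point_to_box(10, 300, 5000, 3000))
print(row)

test_names, val_names = split_images(["a.jpg", "b.jpg", "c.jpg", "d.jpg"])
print(test_names, val_names)
```

Splitting by image rather than by box keeps every elephant from one photograph in the same set, so the validation score is not inflated by near-duplicate backgrounds.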

Model performance at this point:

Recall : 42.9%
Precision : 12.3%
mAP : 19.5%
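
For reference, precision and recall relate to detection counts as in the sketch below, which also shows how a precision figure translates into false positives per true positive. The function names and the example counts are illustrative assumptions; only the 12.3% precision comes from the results above.

```python
def precision_recall(tp, fp, fn):
    """Compute precision and recall from true/false positive and false negative counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

def false_positives_per_true_positive(precision):
    """FP / TP follows directly from precision = TP / (TP + FP)."""
    return (1 - precision) / precision

# Made-up counts, only to show the calculation.
print(precision_recall(tp=3, fp=1, fn=2))

# Using the reported precision of 12.3%.
print(round(false_positives_per_true_positive(0.123), 1))
```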

Discussion and Future Work

The model produced a lot of false positives (about 7 false positives for every true positive), and it performs better (higher recall and precision) on some landscapes than on others. To improve performance, the following steps can be taken:

Clip / divide images into smaller ones to help with data imbalance and speed up training
Train separate models for different landscapes
Train model to include objects often mis-identified as elephants
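
The first idea, dividing large images into smaller ones, could look roughly like the sketch below. It is a sketch under assumptions rather than a tested pipeline: the function names, the 1024-pixel tile size, and the 128-pixel overlap are hypothetical. It only computes tile windows and remaps boxes; cropping the pixels themselves would be done with whatever image library is in use.

```python
def _starts(length, tile, step):
    """Start offsets along one axis so that tiles cover the full length."""
    if length <= tile:
        return [0]
    starts = list(range(0, length - tile + 1, step))
    if starts[-1] + tile < length:
        starts.append(length - tile)  # extra tile flush with the far edge
    return starts

def tile_windows(img_w, img_h, tile=1024, overlap=128):
    """Return (x1, y1, x2, y2) windows that cover the image with some overlap."""
    step = tile - overlap
    return [(x, y, min(x + tile, img_w), min(y + tile, img_h))
            for y in _starts(img_h, tile, step)
            for x in _starts(img_w, tile, step)]

def boxes_in_window(boxes, window):
    """Keep boxes that lie fully inside the window, shifted to tile coordinates."""
    wx1, wy1, wx2, wy2 = window
    return [(x1 - wx1, y1 - wy1, x2 - wx1, y2 - wy1)
            for x1, y1, x2, y2 in boxes
            if x1 >= wx1 and y1 >= wy1 and x2 <= wx2 and y2 <= wy2]

windows = tile_windows(2000, 1000)
for window in windows:
    print(window, boxes_in_window([(1000, 10, 1030, 40)], window))
```

The overlap is there so that an elephant cut by one tile border still appears whole in a neighbouring tile. Tiles with no boxes can then be dropped or down-sampled to reduce the imbalance between background and elephants.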
