Togo Crop Mask

A pixel-wise land type classifier, used to generate a crop mask for Togo

Introduction

This repository contains code and data to generate a crop mask for Togo. It was used to deliver a high-resolution (10m) cropland mask in 10 days to help the government distribute aid to smallholder farmers during the COVID-19 pandemic.

It combines a hand-labelled dataset of crop / non-crop images with a global database of crowdsourced cropland data to train a multi-headed LSTM-based model to predict the presence of cropland in a pixel.

The map can be found on Google Earth Engine.

Pipeline

The main entrypoints into the pipeline are the scripts. Specifically:

scripts/export.py exports data (locally, or to Google Drive - see below)
scripts/process.py processes the raw data
scripts/engineer.py combines the earth observation data with the labels to create (x, y) training data
scripts/models.py trains the models
scripts/predict.py takes a trained model and runs it on exported tif files (the path to these files is defined in the script)

The split_tiff.py script is useful to break large exports from Google Earth Engine, which may be too large to fit into memory.

Once the pipeline has been run, the directory structure of the data folder should look like the following. If you get errors, a good first check would be to see if any files are missing.

data
│   README.md
│
└───raw // raw exports
│   └───togo  // this is included in this repo
│   └───geowiki_landcover_2017  // exported by scripts.export.export_geowiki()
│   └───earth_engine_togo  // exported to Google Drive by scripts.export.export_togo(), and must be copied here
│   │                      // scripts.export.export_togo() expects processed/togo{_evaluation} to exist
│   └───earth_engine_togo_evaluation  // exported to Google Drive by scripts.export.export_togo(), and must be copied here
│   │                                 // scripts.export.export_togo() expects processed/togo{_evaluation} to exist
│   └───earth_engine_geowiki  // exported to Google Drive by scripts.export.export_geowiki_sentinel_ee(), and must be copied here
│                             // scripts.export.export_geowiki_sentinel_ee() expects processed/geowiki_landcover_2017 to exist
│
└──processed  // raw data processed for clarity
│   └───geowiki_landcover_2017 // created by scripts.process.process_geowiki()
│   │                          // which expects raw/geowiki_landcover_2017 to exist
│   └───togo  // created by scripts.process.process_togo()
│   └───togo_evaluation  // created by scripts.process.process_togo()
│
└──features  // the arrays which will be ingested by the model
│   └───geowiki_landcover_2017 // created by scripts.engineer.engineer_geowiki()
│   └───togo  // created by scripts.engineer.engineer_togo()
│   └───togo_evaluation  // created by scripts.engineer.engineer_togo()
│
└──lightning_logs // created by pytorch_lightning when training models

Setup

Anaconda running python 3.6 is used as the package manager. To get set up with an environment, install Anaconda from the link above, and (from this directory) run

conda env create -f environment.yml

This will create an environment named landcover-mapping with all the necessary packages to run the code. To activate this environment, run

conda activate landcover-mapping

Earth Engine

Earth engine is used to export data. To use it, once the conda environment has been activated, run

earthengine authenticate

and follow the instructions. To test that everything has worked, run

python -c "import ee; ee.Initialize()"

Note that Earth Engine exports files to Google Drive by default (to the same google account used sign up to Earth Engine).

Running exports can be viewed (and individually cancelled) in the Tabs bar on the Earth Engine Code Editor. For additional support the Google Earth Engine forum is super helpful.

Exports from Google Drive should be saved in data/raw. This happens by default if the GDrive exporter is used.

Tests

The following tests can be run against the pipeline:

pytest  # unit tests, written in the test folder
black .  # code formatting
mypy src  # type checking

Reference

If you find this code useful, please cite the following paper:

Hannah Kerner, Gabriel Tseng, Inbal Becker-Reshef, Catherine Nakalembe, Brian Barker, Blake Munshell, Madhava Paliyam, and Mehdi Hosseini. 2020. Rapid Response Crop Maps in Data Sparse Regions. KDD ’20: ACMSIGKDD Conference on Knowledge Discovery and Data Mining Workshops, August 22–27, 2020, San Diego, CA.

The hand-labeled training and test data used in the above paper can be found at: https://doi.org/10.5281/zenodo.3836629

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
diagrams		diagrams
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
README.md		README.md
conftest.py		conftest.py
environment.yml		environment.yml
mypy.ini		mypy.ini
pytest.ini		pytest.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

diagrams

diagrams

scripts

scripts

src

src

tests

tests

.gitignore

.gitignore

README.md

README.md

conftest.py

conftest.py

environment.yml

environment.yml

mypy.ini

mypy.ini

pytest.ini

pytest.ini

Repository files navigation

Togo Crop Mask

Introduction

Pipeline

Setup

Earth Engine

Tests

Reference

About

Releases

Packages

Languages

nasaharvest/togo-crop-mask

Folders and files

Latest commit

History

Repository files navigation

Togo Crop Mask

Introduction

Pipeline

Setup

Earth Engine

Tests

Reference

About

Topics

Resources

Stars

Watchers

Forks

Languages