🌮 Trash Annotations in Context Dataset Toolkit
Alternatives To Taco
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Tensorflow Examples42,312
a year ago218otherJupyter Notebook
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
Pytorch Cyclegan And Pix2pix21,090
2 days ago519otherPython
Image-to-Image Translation in PyTorch
Datasets17,69297606 hours ago76November 16, 2023620apache-2.0Python
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Label Studio14,94138 hours ago182December 01, 2023753apache-2.0Python
Label Studio is a multi-type data labeling and annotation tool with standardized output format
First Order Model13,952
19 hours ago301mitJupyter Notebook
This repository contains the source code for the paper First Order Motion Model for Image Animation
Tensor2tensor13,7018286 months ago79June 17, 2020589apache-2.0Python
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
3 months ago7apache-2.0Python
pix2code: Generating Code from a Graphical User Interface Screenshot
Cvat10,51937 hours ago23November 27, 2023525mitTypeScript
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Fashion Mnist9,856
2 years ago24mitPython
A MNIST-like fashion product database. Benchmark :point_down:
3 years ago76otherLua
Image-to-image translation with conditional adversarial nets
Alternatives To Taco
Select To Compare

Alternative Project Comparisons

TACO is a growing image dataset of waste in the wild. It contains images of litter taken under diverse environments: woods, roads and beaches. These images are manually labeled and segmented according to a hierarchical taxonomy to train and evaluate object detection algorithms. Currently, images are hosted on Flickr and we have a server that is collecting more images and annotations @ tacodataset.org

For convenience, annotations are provided in COCO format. Check the metadata here: http://cocodataset.org/#format-data

TACO is still relatively small, but it is growing. Stay tuned!


For more details check our paper: https://arxiv.org/abs/2003.06975

If you use this dataset and API in a publication, please cite us using:  

    title={TACO: Trash Annotations in Context for Litter Detection},
    author={Pedro F Proença and Pedro Simões},
    journal={arXiv preprint arXiv:2003.06975},


December 20, 2019 - Added more 785 images and 2642 litter segmentations.
November 20, 2019 - TACO is officially open for new annotations: http://tacodataset.org/annotate

Getting started


To install the required python packages simply type

pip3 install -r requirements.txt

Additionaly, to use demo.pynb, you will also need coco python api. You can get this using

pip3 install git+https://github.com/philferriere/cocoapi.git#subdirectory=PythonAPI


To download the dataset images simply issue

python3 download.py

Alternatively, download from DOI

Our API contains a jupyter notebook demo.pynb to inspect the dataset and visualize annotations.

Unlabeled data

A list of URLs for both unlabeled and labeled images is now also provided in data/all_image_urls.csv. Each image contains one URL for each original image (second column) and one URL for a VGA-resized version (first column) for images hosted by Flickr. If you decide to annotate these images using other tools, please make them public and contact us so we can keep track.

Unofficial data

Annotations submitted via our website are added weekly to data/annotations_unofficial.json. These have not yet been been reviewed by us -- some may be inaccurate or have poor segmentations. You can use the same command to download the respective images:

python3 download.py --dataset_path ./data/annotations_unofficial.json

Trash Detection

The implementation of Mask R-CNN by Matterport is included in /detector with a few modifications. Requirements are the same. Before using this, the dataset needs to be split. You can either donwload our weights and splits or generate these from scratch using the split_dataset.py script to generate N random train, val, test subsets. For example, run this inside the directory detector:

python3 split_dataset.py --dataset_dir ../data

For further usage instructions, check detector/detector.py.

As you can see here, most of the original classes of TACO have very few annotations, therefore these must be either left out or merged together. Depending on the problem, detector/taco_config contains several class maps to target classes, which maintain the most dominant classes, e.g., Can, Bottles and Plastic bags. Feel free to make your own classes.

Popular Dataset Projects
Popular Deep Learning Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Jupyter Notebook
Deep Learning
Object Detection