Awesome Open Source
Awesome Open Source

DOI Create a Slack Account with us Slack Status

Wrangling Genomics

Lesson for quality control and wrangling genomics data. This repository is maintained by Josh Herr, Ming Tang, and Fotis Psomopoulos.

Amazon public AMI for this tutorial is "dataCgen-qc".


Wrangling genomics trains novice learners on a variant calling workflow. Participants will learn how to evaluate sequence quality and what to do if it is not good. We will then cover aligning reads to a genome, and calling variants, as well as discussing different file formats. Results will be visualized. Finally, we will cover how to automate the process by building a shell script.

This lesson is part of the Data Carpentry Genomics Workshop.


Code of Conduct

All participants should agree to abide by the Data Carpentry Code of Conduct.


Wrangling genomics is authored and maintained by the community.


Please cite as:

Erin Alison Becker, Taylor Reiter, Fotis Psomopoulos, Sheldon John McKay, Jessica Elizabeth Mizzi, Jason Williams, Winni Kretzschmar. (2019, June). datacarpentry/wrangling-genomics: Data Carpentry: Genomics data wrangling and processing, June 2019 (Version v2019.06.1). Zenodo.

Alternative Project Comparisons
Related Awesome Lists
Top Programming Languages
Top Projects

Get A Weekly Email With Trending Projects For These Topics
No Spam. Unsubscribe easily at any time.
Python (795,011
Shell (169,403
Programming (18,142
English (6,857
Variants (5,177
Genomics (1,702
Carpentries (88
Data Carpentry (39