Repository for the book Introduction to Artificial Neural Networks and Deep Learning: A Practical Guide with Applications in Python.
Deep learning is not just the talk of the town among tech folks. Deep learning allows us to tackle complex problems, training artificial neural networks to recognize complex patterns for image and speech recognition. In this book, we'll continue where we left off in Python Machine Learning and implement deep learning algorithms in PyTorch.
This repository will contain the instructions, code examples, and solutions for the Hands-On and Exercise portions of each chapter.
PDF and ebook versions of the book will be available from Leanpub.
Paperback: est. 2018
01 - Introduction
02 - The Perceptron
03 - Optimizing Cost Functions with Gradient Descent
04 - Logistic Regression and Softmax Regression
05 - From Softmax Regression to Multilayer Perceptrons
06 - Cross Validation and Performance Metrics
07 - Regularization in Neural Networks
08 - Learning Rates and Weight Initialization
09 - Convolutional Neural Networks
10 - Recurrent Neural Networks
11 - Autoencoders
12 - General Adverserial Neural Networks
13 - Deep Generative Models
14 - Reinforcement Learning
Appendix A: Mathematical Notation [PDF]
Appendix B: Algebra Basics [PDF]
Appendix C: Linear Algebra Essentials
Appendix D: Calculus and Differentiation Primer [PDF]
Appendix E: Probability Theory Overview
Appendix F: Notational Conventions Reference
Appendix G: Python Setup
Appendix I: PyTorch Basics
Appendix J: Cloud Computing [PDF]
Machine learning has become a central part of our life — as consumers, customers, and hopefully as researchers and practitioners! I appreciate all the nice feedback that you sent me about Python Machine Learning, and I am so happy to hear that you found it so useful as a learning guide, helping you with your business applications and research projects. I have received many emails since its release. Also, in these very emails, you were asking me about a possible prequel or sequel.
Initially, I was inclined to write more about the "math" parts, which can be a real hurdle for almost everyone without (or even with) a math major in college. Initially, I thought that writing a book about "machine learning math" was a cool thing to do. Now, I have ~15 chapters worth of notes about pre-calculus, calculus, linear algebra, statistics, and probability theory. However, I eventually came to a conclusion that there were too many other math books out there, already! Most of them are far better and more comprehensive and accurate than my potential ~500-page introduction to the topics that I had in store. After all, I think that the real motivation for learning and understanding a subject comes from being excited about it in the first place; if you are passionate about machine learning and you stumble upon the chain rule in calculus, you wouldn't have any problems to find a trusted resource via your favorite search engine these days.
So, instead of writing that "prequel," let me write about something that's built upon the concepts that I introduced in the later chapters of Python Machine Learning -- algorithms for deep learning. After we coded a multi-layer perceptron (a certain kind of feedforward artificial neural network) from scratch, we took a brief look at some Python libraries for implementing deep learning algorithms, and I introduced convolutional and recurrent neural networks on a conceptual level.
In this book, I want to continue where I left off and want to implement deep neural networks and algorithms for deep learning algorithms from scratch, using Python, NumPy, and SciPy throughout this educational journey. In addition to the vanilla Python science-stack, we will implement these algorithms in TensorFlow, highly performant yet very accessible deep learning library for implementing and applying deep learning to real-world problems.
All code in this repository (including the code examples in Jupyter Notebooks) is open source content, released under the MIT software license. In short, the permissive MIT license allows you to do anything with the code with proper attribution and without warranty; please check the MIT LICENSE notice for further details.
All non-code content and creative work in this repository, including text and graphics, is under exclusive copyright by the author, Sebastian Raschka. Unless noted otherwise, text content shared in this repository is intended for personal use only. You may use, modify, or share short text passages of this work with proper attribution to the author. However, if you are planning to modify and/or share substantial portions of this book for other writings, such as blog posts, magazine article, or teaching material, contact the author for permission.
Figures and graphics marked by a Creative Commons Attribution-ShareAlike 4.0 International are free to share under the respective license terms (as listed in the Creative Commons Attribution-ShareAlike 4.0 International section in the LICENSE file) and proper attribution.
I would like to give my special thanks to the readers, who caught various typos and errors and offered suggestions for clarifying my writing.