Awesome Open Source
Awesome Open Source

Voicebook

Tweet

This is an assortment of all the scripts associated with the Intro to Voice Computing Book. These scripts should give you a jumpstart in learning how to write Python code for voice-related applications.

^^ Note: If you find this code useful, please leave a star :) ^^

Voicebook video

Getting started: setting up environment (mac)

Watch the overview video for this book here.

Note that these are the instructions for Mac computers; you may need some custom setup for FFmpeg and/or SoX if you're using a Windows and/or Linux computer.

First, clone the repository and submodules:

git clone --recurse-submodules -j8 https://github.com/jim-schwoebel/voicebook

Now you need to run the setup.py script in a virtual environment to make sure you have all the required dependencies for all the chapters of the book. To do this, run:

cd voicebook
pip3 install virtualenv
virtualenv env 
source env/bin/activate
python3 setup.py

Now you have all the dependencies necessary to follow along with the chapters in the book. You don’t need to worry about any other installations.

License

This repository is licensed under the Apache 2.0 License.

Feedback

Any feedback on the book or this repository is greatly appreciated.

  • If you find something that is missing or doesn't work, please consider opening a GitHub issue.
  • If you'd like to be mentored by someone on our team, check out the Innovation Fellows Program.
  • If you are looking for a framework to start building machine learning models in voice computing, check out Allie.
  • If you want to talk to me directly, please send me an email @ [email protected].

Citation

Please use the following citation when citing this book in your research work:

References

Check out the wiki or documentation below to follow along with each chapter in the book. In this way, you can get quickly up to speed with the 200+ scripts included in this repository.


Get A Weekly Email With Trending Projects For These Topics
No Spam. Unsubscribe easily at any time.
python (55,261
machine-learning (3,713
security (1,934
python3 (1,670
visualization (841
server (544
data (405
voice (101
generation (41
voice-assistant (40
voice-recognition (35
voice-control (27
data-cleaning (25