Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Ntds_2016 | 91 | 3 years ago | mit | Jupyter Notebook | ||||||
Material for the EPFL master course "A Network Tour of Data Science", edition 2016. | ||||||||||
Ntds_2017 | 61 | 3 years ago | mit | Jupyter Notebook | ||||||
Material for the EPFL master course "A Network Tour of Data Science", edition 2017. | ||||||||||
Ntds_2018 | 61 | 3 years ago | other | Jupyter Notebook | ||||||
Material for the EPFL master course "A Network Tour of Data Science", edition 2018. | ||||||||||
Ntds_2019 | 45 | 3 years ago | other | Jupyter Notebook | ||||||
Material for the EPFL master course "A Network Tour of Data Science", edition 2019. | ||||||||||
Python_tour_of_data_science | 26 | 5 years ago | mit | Jupyter Notebook | ||||||
A Python Tour of Data Science | ||||||||||
Neptune Examples | 22 | a year ago | Jupyter Notebook | |||||||
Examples of using Neptune to keep track of your experiments (maintenance only). | ||||||||||
Tdf | 12 | 3 years ago | 2 | R | ||||||
🚴🏅📊Tour de France winners and stages data | ||||||||||
Pga Tour Data Science Project | 10 | 5 years ago | Jupyter Notebook | |||||||
Zava | 5 | 2 years ago | 3 | December 13, 2020 | apache-2.0 | Jupyter Notebook | ||||
Parallel coordinates with grand tour for exploratory data visualization of massive and high-dimensional data | ||||||||||
Beginners R | 5 | 4 years ago | 1 | R | ||||||
Resources for getting started with R at our R-Tours |
This repository contains the material for the practical work associated with the EPFL master course EE-558 A Network Tour of Data Science (moodle), taught in fall 2016. The course material revolves around the following topics: Data Science, Machine Learning, Deep Learning, Graph Signal Processing.
Theoretical knowledge is taught during lectures. Practical knowledge is taught through exercises. Both are practiced and evaluated through four assignments and a semester project.
Look at the 2017, 2018, and 2019 editions for a course more focused on graphs and networks (instead of deep learning).
There is two types of exercises.
The Data Scientist toolkit, a set of tools, mostly in Python, to help during the Data Science process.
Machine Learning (ML) & Graph Signal Processing (GSP) algorithms. These exercises are designed so as to familiarize yourself with the algorithms presented in class.
Part of the course is evaluated by a project, proposed and carried out by groups of one to three students. Below is the work of the 38 students enrolled that year.
The easiest way to play with the code is to run it inside a docker container, a lightweight virtualization method.
Install Docker on your Windows, Mac or Linux machine.
Run the image, which is automatically updated from this git repository.
docker pull mdeff/ntds_2016 # to update it
docker run --rm -i -p 8871:8888 -v ~/:/data/mount mdeff/ntds_2016
Access the container's Jupyter notebook at http://localhost:8871. There you'll find two folders:
repo
contains a copy of this git repository. Nothing you modify in this
folder is persistent. If you want to keep your modifications, use File
,
Download as
, Notebook
in the Jupyter interface.mount
contains a view of your home directory, from which you can
persistently modify any of your files.Windows and Mac users may need to redirect the port in VirtualBox.
If you want to use it for your projects and need additional software or Python packages, you'll need to install them into the container.
Create your named container.
docker run -i -p 8871:8888 -v ~/:/data/mount --name myproject mdeff/ntds_2016
Once you stop it, you'll be able to start it again with docker start myproject
.
In another terminal, install packages while the container is running.
docker exec -i myproject /bin/bash
pip install mypackage
apt-get install myotherpackage
Warning: this may be problematic for Windows users, as TensorFlow does not support Windows yet.
Install Python.
brew install python3
. You can also use Anaconda.Clone the course repository. You may need to first install git.
git clone https://github.com/mdeff/ntds_2016.git
cd ntds_2016
Optionally, create a virtual environment.
pyvenv /path/to/new/virtual/env
. /path/to/new/virtual/env/bin/activate
A virtual environment allows you to install a different set of packages for each of your Python project. Each project thus stays cleanly separated from each other. It is a good practice but by no means necessary. You can read more about virtual environments on this blog post. Anaconda users, see here.
Install the packages we'll use from PyPI, the Python Package Index.
pip install -r requirements.txt # or make install
If it fails, it is probably because you need to install some native packages with your package manager. Please read the error messages and remember, Google is your friend ! You may look at the dockerfile to get an idea of which setup is necessary on a Debian / Ubuntu system.
Depending on your installation, pip
may refer to Python 2 (you can
verify with pip -V
). In that case, use pip3
instead of pip
.
Anaconda users can also install packages with conda install packname
.
See here for your options.
Verify that you have a working installation by running a simple test.
Again, you may need to call python3
.
python check_install.py # or make test
WARNING (theano.configdefaults): g++ not detected!
, you may want to
install mingw-w64 with conda install mingw libpython
. Otherwise your Deep Learning models will run extremly slowly.
This may however not work for Python 3.5, see this GitHub
issue for a workaround.Open the jupyter web interface and play with the notebooks !
jupyter notebook
All codes and examples are released under the terms of the MIT License.