Skills Ml

Data Processing and Machine learning methods for the Open Skills Project
Alternatives To Skills Ml
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Dali4,290
10 hours ago263apache-2.0C++
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Texar2,008
23 years ago5November 19, 201932apache-2.0Python
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow
Dialogpt1,751
5 months ago59mitPython
Large-scale pretraining for dialogue
Data Science On Gcp1,157
3 months ago13apache-2.0Jupyter Notebook
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Texar Pytorch71111a year ago5April 14, 202236apache-2.0Python
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Haupt452
10 hours ago1October 04, 2022agpl-3.0Python
Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
Bytewax397
2 days ago24April 22, 202213apache-2.0Rust
Python Stream Processing
Godel328
3 months ago12mitPython
Large-scale pretrained models for goal-directed dialog
Nonechucks315
22 years ago18June 09, 202117mitPython
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Machine Learning Notebooks241
2 years ago1Jupyter Notebook
Machine Learning notebooks for refreshing concepts.
Alternatives To Skills Ml
Select To Compare


Alternative Project Comparisons
Readme

skill-ml

Build Status Code Coverage Updates Python 3 PyPI Code Climate

Open Skills Project - Machine Learning

This is the library for the methods usable by the Open Skills API, including processing algorithms and utilities for computing our jobs and skills taxonomy.

New to Skills-ML? Check out the Skills-ML Tour! It will get you started with the concepts. You can also check out the notebook version of the tour which you can run on your own.

Documentation

Hosted on Github Pages

Quick Start

1. Virtualenv

skills-ml depends on python3.6, so create a virtual environment using a python3.6 executable.

virtualenv venv -p /usr/bin/python3.6

Activate your virtualenv

source venv/bin/activate

2. Installation

pip install skills-ml

3. Import skills_ml

import skills_ml
  • There are a couple of examples of specific uses of components to perform specific tasks in examples.
  • Check out the descriptions of different algorithm types in algorithms/ and look at any individual directories that match what you'd like to do (e.g. skill extraction, job title normalization)
  • skills-airflow is the open-source production system that uses skills-ml algorithms in an Airflow pipeline to generate open datasets

Building the Documentation

skills-ml uses a forked version of pydocmd, and a custom script to keep the pydocmd config file up to date. Here's how to keep the docs updated before you push:

$ cd docs $ PYTHONPATH="../" python update_docs.py # this will update docs/pydocmd.yml with the package/module structure and export the Skills-ML Tour notebook to the documentation directory $ pydocmd serve # will serve local documentation that you can check in your browser $ pydocmd gh-deploy # will update the gh-pages branch

Structure

  • algorithms/ - Core algorithmic module. Each submodule is meant to contain a different type of component, such as a job title normalizer or a skill tagger, with a common interface so different pipelines can try out different versions of the components.
  • datasets/ - Wrappers for interfacing with different datasets, such as ONET, Urbanized Area.
  • evaluation/ - Code for testing different components against each other.

Contributors

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Popular Machine Learning Projects
Popular Data Processing Projects
Popular Machine Learning Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Jupyter Notebook
Ml
Algorithms
Dataset
Tour
Airflow
Data Processing