A tool for building feature stores. Transform your raw data into beautiful features.
Made with ❤️ by the MLOps team from QuintoAndar
This library supports Python version 3.7+ and meant to provide tools for building ETL pipelines for Feature Stores using Apache Spark.
The library is centered on the following concetps:
To understand the main concepts of Feature Store modeling and library main features you can check Butterfree's Documentation, which is hosted by Read the Docs.
To learn how to use Butterfree in practice, see Butterfree's notebook examples
Butterfree depends on Python 3.7+ and it is Spark 3.0 ready ✔️
Python Package Index hosts reference to a pip-installable module of this library, using it is as straightforward as including it on your project's requirements.
pip install butterfree
Or after listing
butterfree in your
pip install -r requirements.txt
Dev Package are available for testing using the .devN versions of the Butterfree on PyPi.
All contributions are welcome! Feel free to open Pull Requests. Check the development and contributing guidelines described here.