Nos

Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas - Effortless optimization at its finest!
Alternatives To Nos
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Serve3,768152 days ago22October 12, 2023339apache-2.0Java
Serve, optimize and scale PyTorch models in production
Nos527
8 days ago18apache-2.0Go
Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas - Effortless optimization at its finest!
Kube Reqsizer182
3 months ago13Go
A Kubernetes controller for automatically optimizing pod requests based on their continuous usage. VPA alternative that can work with HPA.
Elearning73
10 months agootherHTML
elearning linux/mac/db/cache/server/tools/人工智能
Gke Pod Usage36
4 years agootherPython
pod_usage queries a k8s cluster and compares pod usage versus the pod requests and limits. It provides output that can then be analyzed to determine if optimizations can be made, and helps you find out if your oversubscribed or undersubcribed.
Experiments27
a year ago1May 15, 201810apache-2.0Python
Experiments API for Experiment Tracking on Kubernetes
Container Optimization Data Forwarder5
2 months ago1apache-2.0Go
Openflow_analysis2
2 years agomitPython
utils for analysis and optimization of config files/rule-sets from the cloud/OF environment
Alternatives To Nos
Select To Compare


Alternative Project Comparisons
Readme

Nebuly Operating System (nos)


Documentation: docs.nebuly.com/nos/overview

If you like the project please support it by leaving a star


nos is the open-source module to efficiently run AI workloads on Kubernetes, increasing GPU utilization, cutting down infrastructure costs and improving workloads performance.

Currently, the available features are:

  • Dynamic GPU partitioning: allow to schedule Pods requesting fractions of GPU. GPU partitioning is performed automatically in real-time based on the Pods pending and running in the cluster, so that Pods can request only the resources that are strictly necessary and GPUs are always fully utilized.

  • Elastic Resource Quota management: increase the number of Pods running on the cluster by allowing namespaces to borrow quotas of reserved resources from other namespaces as long as they are not using them.

Getting started

Prerequisites

Installation

You can install nos using Helm 3 (recommended). You can find all the available configuration values in the Chart documentation.

helm install oci://ghcr.io/nebuly-ai/helm-charts/nos \
  --version 0.1.2 \
  --namespace nebuly-nos \
  --generate-name \
  --create-namespace

Alternatively, you can use Kustomize by cloning the repository and running make deploy.


Join the community | Contribute

Popular Optimization Projects
Popular Kubernetes Projects
Popular Software Performance Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Go
Kubernetes
Gpu
Optimization