Scaling

Koyeb offers flexible scaling options to help you manage the underlying infrastructure supporting your application's resources.

You can configure your Services to scale automatically based on resource and traffic targets. Alternatively, you can opt for fixed horizontal scaling to maintain a set number of Service Instances.

Autoscaling

Autoscaling allows you to dynamically adjust the number of Service Instances based on configurable resource and traffic targets. Service metrics are continuously monitored to scale the Service up or down to meet the demand.

You can learn more in the Autoscaling section.

Fixed horizontal scaling

Fixed horizontal scaling allows you to define a specific number of Instances for your Service. Each healthy Instance will receive a proportional share of the incoming requests for the region.

This approach is useful when you want to ensure a specific number of Service Instances are always running. It also enhances availability and resilience by distributing the load and accommodating events like hardware or Service issues.

If you'd like your Service to automatically scale the number of Instances according to configurable resource and traffic thresholds, take a look at the autoscaling documentation.

Scale-to-zero

Scale-to-zero enables you to automatically scale your Service Instances down to zero when there is no demand. Like autoscaling, Service metrics are monitored to determine when your service should scale down to zero.

You can learn more in the scale-to-zero section.

Regions Service Mesh & Discovery