Version: User Guides (BYOC)

Scale Cluster

As data grows, you may face constraints that impact data writing. For example, read operations remain functional, but inserting or upserting new data might fail when the cluster reaches its maximum capacity.

To address such issues, you can adjust the number of CUs to match fluctuations in workload or storage requirements. Scale up when CPU or memory usage increases to improve your cluster's performance, and scale down during periods of low demand to reduce costs.

This guide outlines the procedures for scaling a cluster.

Manual scaling

You can scale a cluster manually either in the Zilliz Cloud web console or by making an API request. This guide focuses on the web console. For more information about using the RESTful API, refer to Modify Cluster.

🚧Warning

Scaling may cause slight service jitter. Please exercise caution.

Scale up a cluster

In the Scale Cluster dialog box, you can increase the CU size allocated to the cluster. The scaled cluster keeps the same cluster type and stays in the same cloud region as the original one.

  • For Dedicated (Standard) clusters, you can scale up to a maximum of 32 CUs.

  • For Dedicated (Enterprise) clusters, you can scale up to a maximum of 256 CUs.

If you require a larger CU size, please create a support ticket.

📘Notes

The cluster CU size × replica count must not exceed 256. Otherwise, cluster scaling may fail.
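
The limits above can be checked before you submit a scaling request. The following is a minimal sketch, not part of any Zilliz Cloud SDK or API: the function name `check_scale_up` and the plan keys `"standard"` and `"enterprise"` are hypothetical, and the numeric limits are simply the ones stated in this section.

```python
# Hypothetical pre-check for a scale-up request; limits are copied from this guide.
MAX_CU_BY_PLAN = {"standard": 32, "enterprise": 256}  # illustrative plan keys
MAX_CU_TIMES_REPLICAS = 256


def check_scale_up(plan: str, target_cu: int, replica_count: int) -> list[str]:
    """Return the reasons a scale-up may fail; an empty list means no limit is exceeded."""
    problems = []
    plan_limit = MAX_CU_BY_PLAN[plan]
    if target_cu > plan_limit:
        problems.append(
            f"{target_cu} CUs exceeds the {plan_limit}-CU maximum for this plan"
        )
    if target_cu * replica_count > MAX_CU_TIMES_REPLICAS:
        problems.append(
            f"CU size x replica count = {target_cu * replica_count}, "
            f"which exceeds {MAX_CU_TIMES_REPLICAS}"
        )
    return problems


if __name__ == "__main__":
    # 128 CUs with 4 replicas stays within the Enterprise maximum,
    # but 128 x 4 = 512 breaks the CU size x replica count limit.
    for reason in check_scale_up("enterprise", 128, 4):
        print(reason)
```

An empty result only means that the limits documented here are respected; the console may still apply other checks.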

Scale down a cluster

In the Scale Cluster dialog box, select the desired CU size. Once you click Scale, Zilliz Cloud checks the cluster's data volume and number of collections. Scaling down is triggered only when both of the following conditions are met:

  • Current data volume < 80% of the CU capacity of the new CU size.

  • Current number of collections < the maximum number of collections allowed in the new CU size.

The time required to complete the process depends on the data volume in your cluster.

📘Notes

To scale down the cluster CU size to fewer than 8 CUs, ensure that there are no replicas in the cluster.
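
The scale-down rules in this section can be expressed as a small eligibility check. This is a sketch under stated assumptions rather than the check Zilliz Cloud actually runs: the function name and all parameter names are hypothetical, and the capacity and collection limit of the target CU size are passed in by the caller because this guide does not list them.

```python
# Hypothetical eligibility check mirroring the scale-down rules in this guide.
def can_scale_down(
    data_volume: float,            # current data volume
    collection_count: int,         # current number of collections
    new_cu_size: int,              # target CU size
    new_cu_capacity: float,        # capacity of the target size, same unit as data_volume (assumed input)
    new_cu_max_collections: int,   # collection limit of the target size (assumed input)
    has_replicas: bool,            # whether the cluster currently has replicas
) -> bool:
    # Condition 1: data volume must be below 80% of the new capacity.
    volume_ok = data_volume < 0.8 * new_cu_capacity
    # Condition 2: collection count must be below the new limit.
    collections_ok = collection_count < new_cu_max_collections
    # Note: sizes below 8 CUs require a cluster without replicas.
    replicas_ok = new_cu_size >= 8 or not has_replicas
    return volume_ok and collections_ok and replicas_ok


if __name__ == "__main__":
    # Illustrative numbers only; they are not real CU capacities.
    print(can_scale_down(50.0, 10, 4, 100.0, 64, has_replicas=False))
```

Both comparisons are strict, matching the "<" in the conditions above, so a cluster sitting exactly at a limit does not qualify.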

Increase QPS

To boost QPS and query throughput, consider adding replicas. For more information, refer to Manage Replica.