FAQ: Resource Planning
This topic lists issues that you may encounter while planning your resources on Zilliz Cloud, along with the corresponding solutions.
Contents
- What is a Compute Unit (CU)?
- How can I avoid expenses on unused clusters?
- How many query CUs do I need for a given collection?
- Which type of cluster should I pick?
- What's the difference between Performance-optimized CU and Capacity-optimized CU?
FAQs
What is a Compute Unit (CU)?
A compute unit (CU) is a group of hardware resources that serves your indexes and search requests. You can think of a CU as a fully managed physical node on which a search service is deployed.
For more details, see Select the Right CU.
How can I avoid expenses on unused clusters?
We recommend suspending unused clusters to save computing costs. You can resume them later when necessary.
How many query CUs do I need for a given collection?
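The answer depends on the cluster type and the size of your collection. As a rough guide, a single query CU of each type holds:
- Performance-optimized: Supports up to 1.5 million 768-dimensional vectors.
- Capacity-optimized: Supports up to 5 million 768-dimensional vectors.
- Tiered-storage: Supports up to 20 million 768-dimensional vectors.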
These estimates are based on vectors with primary keys only. Additional scalar fields like IDs or labels may reduce capacity. We recommend conducting your own tests for accurate assessment.
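The figures above can be turned into a quick back-of-the-envelope estimate. The following is a minimal sketch, not an official sizing tool: it assumes each figure is the capacity of a single query CU, that your vectors are 768-dimensional with primary keys only, and that the 8-CU minimum for Tiered-storage clusters applies. The function and constant names are hypothetical.

```python
# Assumed per-query-CU capacity for 768-dimensional vectors (primary keys only),
# copied from the estimates listed in this FAQ.
VECTORS_PER_QUERY_CU = {
    "performance-optimized": 1_500_000,
    "capacity-optimized": 5_000_000,
    "tiered-storage": 20_000_000,
}

# Tiered-storage clusters need at least 8 query CUs, as stated in this FAQ.
MIN_QUERY_CUS = {"tiered-storage": 8}


def estimate_query_cus(num_vectors: int, cluster_type: str) -> int:
    """Return a rough lower bound on the number of query CUs.

    This is only a starting point: scalar fields and other dimensions
    change the real capacity, so validate with your own tests.
    """
    if num_vectors < 0:
        raise ValueError("num_vectors must be non-negative")
    per_cu = VECTORS_PER_QUERY_CU[cluster_type]
    # Integer ceiling division, with a floor of one CU.
    needed = max(1, -(-num_vectors // per_cu))
    return max(needed, MIN_QUERY_CUS.get(cluster_type, 1))


if __name__ == "__main__":
    for cluster_type in VECTORS_PER_QUERY_CU:
        print(cluster_type, estimate_query_cus(12_000_000, cluster_type))
```

For a collection of 12 million such vectors, this sketch suggests 8 Performance-optimized, 3 Capacity-optimized, or 8 Tiered-storage query CUs (the last one due to the minimum). Treat the result as a lower bound and confirm it with your own tests.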
Which type of cluster should I pick?
Select the Performance-optimized cluster if you need instant search results and high concurrent traffic for real-time applications. Choose the Capacity-optimized cluster if you need to handle large vector datasets while maintaining reliable search speeds. Opt for the Tiered-storage cluster if you need to handle ultra-large-scale, cost-sensitive workloads with clear hot and cold data patterns. A Tiered-storage cluster must have at least 8 query CUs.
What's the difference between Performance-optimized CU and Capacity-optimized CU?
The "Performance-optimized CU" suits low-latency or high-throughput similarity searches. This option works best for scenarios that demand high search performance.
The "Capacity-optimized CU" suits data volumes that are five times larger than what the Performance-optimized CU option supports. This option works best for scenarios that demand increased storage capacity.
For more details, see Select the Right CU.