
On-Demand Compute Cost
Public Preview

On-demand compute in Zilliz Cloud follows a usage-based billing model. You are charged for query compute and index-building compute consumed by your workloads.

The total on-demand compute cost is the sum of the following components:

  • Query CU cost

  • Indexing CU cost

Query CU cost

Query CU cost measures the compute resources consumed by your on-demand clusters.

Cost calculation

Query CU Cost = Query CU Unit Price × Number of Query CU × Active Runtime
  • Query CU Unit Price: Determined by your cloud region and project plan. For detailed rates, refer to Zilliz Cloud Pricing Guide.

  • Number of Query CU: The number of query CUs configured for the on-demand cluster.

  • Active Runtime: The billable time during which the compute resources of an on-demand cluster are in use.

    • Billing starts when the on-demand cluster is in the Running status.

    • Billing stops when the on-demand cluster is auto-suspended (in the Suspending or Suspended status) due to inactivity.

    • The minimum billing unit is 1 minute. Any usage shorter than 1 minute is billed as 1 minute.
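The formula and billing rules above can be sketched in a few lines of Python. This is an illustrative estimate only, not an official calculator. It rests on two assumptions that the text above does not state: the unit price is quoted per CU per hour, and runtimes longer than 1 minute are rounded up to the next whole minute. The function names and the price used in the usage note are hypothetical; take real rates from the Zilliz Cloud Pricing Guide.

```python
import math


def billable_minutes(runtime_seconds: float) -> int:
    """Convert an active runtime to billable minutes.

    Usage shorter than 1 minute is billed as 1 minute (from the docs).
    Assumption: longer runtimes are rounded up to the next whole minute.
    Assumption: a cluster that never ran (0 seconds) is not billed.
    """
    if runtime_seconds <= 0:
        return 0
    return max(1, math.ceil(runtime_seconds / 60))


def query_cu_cost(unit_price_per_cu_hour: float,
                  num_query_cu: int,
                  active_runtime_seconds: float) -> float:
    """Query CU Cost = Query CU Unit Price x Number of Query CU x Active Runtime.

    Assumption: the unit price is expressed per CU per hour, so the
    billable minutes are divided by 60 before multiplying.
    """
    minutes = billable_minutes(active_runtime_seconds)
    return unit_price_per_cu_hour * num_query_cu * minutes / 60
```

For example, `query_cu_cost(0.6, 4, 45)` treats the 45 seconds of activity as one billable minute, while `query_cu_cost(0.6, 4, 3600)` bills a full hour for four query CUs. The price `0.6` is a placeholder, not an actual rate.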

Indexing CU cost

Indexing CU cost measures compute resources consumed when you build indexes for data in both managed and external collections in on-demand compute.

Sources of indexing CU cost

You incur Indexing CU cost in the following scenarios:

  • Initial CreateIndex builds for data in both managed and external collections

  • Incremental index builds triggered by Refresh

Cost calculation

Indexing CU Cost = Indexing CU Unit Price × Number of Indexing CU × Time
  • Indexing CU Unit Price: Determined by your cloud region and project plan. For detailed rates, refer to Zilliz Cloud Pricing Guide.

  • Number of Indexing CU: The system automatically allocates the most appropriate amount of indexing CUs. You cannot specify the number of indexing CUs to use.

  • Time: The time it takes to complete the index building job. Note that only the job execution time is counted. Queue waiting time and failed jobs are not billed. The minimum billing unit is 1 minute. Any usage shorter than 1 minute is billed as 1 minute.
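As a rough illustration of the rules above, the following Python sketch estimates the indexing CU cost of a set of jobs. It is not an official calculator. The `IndexJob` record and both function names are hypothetical, and the sketch assumes that the unit price is quoted per CU per hour and that execution times longer than 1 minute are rounded up to the next whole minute. The number of indexing CUs is an input here only because the system allocates it automatically and reports it after the fact.

```python
import math
from dataclasses import dataclass
from typing import Iterable


@dataclass
class IndexJob:
    """Hypothetical record of one index-building job (not a Zilliz Cloud API type)."""
    indexing_cu: int          # CUs the system allocated to the job
    execution_seconds: float  # job execution time only; queue waiting time excluded
    succeeded: bool           # failed jobs are not billed


def indexing_job_cost(unit_price_per_cu_hour: float, job: IndexJob) -> float:
    """Indexing CU Cost = Indexing CU Unit Price x Number of Indexing CU x Time."""
    if not job.succeeded or job.execution_seconds <= 0:
        return 0.0
    # Usage shorter than 1 minute is billed as 1 minute (from the docs).
    # Assumption: longer runs are rounded up to the next whole minute.
    minutes = max(1, math.ceil(job.execution_seconds / 60))
    # Assumption: the unit price is per CU per hour.
    return unit_price_per_cu_hour * job.indexing_cu * minutes / 60


def total_indexing_cost(unit_price_per_cu_hour: float,
                        jobs: Iterable[IndexJob]) -> float:
    """Sum the cost of every billable job."""
    return sum(indexing_job_cost(unit_price_per_cu_hour, job) for job in jobs)
```

For example, with a placeholder price of `0.3`, a successful 30-second job on 2 indexing CUs is billed for one minute, a successful 125-second job is billed for three minutes under the rounding assumption, and a failed job contributes nothing to the total.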

📘**Note**

On the Usage and Invoice pages, indexing CU costs are shown as totals by database rather than by individual job.