Deploy BYOC-I on AWS
This page explains how to create a Bring-Your-Own-Cloud (BYOC) data plane with a BYOC agent in your AWS Virtual Private Cloud (VPC).
Zilliz BYOC is currently available in General Availability. For access and implementation details, please contact Zilliz Cloud support.
Prerequisites
Ensure that
- You are the owner of a BYOC-I organization.
Procedures
Step 1: Prepare the deployment environment
A deployment environment is a local machine, a virtual machine (VM), or a CI/CD pipeline configured to run the Terraform configuration files and deploy the data plane of your BYOC-I project. In this step, you need to
-
Configure AWS credentials (AWS profile or access key).
For details on how to configure AWS credentials, refer to this document.
-
Install the latest Terraform binary.
For details on how to install Terraform, refer to this document.
Step 2: Create a project
Within your BYOC-I organization, click the Create Project and Deploy Data Plane button to start the deployment.
Step 3: Set up the general settings
In General Settings, you need to set the project name and determine the cloud providers and regions where Zilliz Cloud deploys the data plane for the project.
-
Set Project Name.
-
Select Cloud Provider and Region.
-
(Optional) Configure Instance Settings.
In a BYOC project, the search service, fundamental database components, and core support services use different instances. You can set instance types for these services and components.
For details, see Instance Settings.
-
Determine whether to enable AWS PrivateLink.
This option allows private connectivity to the clusters within the current project. If you enable this option, you must create a VPC Endpoint for private connectivity.
-
Click Create and Next.
Step 4: Deploy the data plane
Follow the steps displayed in the dialog to deploy the data plane for the currently created project.
For details on running the above Terraform scripts, refer to the Zilliz Cloud BYOC-I Project Setup Guide.
Manage projects
Projects with an Undeploy tag
If the status tag on the right corner of a project card reads Undeploy, you can always click the Deploy Data Plane button on the project card to reopen it. To rename or delete the project, click the ... button in the project card and select Rename or Delete from the drop-down menu.
Projects with a Deploying tag
Once you have prepared the deployment environment and executed the displayed commands, you must wait for the BYOC agent to activate. When the status tag on the project card reads Deploying and shows the progress percentage, you cannot rename or delete the project until the data plane is in place.
Projects with a Running tag
Once the status tag on a project card reads Running, you can start creating clusters in the project. To rename or delete a running project, ensure that there are no clusters in the project.
Instance settings
The data plane of a Zilliz BYOC project comprises three types of components: Search Services, Fundamental Database Components, and Core Support Services, which use different EC2 instances.
In the General settings, you need to determine the EC2 instance types for the three data plane components mentioned above. Additionally, you need to specify the number of EC2 instances for Core Support Services, which determines the maximum number of clusters that can be created within the project.
There are four predefined project size options, and they are described as follows:
Size | Maximum Cluster Quantity | Maximum Number of Entities | |
---|---|---|---|
Performance-optimized CU | Capacity-optimized CU | ||
Small | 3 clusters with 8 to 16 CUs | 10 Million - 25 Million | 40 Million - 80 Million |
Medium | 7 clusters with 16 to 64 CUs | 25 Million - 100 Million | 80 Million - 350 Million |
Large | 12 clusters with 64 to 192 CUs | 100 Million - 300 Million | 350 Million - 1 Billion |
X-Large | 17 clusters with 192 to 576 CUs | 300 Million - 900 Million | 1 Billion - 3 Billion |
You can also choose to customize the settings by selecting Custom in Initial Project Size and adjusting the EC2 instance types and counts for all data plane components. If your preferred EC2 instance types are not listed, please contact Zilliz support.