Skip to main content
Version: User Guides (Cloud)

Create Collection Instantly

You can create a collection instantly by setting its name and the vector field dimensionality. Zilliz Cloud automatically indexes the vector field and loads the collection upon creation. This page demonstrates how to create a collection instantly with default settings.

Overview

A collection is a two-dimensional table with fixed columns and variant rows. Each column represents a field, and each row represents an entity. A schema is required to implement such structural data management. Every entity to insert has to meet the constraints defined in the schema.

AIGC applications usually use vector databases as a knowledge base to manage the data generated during the interaction between users and Large Language Models (LLMs). Such knowledge bases are almost similar. To accelerate the use of Zilliz Cloud clusters in such scenarios, an instant method is available for you to create a collection with only two parameters, namely the collection name and the vector field dimensionality.

When you create a collection instantly with default settings, the following settings apply:

  • The primary and vector fields are added to the schema (id and vector).

  • The primary field accepts integers and disables AutoId.

  • The vector field accepts floating vector embeddings.

  • AUTOINDEX is used to create an index on the vector field.

  • COSINE is used to measure similarities between vector embeddings.

  • The reserves dynamic field named $meta is enabled to save non-schema-defined fields and their values in key-value pairs.

  • The collection is automatically loaded upon creation.

For details on the terminologies above, refer to Collection Explained.

It is worth noting that creating a collection instantly with default settings does not fit all scenarios. You are advised to familiarize yourself with the common collection creation procedure so that you can gain a better understanding of Zilliz Cloud's capabilities.

Quick Setup

In this manner, you can create a collection instantly with only the collection name and the vector field dimensionality.

from pymilvus import MilvusClient, DataType

CLUSTER_ENDPOINT = "YOUR_CLUSTER_ENDPOINT"
TOKEN = "YOUR_CLUSTER_TOKEN"

# 1. Set up a Milvus client
client = MilvusClient(
uri=CLUSTER_ENDPOINT,
token=TOKEN
)

# 2. Create a collection in quick setup mode
client.create_collection(
collection_name="quick_setup",
dimension=5
)

res = client.get_load_state(
collection_name="quick_setup"
)

print(res)

# Output
#
# {
# "state": "<LoadState: Loaded>"
# }

Quick Setup with Custom Fields

If the default metric type, field names, and data types does not meet your need, you can tune these settings as follows.

from pymilvus import MilvusClient, DataType

CLUSTER_ENDPOINT = "YOUR_CLUSTER_ENDPOINT"
TOKEN = "YOUR_CLUSTER_TOKEN"

# 1. Set up a Milvus client
client = MilvusClient(
uri=CLUSTER_ENDPOINT,
token=TOKEN
)

# 2. Create a collection in quick setup mode
client.create_collection(
collection_name="custom_quick_setup",
dimension=5,
primary_field_name="my_id",
id_type="string",
vector_field_name="my_vector",
metric_type="L2",
auto_id=True,
max_length=512
)

res = client.get_load_state(
collection_name="custom_quick_setup"
)

print(res)

# Output
#
# {
# "state": "<LoadState: Loaded>"
# }

If the collections created using the above two manners still cannot meet your needs, consider following the procedure in Create Collection.