メインコンテンツまでスキップ

Create Pipeline (V1)
About to Deprecate

This creates an pipeline of the Ingestion, Search, and Deletion types.

POST
/v1/pipelines
Base URL

The base URL for this API is in the following format:

https://controller.api.${CLOUD_REGION}.zillizcloud.com

📘Notes
export CLOUD_REGION="gcp-us-west1"
export BASE_URL="https://controller.api.${CLOUD_REGION}.zillizcloud.com"
Parameters
Authorizationstringheaderrequired

The authentication token should be an API key with appropriate privileges.

Example Value: Bearer {{TOKEN}}
Request Bodyapplication/json
namestringrequired

Name of the pipeline to create. The pipeline name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

descriptionstring

Description of the pipeline to create.

typestringrequired

Type of the pipeline to create. For an ingestion pipeline, the value should be INGESTION. For more information, refer to Understanding pipelines.

functionsarrayrequired

Functions to add in the ingestion pipeline to create. An ingestion pipeline must have one and only one INDEX function and can have 0-50 PRESERVE functions.

[]functionsoneOf

A function to add. You have the following options.

A function to add.

namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For a text ingestion pipeline, the value should be INDEX_TEXT or PRESERVE.

embeddingstringrequired

Name of the embedding model used to convert the source data into vector embeddings. For possible values, refer to Which embedding model does Zilliz Cloud Pipelines use?.

languagestringrequired

Language of your documents.

A function to add.

namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For a doc ingestion pipeline, the value should be INDEX_DOC or PRESERVE.

embeddingstringrequired

Name of the embedding model used to convert the source data into vector embeddings. For possible values, refer to Which embedding model does Zilliz Cloud Pipelines use?.

languagestringrequired

Language of your documents.

chunkSizeinteger

The maximum size of a splitted doc segment. For more information about the supported chunk size range of each embedding model, please refer to Zilliz Cloud Limits.

Value Range: ≥ 20≤ 12000
splitByarray

The splitters for Zilliz Cloud to split the specified document into smaller chunks. The value defaults to ["\n\n", "\n", " ", ""].

[]splitBystring

A splitter.

A function to add.

namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For an image ingestion pipeline, the value should be INDEX_IMAGE or PRESERVE.

embeddingstringrequired

Name of the embedding model used to convert the source data into vector embeddings. For possible values, refer to Which embedding model does Zilliz Cloud Pipelines use?.

A function to add.

namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For an ingestion pipeline that preserves metadata, the value should be PRESERVE.

inputFieldstringrequired

Name the input field according to your needs. In a preserve function of an ingestion pipeline, Zilliz Cloud uses the value as the name of a field in the collection to create.

outputFieldstringrequired

Name of the output field. The value should be the same as that of input_field.

fieldTypestringrequired

Data type of the field to create in the target collection.

clusterIdstringrequired

ID of the target cluster to which the pipeline applies.

collectionNamestringrequired

Name of the target collection to which the pipeline applies. Zilliz Cloud checks if the collection with the specified name exists. If the collection exists, Zilliz Cloud creates a pipeline for that collection. Otherwise, Zilliz Cloud creates a new collection with the specified name.

projectIdstringrequired

ID of the project to which the target cluster belongs.

namestringrequired

Name of the pipeline to create. The pipeline name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

descriptionstring

Description of the pipeline to create.

typestringrequired

Type of the pipeline to create. For a search pipeline, the value should be SEARCH. For more information, refer to Understanding pipelines.

functionsarrayrequired

Functions to add in the search pipeline to create. A search pipeline can only have one function.

[]functionsoneOf
(To be added soon)

A function to add.

namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For a text search pipeline, the value should be SEARCH_TEXT.

clusterIdstringrequired

ID of the target cluster to which the pipeline applies.

collectionNamestringrequired

Name of the target collection to which the pipeline applies.

embeddingstringrequired

An embedding model in use for vector search. It should be consistent with the embedding model chosen in its compatible collection. For more information, refer to Which embedding model does Zilliz Cloud Pipelines use?.

rerankerstring

A reranking model used to rank a set of candidate outputs to improve the quality of the search results. Currently, the only possible value is zilliz/bge-reranker-base. For more information, refer to Reranker.

A function to add.

namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For a doc search pipeline, the value should be SEARCH_DOC_CHUNK.

clusterIdstringrequired

ID of the target cluster to which the pipeline applies.

collectionNamestringrequired

Name of the target collection to which the pipeline applies.

embeddingstringrequired

An embedding model in use for vector search. It should be consistent with the embedding model chosen in its compatible collection. For more information, refer to Which embedding model does Zilliz Cloud Pipelines use?.

rerankerstring

A reranking model used to rank a set of candidate outputs to improve the quality of the search results. Currently, the only possible value is zilliz/bge-reranker-base. For more information, refer to Reranker.

A function to add.

namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For a reverse image search pipeline, the value should be SEARCH_IMAGE_BY_IMAGE.

clusterIdstringrequired

ID of the target cluster to which the pipeline applies.

collectionNamestringrequired

Name of the target collection to which the pipeline applies.

embeddingstringrequired

An embedding model in use for vector search. It should be consistent with the embedding model chosen in its compatible collection. For more information, refer to Which embedding model does Zilliz Cloud Pipelines use?.

rerankerstring

A reranking model used to rank a set of candidate outputs to improve the quality of the search results. Currently, the only possible value is zilliz/bge-reranker-base. For more information, refer to Reranker.

A function to add.

namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For an text-image search pipeline , the value should be SEARCH_IMAGE_BY_TEXT.

clusterIdstringrequired

ID of the target cluster to which the pipeline applies.

collectionNamestringrequired

Name of the target collection to which the pipeline applies.

embeddingstringrequired

An embedding model in use for vector search. It should be consistent with the embedding model chosen in its compatible collection. For a text-image search pipeline, the only possible value is zilliz/clip-vit-base-patch32-multilingual-v1.For more information, refer to Which embedding model does Zilliz Cloud Pipelines use?.

rerankerstring

A reranking model used to rank a set of candidate outputs to improve the quality of the search results. Currently, the only possible value is zilliz/bge-reranker-base. For more information, refer to Reranker.

projectIdstringrequired

ID of the project to which the target cluster belongs.

namestringrequired

Name of the pipeline to create. The pipeline name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

descriptionstringrequired

Description of the pipeline to create.

typestringrequired

Type of the pipeline to create. For a deletion pipeline, the value should be DELETION. For more information, refer to Understanding pipelines.

functionsarrayrequired

Functions to add in the deletion pipeline to create. A deletion pipeline can only have one function.

[]functionsoneOf
(To be added soon)
namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For a text deletion pipeline, the value should be PURGE_TEXT_INDEX or PURGE_BY_EXPRESSION.

namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For a doc deletion pipeline, the value should be PURGE_DOC_INDEX or PURGE_BY_EXPRESSION.

namestringrequired

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstringrequired

Type of the function to add. For an image deletion pipeline, the value should be PURGE_IMAGE_INDEX or PURGE_BY_EXPRESSION.

clusterIdstringrequired

ID of the target cluster to which the pipeline applies.

collectionNamestringrequired

Name of the target collection to which the pipeline applies.

projectIdstringrequired

ID of the project to which the target cluster belongs.

export TOKEN="YOUR_API_KEY"

curl --request POST \
--url "${BASE_URL}/v1/pipelines" \
--header "Authorization: Bearer ${TOKEN}" \
--header "Content-Type: application/json" \
-d '{
"name": "my_text_ingestion_pipeline",
"clusterId": "inxx-xxxxxxxxxxxxxxx",
"projectId": "proj-xxxx",
"collectionName": "my_collection",
"description": "A pipeline that generates text embeddings and stores additional fields.",
"type": "INGESTION",
"functions": [
{
"name": "index_my_text",
"action": "INDEX_TEXT",
"language": "ENGLISH",
"embedding": "zilliz/bge-base-en-v1.5"
},
{
"name": "keep_text_info",
"action": "PRESERVE",
"inputField": "source",
"outputField": "source",
"fieldType": "VarChar"
}
]
}'
export TOKEN="YOUR_API_KEY"

curl --request POST \
--url "${BASE_URL}/v1/pipelines" \
--header "Authorization: Bearer ${TOKEN}" \
--header "Content-Type: application/json" \
-d '{
"projectId": "proj-xxxx",
"name": "my_doc_ingestion_pipeline",
"description": "A pipeline that splits a doc file into chunks and generates embeddings. It also stores the publish_year with each chunk.",
"type": "INGESTION",
"functions": [
{
"name": "index_my_doc",
"action": "INDEX_DOC",
"language": "ENGLISH",
"chunkSize": 500,
"embedding": "zilliz/bge-base-en-v1.5",
"splitBy": [
"\n\n",
"\n",
" ",
""
]
},
{
"name": "keep_doc_info",
"action": "PRESERVE",
"inputField": "publish_year",
"outputField": "publish_year",
"fieldType": "Int16"
}
],
"clusterId": "inxx-xxxxxxxxxxxxxxx",
"newCollectionName": "my_collection"
}'
export TOKEN="YOUR_API_KEY"

curl --request POST \
--url "${BASE_URL}/v1/pipelines" \
--header "Authorization: Bearer ${TOKEN}" \
--header "Content-Type: application/json" \
-d '{
"name": "my_image_ingestion_pipeline",
"clusterId": "inxx-xxxxxxxxxxxxxxx",
"projectId": "proj-xxxx",
"collectionName": "my_collection",
"description": "A pipeline that converts an image into vector embeddings and store in efficient index for search.",
"type": "INGESTION",
"functions": [
{
"name": "index_my_image",
"action": "INDEX_IMAGE",
"embedding": "zilliz/vit-base-patch16-224"
},
{
"name": "keep_image_tag",
"action": "PRESERVE",
"inputField": "image_title",
"outputField": "image_title",
"fieldType": "VarChar"
}
]
}'
Responses200 - application/json
codeinteger

Indicates whether the request succeeds.

dataobject

The information of the ingestion pipeline just created.

pipelineIdinteger

ID of the pipeline.

namestring

Name of the pipeline.

typestring

Type of the pipeline. For an ingestion pipeline, the value should be INGESTION.

createTimestampinteger

Timestamp indicating when the pipeline is created.

descriptionstring

Description of the pipeline.

statusstring

Current status of the pipeline. If the value is other than SERVING, the pipeline is not working.

totalUsagearray

The total token usage of the pipeline.

[]totalUsageobject

Statistics of the total token usage of the pipeline.

embeddingstring

The total token usage of the embedding model.

functionsarray

Functions in the pipeline. An ingestion pipeline must have one and only one INDEX function and can have 0-50 PRESERVE functions.

[]functionsoneOf

A function in the pipeline.

Functions of a text ingestion pipeline.

namestring

Name of the function.

actionstring

Type of the function. For a text ingestion pipeline, the value should be INDEX_TEXT or PRESERVE.

inputFieldsarray

Name the field according to your needs. For a text ingestion pipeline, use it for a list of texts to be ingested.

[]inputFieldsstring

An input field.

langaugestring

Language that your source data is in.

embeddingstring

Name of the embedding model in use.

Functions of a doc ingestion pipeline.

namestring

Name of the function.

actionstring

Type of the function. For a doc ingestion pipeline, the value should be INDEX_DOC or PRESERVE.

inputFieldstring

Name the field according to your needs. For a doc ingestion pipeline, use it for the pre-signed url of the doc to be ingested.

langaugestring

Language that your source data is in.

chunkSizeinteger

The maximum size of a splitted document segment. The allowed chunk size range depends on the embedding model in use. For more information, refer to Zilliz Cloud Limits

embeddingstring

Name of the embedding model in use.

splitByarray

The splitters for Zilliz Cloud to split the specified document into smaller chunks. The value defaults to ["\n\n", "\n", " ", ""].

[]splitBystring

A splitter.

Functions of an image ingestion pipeline.

namestring

Name of the function.

actionstring

Type of the function. For an image ingestion pipeline, the value should be INDEX_IMAGE or PRESERVE.

inputFieldsarray

Name the fields according to your needs. In an image ingestion pipeline: image_url stands for pre-signed image URLs in object storage buckets, and image_id stands for the image ID.

[]inputFieldsstring

An input field.

embeddingstring

Name of the embedding model in use.

Functions for an ingestion pipeline to preserve metadata.

namestring

Name of the function.

actionstring

Type of the function. For an ingestion pipeline that preserves metadata, the value should be PRESERVE.

inputFieldstring

Name the field according to your needs. In a PRESERVE function of an ingestion pipeline, Zilliz Cloud uses the value as the name of a field in the collection to create.

outputFieldstring

Name of the output field. The value should be the same as that of input_field.

fieldTypestring

Data type of the field to create in the target collection.

CLUSTER_IDstring

The target cluster to which the pipeline applies.

collectionNamestring

The target collection to which the pipeline applies.

codeinteger

Indicates whether the request succeeds.

dataobject

The information about the search pipeline just created.

pipelineIdinteger

A pipeline ID.

namestring

Name of the pipeline

typestring

Type of the pipeline. For a search pipeline, the value should be SEARCH.

descriptionstring

Description of the pipeline.

statusstring

Current status of the pipeline. If the value is not SERVING, the pipeline is not working.

totalUsagearray

Total token usage of the pipeline.

[]totalUsageobject

Statistics on the total token usage of the pipeline.

embeddingstring

Token usage of the embedding model.

rerankstring

Token usage of the reranker model.

functionsarray

Functions in the pipeline. A search pipeline can only have one function.

[]functionsoneOf

A Function in the created pipeline.

Functions of a text search pipeline.

namestring

Name of the function.

actionstring

Type of the function. For a text search pipeline, the value should be SEARCH_TEXT.

inputFieldsarray

Name the field according to your needs. For a text search pipeline, use it for the query text (query_text).

[]inputFieldsstring

An input field.

embeddingstring

The embedding model in use.

rerankerstring

The reranking model in use.

collectionNamestring

Name of the target collection to which the pipeline applies.

clusterIdstring

ID of the target cluster to which the pipeline applies.

Functions of a doc search pipeline.

namestring

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstring

Type of the function to add. For a doc search pipeline, the value should be SEARCH_DOC_CHUNK.

clusterIdstring

ID of the target cluster to which the pipeline applies.

collectionNamestring

Name of the target collection to which the pipeline applies.

embeddingstring

An embedding model in use for vector search. It should be consistent with the embedding model chosen in its compatible collection. For more information, refer to Which embedding model does Zilliz Cloud Pipelines use?.

rerankerstring

A reranking model used to rank a set of candidate outputs to improve the quality of the search results. Currently, the only possible value is zilliz/bge-reranker-base. For more information, refer to Reranker.

inputFieldsarray

Name the field according to your needs. For a doc search pipeline, use it for the query text (query_text).

[]inputFieldsstring

An input field.

Functions of a reverse image search pipeline.

namestring

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstring

Type of the function to add. For a reverse image search pipeline, the value should be SEARCH_IMAGE_BY_IMAGE.

clusterIdstring

ID of the target cluster to which the pipeline applies.

collectionNamestring

Name of the target collection to which the pipeline applies.

embeddingstring

An embedding model in use for vector search. It should be consistent with the embedding model chosen in its compatible collection. For more information, refer to Which embedding model does Zilliz Cloud Pipelines use?.

rerankerstring

The reranker model in use .

inputFieldsarray

Name the field according to your needs. For a reverse image search pipeline, use it for the query image URL (query_image_url).

[]inputFieldsstring

An input field.

Functions of an image search pipeline.

namestring

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstring

Type of the function to add. For an image search pipeline, the value should be SEARCH_IMAGE_BY_TEXT.

clusterIdstring

ID of the target cluster to which the pipeline applies.

collectionNamestring

Name of the target collection to which the pipeline applies.

embeddingstring

An embedding model in use for vector search. It should be consistent with the embedding model chosen in its compatible collection. For more information, refer to Which embedding model does Zilliz Cloud Pipelines use?.

rerankerstring

The reranker model in use .

inputFieldsarray

Name the field according to your needs. For a text-image search pipeline, use it for the query text (query_text).

[]inputFieldsstring

An input field.

codeinteger

Indicates whether the request succeeds.

dataobject

The information about the deletion pipeline just created.

pipelineIdinteger

ID of the pipeline.

namestring

Name of the pipeline.

typestring

Type of the pipeline. For a deletion pipeline, the value should be DELETION.

createTimestampinteger

Timestamp indicating when the pipeline is created.

descriptionstring

Description of the pipeline.

statusstring

Current status of the pipeline. If the value is other than SERVING, the pipeline is not working.

functionsarray

Functions in the pipeline. An ingestion pipeline must have one and only one INDEX function and can have 0-50 PRESERVE functions.

[]functionsoneOf

A function in the pipeline.

Functions of a text deletion pipeline.

namestring

Name of the function.

actionstring

Type of the function. For a text deletion pipeline, the value should be PURGE_TEXT_INDEX or PURGE_BY_EXPRESSION.

inputFieldsarray

Name the field according to your needs. For a text ingestion pipeline, use it for the ID of the text (id) to be deleted or an expression (expression).

[]inputFieldsstring

An input field.

Functions of a doc deletion pipeline.

namestring

Name of the function.

actionstring

Type of the function. For a doc deletion pipeline, the value should be PURGE_DOC_INDEX or PURGE_BY_EXPRESSION.

inputFieldsstring

Name the field according to your needs. For a doc deletion pipeline, use it for the name of the doc (doc_name) to be deleted or an expression (expression).

Functions of an image deletion pipeline.

namestring

Name of the function to add. The function name should be a string of 3-64 characters and can contain only alphanumeric letters and underscores.

actionstring

Type of the function to add. For an image deletion pipeline, the value should be PURGE_IMAGE_INDEX or PURGE_BY_EXPRESSION.

inputFieldsarray

Name the fields according to your needs. In an image deletion pipeline, use it for the ID of the image (image_id) to delete or an expression (expression).

[]inputFieldsstring

An input field.

CLUSTER_IDstring

The target cluster to which the pipeline applies.

collectionNamestring

The target collection to which the pipeline applies.

Returns an error message.

codeinteger

Response code.

messagestring

Error message.

{
"code": 200,
"data": {
"pipelineId": "pipe-xxx",
"name": "my_text_ingestion_pipeline",
"type": "INGESTION",
"createTimestamp": 1721187300000,
"description": "A pipeline that generates text embeddings and stores additional fields.",
"status": "SERVING",
"totalUsage": {
"embedding": 0
},
"functions": [
{
"name": "index_my_text",
"action": "INDEX_TEXT",
"inputFields": [
"text_list"
],
"language": "ENGLISH",
"embedding": "zilliz/bge-base-en-v1.5"
},
{
"name": "keep_text_info",
"action": "PRESERVE",
"inputField": "source",
"outputField": "source",
"fieldType": "VarChar"
}
],
"clusterId": "inxx-xxxx",
"collectionName": "my_collection"
}
}
{
"code": 200,
"data": {
"pipelineId": "pipe-xxxx",
"name": "my_doc_ingestion_pipeline",
"type": "INGESTION",
"createTimestamp": 1721187300000,
"description": "A pipeline that splits a doc file into chunks and generates embeddings. It also stores the publish_year with each chunk.",
"status": "SERVING",
"totalUsage": {
"embedding": 0
},
"functions": [
{
"action": "INDEX_DOC",
"name": "index_my_doc",
"inputField": "doc_url",
"language": "ENGLISH",
"chunkSize": 500,
"embedding": "zilliz/bge-base-en-v1.5",
"splitBy": [
"\n\n",
"\n",
" ",
""
]
},
{
"action": "PRESERVE",
"name": "keep_doc_info",
"inputField": "publish_year",
"outputField": "publish_year",
"fieldType": "Int16"
}
],
"clusterId": "in03-***************",
"collectionName": "my_collection"
}
}
{
"code": 200,
"data": {
"pipelineId": "pipe-xxxx",
"name": "my_image_ingestion_pipeline",
"type": "INGESTION",
"createTimestamp": 1721187300000,
"clusterId": "in03-***************",
"collectionName": "my_collection",
"description": "A pipeline that converts an image into vector embeddings and store in efficient index for search.",
"status": "SERVING",
"totalUsage": {
"embedding": 0
},
"functions": [
{
"action": "INDEX_IMAGE",
"name": "index_my_image",
"inputFields": [
"image_url",
"image_id"
],
"embedding": "zilliz/vit-base-patch16-224"
},
{
"action": "PRESERVE",
"name": "keep_image_tag",
"inputField": "image_title",
"outputField": "image_title",
"fieldType": "VarChar"
}
]
}
}