
Create Pipeline

Creates a pipeline.


POST
https://controller.api.${CLOUD_REGION}.zillizcloud.com/v1/pipelines

Example

📘 Notes

This API requires an API key as the authentication token.

Currently, you can create pipelines to ingest data into, search data in, or purge data from your collections. The request parameters vary with the type of pipeline you want to create and the data you want to process.

export CLOUD_REGION="gcp-us-west1"
export API_KEY=""

curl --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer ${API_KEY}" \
  --url "https://controller.api.${CLOUD_REGION}.zillizcloud.com/v1/pipelines" \
  -d '{
    "name": "my_text_ingestion_pipeline",
    "clusterId": "inxx-xxxxxxxxxxxxxxx",
    "projectId": "proj-xxxx",
    "collectionName": "my_collection",
    "description": "A pipeline that generates text embeddings and stores additional fields.",
    "type": "INGESTION",
    "functions": [
      {
        "name": "index_my_text",
        "action": "INDEX_TEXT",
        "language": "ENGLISH",
        "embedding": "zilliz/bge-base-en-v1.5"
      },
      {
        "name": "keep_text_info",
        "action": "PRESERVE",
        "inputField": "source",
        "outputField": "source",
        "fieldType": "VarChar"
      }
    ]
  }'

A possible response is similar to the following:

{
  "code": 200,
  "data": {
    "pipelineId": "pipe-xxxx",
    "name": "my_text_ingestion_pipeline",
    "type": "INGESTION",
    "clusterId": "inxx-xxxxxxxxxxxxxxx",
    "collectionName": "my_collection",
    "description": "A pipeline that generates text embeddings and stores additional fields.",
    "status": "SERVING",
    "functions": [
      {
        "action": "INDEX_TEXT",
        "name": "index_my_text",
        "inputFields": ["text_list"],
        "language": "ENGLISH",
        "embedding": "zilliz/bge-base-en-v1.5"
      },
      {
        "action": "PRESERVE",
        "name": "keep_text_info",
        "inputField": "source",
        "outputField": "source",
        "fieldType": "VarChar"
      }
    ]
  }
}

Request

Parameters

  • No query parameters required

  • No path parameters required

  • No header parameters required

Request Body

Option 1:

{
  "name": "string",
  "type": "string",
  "description": "string",
  "functions": [
    {
      "name": "string",
      "action": "string",
      "inputField": "string",
      "outputField": "string",
      "fieldType": "string"
    }
  ],
  "clusterId": "string",
  "collectionName": "string",
  "projectId": "string"
}
Parameter | Description
name (string)
  Name of the pipeline to create.
type (string)
  Type of the pipeline to create. For an ingestion pipeline, the value should be INGESTION.
description (string)
  Description of the pipeline to create.
functions (array)
  Actions to take in the pipeline to create. For an ingestion pipeline, you can add only one doc-indexing function and multiple preserve functions.
functions[] (object | object | object | object)
functions[][opt_1] (object)
functions[][opt_1].name (string)
  Name of the function to create.
functions[][opt_1].action (string)
  Type of the function to create. For an ingestion pipeline, possible values are INDEX_DOC, INDEX_TEXT, INDEX_IMAGE, and PRESERVE.
functions[][opt_1].embedding (string)
  Name of the embedding model used to convert the text into vector embeddings. For possible values, refer to Ingest, Search, and Delete Data.
functions[][opt_1].language (string)
  Language of your documents. Possible values are ENGLISH and CHINESE.
functions[][opt_2] (object)
functions[][opt_2].name (string)
  Name of the function to create.
functions[][opt_2].action (string)
  Type of the function to create. For an ingestion pipeline, possible values are INDEX_DOC, INDEX_TEXT, INDEX_IMAGE, and PRESERVE.
functions[][opt_2].embedding (string)
  Name of the embedding model used to convert the text into vector embeddings. For possible values, refer to Ingest, Search, and Delete Data.
functions[][opt_2].language (string)
  Language of your documents. Possible values are ENGLISH and CHINESE.
functions[][opt_2].chunkSize (string)
  The maximum size of a split document segment. The value defaults to 500.
functions[][opt_2].splitBy (array)
  The splitters for Zilliz Cloud to split the specified document into smaller chunks. The value defaults to ["\n\n", "\n", " ", ""].
functions[][opt_2].splitBy[] (string)
  A splitter.
functions[][opt_3] (object)
functions[][opt_3].name (string)
  Name of the function to create.
functions[][opt_3].action (string)
  Type of the function to create. For an ingestion pipeline, possible values are INDEX_DOC, INDEX_TEXT, INDEX_IMAGE, and PRESERVE.
functions[][opt_3].embedding (string)
  Name of the embedding model used to convert the text into vector embeddings. For possible values, refer to Ingest, Search, and Delete Data.
functions[][opt_4] (object)
functions[][opt_4].name (string)
  Name of the function to create.
functions[][opt_4].action (string)
  Type of the function to create. For an ingestion pipeline, possible values are INDEX_DOC and PRESERVE.
functions[][opt_4].inputField (string)
  Name the field according to your needs. In a preserve function of an ingestion pipeline, Zilliz Cloud uses the value as the name of a field in the collection to create.
functions[][opt_4].outputField (string)
  Name of the output field. The value should be the same as that of inputField.
functions[][opt_4].fieldType (string)
  Data type of the field to create in the target collection. Possible values are BOOL, INT8, INT16, INT32, INT64, FLOAT, DOUBLE, and VARCHAR.
clusterId (string)
  ID of a target cluster. You can find it in the cluster details on the Zilliz Cloud console.
collectionName (string)
  Name of the collection to create in the specified cluster. Zilliz Cloud creates a new collection and names it using this value.
projectId (string)
  ID of the project to which the target cluster belongs.
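
For example, a request body for a doc-ingestion pipeline that uses the chunking-related parameters above might look like the following sketch. The pipeline name, function name, and the cluster, project, and collection identifiers are placeholders; chunkSize and splitBy are optional and are shown here with their documented default values.

curl --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer ${API_KEY}" \
  --url "https://controller.api.${CLOUD_REGION}.zillizcloud.com/v1/pipelines" \
  -d '{
    "name": "my_doc_ingestion_pipeline",
    "clusterId": "inxx-xxxxxxxxxxxxxxx",
    "projectId": "proj-xxxx",
    "collectionName": "my_doc_collection",
    "description": "A pipeline that splits documents into chunks and generates embeddings.",
    "type": "INGESTION",
    "functions": [
      {
        "name": "index_my_doc",
        "action": "INDEX_DOC",
        "language": "ENGLISH",
        "embedding": "zilliz/bge-base-en-v1.5",
        "chunkSize": 500,
        "splitBy": ["\n\n", "\n", " ", ""]
      }
    ]
  }'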

Option 2:

{
  "name": "string",
  "description": "string",
  "type": "string",
  "functions": [
    {
      "name": "string",
      "action": "string",
      "clusterId": "string",
      "collectionName": "string",
      "embedding": "string",
      "reranker": "string"
    }
  ],
  "projectId": "string"
}
Parameter | Description
name (string)
  Name of the pipeline to create.
description (string)
  Description of the pipeline to create.
type (string)
  Type of the pipeline to create. For a search pipeline, the value should be SEARCH.
functions (array)
  Actions to take in the search pipeline to create. You can define multiple functions to retrieve results from different collections.
functions[] (object)
functions[].name (string)
  Name of the function to create.
functions[].action (string)
  Type of the function to create. For a search pipeline, possible values are SEARCH_TEXT, SEARCH_DOC_CHUNK, SEARCH_IMAGE_BY_IMAGE, and SEARCH_IMAGE_BY_TEXT.
functions[].clusterId (string)
  ID of the target cluster in which Zilliz Cloud conducts the search.
functions[].collectionName (string)
  Name of the collection in which Zilliz Cloud conducts the search.
functions[].embedding (string)
  The embedding model used during vector search. The model should be consistent with the one chosen in the compatible collection.
functions[].reranker (string)
  If you need to reorder or rank a set of candidate outputs to improve the quality of the search results, set this parameter to a reranker model. This parameter applies only to pipelines for Text and Doc Data. Currently, only zilliz/bge-reranker-base is available as the parameter value.
projectId (string)
  ID of the project to which the target cluster belongs.
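
A request built from the Option 2 parameters might look like the following sketch for a search pipeline. The pipeline and function names, the cluster and project IDs, and the collection name are placeholders; the embedding and reranker values reuse the models mentioned above, and the reranker can be omitted if no reranking is needed.

curl --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer ${API_KEY}" \
  --url "https://controller.api.${CLOUD_REGION}.zillizcloud.com/v1/pipelines" \
  -d '{
    "name": "my_text_search_pipeline",
    "projectId": "proj-xxxx",
    "description": "A pipeline that searches text in my_collection.",
    "type": "SEARCH",
    "functions": [
      {
        "name": "search_my_text",
        "action": "SEARCH_TEXT",
        "clusterId": "inxx-xxxxxxxxxxxxxxx",
        "collectionName": "my_collection",
        "embedding": "zilliz/bge-base-en-v1.5",
        "reranker": "zilliz/bge-reranker-base"
      }
    ]
  }'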

Option 3:

{
  "name": "string",
  "description": "string",
  "type": "string",
  "functions": [
    {
      "name": "string",
      "action": "string"
    }
  ],
  "clusterId": "string",
  "collectionName": "string",
  "projectId": "string"
}
Parameter | Description
name (string)
  Name of the pipeline to create.
description (string)
  Description of the pipeline to create.
type (string)
  Type of the pipeline to create. For a deletion pipeline, the value should be DELETION.
functions (array)
  Actions to take in the pipeline to create.
functions[] (object)
functions[].name (string)
  Name of the function to create.
functions[].action (string)
  Type of the function to create. For a deletion pipeline, possible values are PURGE_BY_EXPRESSION, PURGE_DOC_INDEX, and PURGE_IMAGE_INDEX.
clusterId (string)
  ID of a target cluster. You can find it in the cluster details on the Zilliz Cloud console.
collectionName (string)
  Name of the target collection in the specified cluster.
projectId (string)
  ID of the project to which the target cluster belongs.
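
A request built from the Option 3 parameters might look like the following sketch for a deletion pipeline. The pipeline and function names, the cluster and project IDs, and the collection name are placeholders.

curl --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer ${API_KEY}" \
  --url "https://controller.api.${CLOUD_REGION}.zillizcloud.com/v1/pipelines" \
  -d '{
    "name": "my_deletion_pipeline",
    "clusterId": "inxx-xxxxxxxxxxxxxxx",
    "projectId": "proj-xxxx",
    "collectionName": "my_collection",
    "description": "A pipeline that purges data matching an expression.",
    "type": "DELETION",
    "functions": [
      {
        "name": "purge_by_expression",
        "action": "PURGE_BY_EXPRESSION"
      }
    ]
  }'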

Response

Returns information about the pipeline just created.

Response Body

Option 1:

{
  "code": "integer",
  "data": {
    "pipelineId": "integer",
    "name": "string",
    "type": "string",
    "description": "string",
    "status": "string",
    "functions": {
      "oneOf": [
        {
          "name": "string",
          "action": "string",
          "inputFields": [
            {}
          ],
          "language": "string",
          "embedding": "string"
        },
        {
          "name": "string",
          "action": "string",
          "inputField": "string",
          "language": "string",
          "chunkSize": "integer",
          "embedding": "string",
          "splitBy": "string"
        },
        {
          "name": "string",
          "action": "string",
          "inputFields": [
            {}
          ],
          "embedding": "string"
        },
        {
          "name": "string",
          "action": "string",
          "inputField": "string",
          "outputField": "string",
          "fieldType": "string"
        }
      ]
    },
    "clusterID": "string",
    "collectionName": "string"
  }
}
Property | Description
code (integer)
  Indicates whether the request succeeds.
  • 0: The request succeeds.
  • Others: Some error occurs.
data (object)
data.pipelineId (integer)
  A pipeline ID.
data.name (string)
  Name of the pipeline.
data.type (string)
  Type of the pipeline. For an ingestion pipeline, the value should be INGESTION.
data.description (string)
  Description of the pipeline.
data.status (string)
  Current status of the pipeline. If the value is other than SERVING, the pipeline is not working.
functions (object | object | object | object)
  Functions in the pipeline. For an ingestion pipeline, there should be only one INDEX_DOC function.
functions[opt_1] (object)
functions[opt_1].name (string)
  Name of the function to create.
functions[opt_1].action (string)
  Type of the function to create. For an ingestion pipeline, possible values are INDEX_DOC and PRESERVE.
functions[opt_1].inputFields (array)
  Name the fields according to your needs. In an INDEX_TEXT function of an ingestion pipeline, use them for the user-provided texts.
functions[opt_1].inputFields[] (string)
  An input field.
functions[opt_1].language (string)
  Language that your document is in. Possible values are ENGLISH and CHINESE. The parameter applies only to ingestion pipelines.
functions[opt_1].embedding (string)
  Name of the embedding model in use.
functions[opt_2] (object)
functions[opt_2].name (string)
  Name of the function to create.
functions[opt_2].action (string)
  Type of the function to create. For an ingestion pipeline, possible values are INDEX_DOC and PRESERVE.
functions[opt_2].inputField (string)
  Name the field according to your needs. In an INDEX_DOC function of an ingestion pipeline, use it for pre-signed document URLs in GCS or AWS S3 buckets.
functions[opt_2].language (string)
  Language that your document is in. Possible values are ENGLISH and CHINESE. The parameter applies only to ingestion pipelines.
functions[opt_2].chunkSize (integer)
  The maximum size of a split document segment.
functions[opt_2].embedding (string)
  Name of the embedding model in use.
functions[opt_2].splitBy (string)
  The splitters that Zilliz Cloud uses to split the specified documents.
functions[opt_3] (object)
functions[opt_3].name (string)
  Name of the function to create.
functions[opt_3].action (string)
  Type of the function to create. For an ingestion pipeline, possible values are INDEX_DOC and PRESERVE.
functions[opt_3].inputFields (array)
  Name the fields according to your needs. In an INDEX_IMAGE function of an ingestion pipeline, image_url stands for pre-signed image URLs in GCS or AWS S3 buckets, and image_id stands for the image ID.
functions[opt_3].inputFields[] (string)
  An input field.
functions[opt_3].embedding (string)
  Name of the embedding model in use.
functions[opt_4] (object)
functions[opt_4].name (string)
  Name of the function to create.
functions[opt_4].action (string)
  Type of the function to create. For an ingestion pipeline, possible values are INDEX_DOC and PRESERVE.
functions[opt_4].inputField (string)
  Name the field according to your needs. In a preserve function of an ingestion pipeline, Zilliz Cloud uses the value as the name of a field in the collection to create.
functions[opt_4].outputField (string)
  Name of the output field. The value should be the same as that of inputField.
functions[opt_4].fieldType (string)
  Data type of the field to create in the target collection. Possible values are BOOL, INT8, INT16, INT32, INT64, FLOAT, DOUBLE, and VARCHAR.
data.clusterID (string)
  The target cluster to which the pipeline applies.
data.collectionName (string)
  The target collection to which the pipeline applies.

Option 2:

{
  "code": "integer",
  "data": {
    "pipelineId": "integer",
    "name": "string",
    "type": "string",
    "description": "string",
    "status": "string",
    "functions": [
      {
        "name": "string",
        "action": "string",
        "inputFields": [
          {}
        ],
        "clusterID": "string",
        "collectionName": "string",
        "reranker": "string"
      }
    ]
  }
}
Property | Description
code (integer)
  Indicates whether the request succeeds.
  • 0: The request succeeds.
  • Others: Some error occurs.
data (object)
data.pipelineId (integer)
  A pipeline ID.
data.name (string)
  Name of the pipeline.
data.type (string)
  Type of the pipeline. For a search pipeline, the value should be SEARCH.
data.description (string)
  Description of the pipeline.
data.status (string)
  Current status of the pipeline. If the value is not SERVING, the pipeline is not working.
data.functions (array)
  Functions in the pipeline. For a search pipeline, each of its member functions targets a different collection.
data.functions[] (object)
data.functions[].name (string)
  Name of the function.
data.functions[].action (string)
  Type of the function. For a search function, possible values are SEARCH_DOC_CHUNKS, SEARCH_TEXT, SEARCH_IMAGE_BY_IMAGE, and SEARCH_IMAGE_BY_TEXT.
data.functions[].inputFields (array)
  Names of the input fields.
data.functions[].inputFields[] (string)
  For a SEARCH_DOC_CHUNKS or a SEARCH_IMAGE_BY_TEXT function, you should include query_text as the value.
data.functions[].clusterID (string)
  Target cluster of this function.
data.functions[].collectionName (string)
  Target collection of this function.
data.functions[].reranker (string)
  If you need to reorder or rank a set of candidate outputs to improve the quality of the search results, set this parameter to a reranker model. This parameter applies only to pipelines for Text and Doc Data. Currently, only zilliz/bge-reranker-base is available as the parameter value.

Option 3:

{
  "code": "integer",
  "data": {
    "pipelineId": "integer",
    "name": "string",
    "type": "string",
    "description": "string",
    "status": "string",
    "functions": [
      {
        "name": "string",
        "action": "string",
        "inputField": "string"
      }
    ],
    "clusterID": "string",
    "collectionName": "string"
  }
}
Property | Description
code (integer)
  Indicates whether the request succeeds.
  • 0: The request succeeds.
  • Others: Some error occurs.
data (object)
data.pipelineId (integer)
  A pipeline ID.
data.name (string)
  Name of the pipeline.
data.type (string)
  Type of the pipeline. For a deletion pipeline, the value should be DELETION.
data.description (string)
  Description of the pipeline.
data.status (string)
  Current status of the pipeline. If the value is not SERVING, the pipeline is not working.
data.functions (array)
  Functions in the pipeline. For a deletion pipeline, there can be multiple member functions, each representing a deletion request.
data.functions[] (object)
data.functions[].name (string)
  Name of the function.
data.functions[].action (string)
  Type of the function. For a deletion pipeline, each member function should be one of PURGE_BY_EXPRESSION, PURGE_DOC_INDEX, and PURGE_IMAGE_BY_ID.
data.functions[].inputField (string)
  Name of the input field. For a PURGE_DOC_INDEX function, the value should be the name of the doc to delete.
data.clusterID (string)
  Target cluster of the pipeline.
data.collectionName (string)
  Target collection of the pipeline.

Error Response

{
  "code": "integer",
  "message": "string"
}
Property | Description
code (integer)
  Indicates whether the request succeeds.
  • 0: The request succeeds.
  • Others: Some error occurs.
message (string)
  Indicates the possible reason for the reported error.
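
For reference, a failed request returns a payload in the shape above. The values in the following sketch are purely illustrative placeholders and do not correspond to actual Zilliz Cloud error codes or messages:

{
  "code": 12345,
  "message": "Illustrative example only: the request could not be processed."
}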