Vector database connector
The vector database connector allows embedding, storing, and retrieving Large Language Model (LLM) embeddings. This enables building AI-based solutions for your organization, such as contextual document search, long-term LLM memory, and agentic AI interaction.
The vector database connector uses the LangChain4j library. Supported data models and implementations are limited to those available in the latest stable LangChain4j release.
Prerequisites
Before using the vector database connector, ensure you understand the concept of LLM embeddings.
To start using the vector database connector, ensure you have access to a supported LLM embeddings API to convert document content into vectorized embedding form. You will also need write access to a supported vector database.
Create a connector task
You can apply a connector to a task or event via the append menu. For example:
- From the canvas: Select an element and click the Change element icon to change an existing element, or use the append feature to add a new element to the diagram.
- From the properties panel: Navigate to the Template section and click Select.
- From the side palette: Click the Create element icon.
After you have applied a connector to your element, follow the configuration steps or see using connectors to learn more.
Operations
- Embed document
- Retrieve document
The embed document operation performs the following steps:
- Consume a document.
- Parse the document depending on its file format (optionally splitting it into text chunks).
- Convert the chunks into vector form with the help of an LLM.
- Store the produced vectors in a vector database.
To perform this operation, enter the following:
- Operation dropdown: Embed document.
- Embedding model: Refer to the embedding models section below.
- Vector store: Refer to the vector stores section below.
- Document: Refer to the embedding document configuration section below.
As a result of this operation, you will get an array of created embedding chunk IDs, for example ["d599ec62-fe51-4a91-bbf0-26e1241f9079", "a1fad021-5148-42b4-aa02-7de9d590e69c"].
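Conceptually, the embed flow maps directly onto LangChain4j primitives. The following is a minimal, illustrative sketch of the same steps, using LangChain4j's in-memory store and a local embedding model as stand-ins for the connector's configured vector store and embedding model (class and package names may vary between LangChain4j versions, and this is not the connector's actual wiring):

```java
import dev.langchain4j.data.document.Document;
import dev.langchain4j.data.document.splitter.DocumentSplitters;
import dev.langchain4j.data.embedding.Embedding;
import dev.langchain4j.data.segment.TextSegment;
import dev.langchain4j.model.embedding.onnx.allminilml6v2.AllMiniLmL6V2EmbeddingModel;
import dev.langchain4j.store.embedding.inmemory.InMemoryEmbeddingStore;

import java.util.List;

public class EmbedSketch {
    public static void main(String[] args) {
        // 1. Consume a document (plain text here; the connector can also parse files via Apache Tika).
        Document document = Document.from("Camunda is a platform for orchestrating business processes.");

        // 2. Optionally split the document into smaller text chunks.
        List<TextSegment> segments = DocumentSplitters.recursive(300, 30).split(document);

        // 3. Convert the chunks into vector form with an embedding model.
        AllMiniLmL6V2EmbeddingModel model = new AllMiniLmL6V2EmbeddingModel();
        List<Embedding> embeddings = model.embedAll(segments).content();

        // 4. Store the produced vectors; the returned IDs are the embedding chunk IDs.
        InMemoryEmbeddingStore<TextSegment> store = new InMemoryEmbeddingStore<>();
        List<String> chunkIds = store.addAll(embeddings, segments);
        System.out.println(chunkIds); // e.g. ["d599ec62-...", "a1fad021-..."]
    }
}
```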
The retrieve document operation performs the following steps:
- Consume a query.
- Convert the query into vector form with the help of an LLM.
- Perform a vector similarity search over previously stored LLM embeddings.
- Store the results in Camunda document storage.
To perform this operation, enter the following:
- Operation dropdown: Retrieve document.
- Search query: Enter your search query.
- Max results: Enter the maximum number of results to return.
- Min score: Enter the minimum similarity score threshold; the value must be between 0 and 1, for example, 0.81.
- Embedding model: Refer to the embedding models section below.
- Vector store: Refer to the vector stores section below.
As a result of this operation, you will get an array of relevant chunks, where each chunk includes a chunk ID, Camunda document reference metadata, a similarity score, and the actual text content:
{
  "chunks": [
    {
      "chunkId": "e30d570b-2a3a-4f4a-9a0c-78f0f1acd383",
      "documentReference": {
        "storeId": "local",
        "documentId": "2b36ec67-a78f-4f99-9371-8c6e5b332838",
        "contentHash": "1c232bc1e553c10d00c3327dcca9012b6b4b0758a1c2afaad8b77c80fa1bd36e",
        "metadata": {
          "size": 116,
          "fileName": null,
          "processDefinitionId": null,
          "processInstanceKey": null,
          "customProperties": {},
          "expiresAt": null,
          "contentType": "text/plain"
        },
        "camunda.document.type": "camunda"
      },
      "score": 0.6721556,
      "content": "Camunda is a platform for orchestrating and automating business processes. It helps organizations design, execute, and manage workflows, enabling them to optimize processes and improve efficiency."
    }
  ]
}
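Internally, retrieval corresponds to an embedding search in LangChain4j. A minimal sketch, assuming the same model and store pairing used during embedding (the query text and limits are illustrative):

```java
import dev.langchain4j.data.embedding.Embedding;
import dev.langchain4j.data.segment.TextSegment;
import dev.langchain4j.model.embedding.EmbeddingModel;
import dev.langchain4j.store.embedding.EmbeddingMatch;
import dev.langchain4j.store.embedding.EmbeddingSearchRequest;
import dev.langchain4j.store.embedding.EmbeddingStore;

public class RetrieveSketch {

    // `model` and `store` are the same pair used when embedding (see the embed sketch above).
    static void retrieve(EmbeddingModel model, EmbeddingStore<TextSegment> store) {
        // 1. Consume a query and convert it into vector form with the same embedding model.
        Embedding queryEmbedding = model.embed("What is Camunda?").content();

        // 2. Perform a similarity search, bounded by "Max results" and "Min score".
        EmbeddingSearchRequest request = EmbeddingSearchRequest.builder()
                .queryEmbedding(queryEmbedding)
                .maxResults(5)
                .minScore(0.81)
                .build();

        // 3. Each match carries a chunk ID, a similarity score, and the original text content.
        for (EmbeddingMatch<TextSegment> match : store.search(request).matches()) {
            System.out.println(match.embeddingId() + " " + match.score() + " " + match.embedded().text());
        }
    }
}
```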
Embedding models
Amazon Bedrock
The vector database connector currently supports only Amazon Titan V1/V2 embedding models. Review the official Amazon documentation to understand how to choose request parameters.
The vector database connector uses the LangChain4j implementation of these models.
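For illustration, the langchain4j-bedrock module exposes a Titan embedding model along these lines. This is a sketch only; exact builder method names and model identifiers should be checked against your LangChain4j version and the Amazon Bedrock model catalog:

```java
import dev.langchain4j.model.bedrock.BedrockTitanEmbeddingModel;
import software.amazon.awssdk.regions.Region;

public class BedrockModelSketch {
    public static void main(String[] args) {
        // Sketch only: the connector wires this up internally from the element template fields.
        BedrockTitanEmbeddingModel embeddingModel = BedrockTitanEmbeddingModel.builder()
                .region(Region.US_EAST_1)             // AWS region of the Bedrock endpoint
                .model("amazon.titan-embed-text-v1")  // Titan V1; a Titan V2 identifier works the same way
                .build();
    }
}
```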
Vector stores
- Elasticsearch
- Amazon OpenSearch
The vector database connector can use Elasticsearch as a vector store. The Elasticsearch version must be 8+.
Enter the following parameters:
- Base URL: The Elasticsearch base URL, including protocol, for example https://host:port.
- Username: The username of an Elasticsearch user with read/write access.
- Password: The password of that Elasticsearch user.
- Index name: Name of the index where you wish to store embeddings.
  - When embedding: If the index is not present, the connector creates a new one.
  - When retrieving: If the index is absent, the connector raises an error.
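As a point of reference, the underlying LangChain4j Elasticsearch store is configured with the same values. A rough sketch, with placeholder credentials; builder method names may differ slightly between LangChain4j versions:

```java
import dev.langchain4j.store.embedding.elasticsearch.ElasticsearchEmbeddingStore;

public class ElasticsearchStoreSketch {
    public static void main(String[] args) {
        // Sketch only: the connector builds the store from the fields above.
        ElasticsearchEmbeddingStore store = ElasticsearchEmbeddingStore.builder()
                .serverUrl("https://host:port") // Base URL, including protocol
                .userName("elastic")            // Username with read/write access
                .password("changeme")           // Password for that user
                .indexName("embeddings")        // Index name
                .build();
    }
}
```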
The vector database connector can also use Amazon OpenSearch as a vector store. Enter the following parameters:
- Access key and Secret key: Enter AWS IAM credentials for a user with read/write access.
- Server URL: The Amazon OpenSearch URL without protocol, for example my-opensearch.aws.com:port.
- Region: The region of the Amazon OpenSearch instance.
- Index name: Name of the index where you wish to store embeddings.
  - When embedding: If the index is not present, the connector creates a new one.
  - When retrieving: If the index is absent, the connector raises an error.
Embedding document configuration
Document source
The Document source can be either Plain text or a Camunda document.
Plain text can be useful when you deal with small data that fits into a text field or a process instance variable. Input is handled as regular UTF-8 text.
A FEEL string conversion function, such as string(), might be useful if you have JSON input.
A Camunda document might be useful when you deal with larger document pipelines fed by webhooks or user tasks. Input documents are parsed with Apache Tika, so files can be in any Apache Tika-supported format.
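To illustrate what the Tika-based parsing step yields, here is a minimal sketch using LangChain4j's Apache Tika parser; the file name is hypothetical:

```java
import dev.langchain4j.data.document.Document;
import dev.langchain4j.data.document.parser.apache.tika.ApacheTikaDocumentParser;

import java.io.FileInputStream;
import java.io.InputStream;

public class ParseSketch {
    public static void main(String[] args) throws Exception {
        // Tika detects the format (PDF, DOCX, HTML, ...) and extracts plain text.
        try (InputStream in = new FileInputStream("report.pdf")) { // hypothetical input file
            Document document = new ApacheTikaDocumentParser().parse(in);
            System.out.println(document.text());
        }
    }
}
```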
Splitting
Splitting is the process of breaking large documents into smaller pieces. It can be either recursive or disabled (no splitting). Seek guidance from your data scientists to determine whether you require splitting.
Learn more about splitting in the LangChain4j documentation.
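For reference, recursive splitting in LangChain4j looks roughly like this; the segment size and overlap values are illustrative and should come from your own analysis of the data:

```java
import dev.langchain4j.data.document.Document;
import dev.langchain4j.data.document.DocumentSplitter;
import dev.langchain4j.data.document.splitter.DocumentSplitters;
import dev.langchain4j.data.segment.TextSegment;

import java.util.List;

public class SplitSketch {
    public static void main(String[] args) {
        // Recursive splitting falls back from paragraphs to sentences to words
        // until each segment fits within the configured maximum size.
        DocumentSplitter splitter = DocumentSplitters.recursive(300, 30); // 300-char segments, 30-char overlap
        List<TextSegment> segments = splitter.split(Document.from("Some long document text..."));
        System.out.println(segments.size());
    }
}
```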