For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Sample appsIntegrationsDiscordPlaygroundDevEx repo
GuidesSDK ReferenceAPI Reference
GuidesSDK ReferenceAPI Reference
  • TwelveLabs API
    • Introduction
    • Authentication
  • API Reference
    • Manage indexes
    • Upload content
    • Index content
    • Manage videos
    • Manage entities
    • Any-to-video search
    • Create embeddings v2
    • Create embeddings v1
      • Create video embeddings
      • Create text, image, and audio embeddings
        • The embedding object
        • POSTCreate embeddings for text, image, and audio
    • Analyze and segment videos
    • Error codes
LogoLogo
Sample appsIntegrationsDiscordPlaygroundDevEx repo
API ReferenceCreate embeddings v1Create text, image, and audio embeddings

The embedding object

The embedding object has the following fields:

  • model_name: A string that represents the name of the video understanding model used by the platform to create the embeddings.

  • One or more of the following embedding fields, depending on the parameters specified in the request:

    • text_embedding
    • audio_embedding
    • image_embedding
      Each of these fields is an object containing the following fields, among other information:
      • segments: An array of objects that contains the embeddings for each segment and associated information. Each of these objects contains, among other information, an array of floating point numbers named float representing an embedding. This array has 512 dimensions, and you can use it with cosine similarity for various downstream tasks.

The Marengo video understanding model generates embeddings for all modalities in the same latent space. This shared space enables any-to-any searches across different types of content.

Was this page helpful?
Previous

Create embeddings for text, image, and audio

Next
Built with