This endpoint synchronously creates embeddings for multimodal content and returns the results immediately in the response.
When to use this endpoint:
Do not use this endpoint for:
POST method of the /embed-v2/tasks endpoint instead.
Text:
Images:
Audio and video:
This endpoint is rate-limited. For details, see the Rate limits page.
The type of content for the embeddings.
Values:
audio: Creates embeddings for an audio file
video: Creates embeddings for a video file
image: Creates embeddings for an image file
text: Creates embeddings for text input
text_image: Creates embeddings for text and an image
multi_input: Creates a single embedding from up to 10 images. You can optionally include text to provide context. To reference specific images in your text, use placeholders in the following format: <@name>, where name matches the name field of a media source.
The video understanding model to use. Value: “marengo3.0”.
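The <@name> placeholder rule for multi_input can be checked on the client before a request is sent. The following is a minimal sketch, not part of the API: the helper names are hypothetical, and the set of characters allowed in a name (letters, digits, underscore, hyphen) is an assumption, since this page does not specify it.

```python
import re

# Matches placeholders of the form <@name>. The allowed characters in a name
# are an assumption; this page does not define them.
PLACEHOLDER = re.compile(r"<@([A-Za-z0-9_-]+)>")


def unknown_placeholders(text, source_names):
    """Return placeholder names in `text` that match no media source name."""
    known = set(source_names)
    return [name for name in PLACEHOLDER.findall(text) if name not in known]


def check_multi_input(text, source_names, max_images=10):
    """Raise ValueError if there are too many images or a dangling placeholder."""
    if len(source_names) > max_images:
        raise ValueError(f"at most {max_images} images allowed, got {len(source_names)}")
    missing = unknown_placeholders(text, source_names)
    if missing:
        raise ValueError(f"placeholders without a matching source: {missing}")
```

For example, check_multi_input("Compare <@front> with <@back>", ["front", "back"]) passes silently, while dropping "back" from the list raises a ValueError.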
This field is required if the input_type parameter is text.
This field is required if the input_type parameter is image.
This field is required if the input_type parameter is text_image.
This field is required if the input_type parameter is audio.
This field is required if the input_type parameter is video.
This field is required if the input_type parameter is multi_input.
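The conditional requirements above can be expressed as a small pre-flight check. This is a sketch under a stated assumption: the field names are not shown on this page, so the code hypothetically assumes that each required field shares its name with the corresponding input_type value. Adjust the mapping if the actual field names differ.

```python
# Input types listed in this reference.
INPUT_TYPES = ("text", "image", "text_image", "audio", "video", "multi_input")


def required_field(input_type):
    """Return the body field assumed to be required for `input_type`.

    Assumption (hypothetical): the field is named after the input type.
    """
    if input_type not in INPUT_TYPES:
        raise ValueError(f"unsupported input_type: {input_type!r}")
    return input_type


def validate_body(body):
    """Raise ValueError if the field required by body['input_type'] is absent."""
    input_type = body.get("input_type")
    field = required_field(input_type)
    if field not in body:
        raise ValueError(f"field {field!r} is required when input_type is {input_type!r}")
    return body
```

Running this check locally surfaces a missing field before the request counts against the rate limit.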
Successful request; normal operation
Metadata for the media input. Available for image, text_image, audio, video, and multi_input inputs.
Your API key.
You can find your API key on the API Keys page.
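As an illustration of attaching the API key, the sketch below builds (but does not send) a JSON POST request with Python's standard library. Several details are assumptions not stated on this page: the base URL is a placeholder, the /embed-v2 path is inferred from the /embed-v2/tasks path mentioned earlier, and the x-api-key header name should be confirmed against the authentication documentation.

```python
import json
import urllib.request


def build_request(body, api_key, base_url="https://api.example.com"):
    """Build a POST request carrying `body` as JSON and the API key in a header.

    Assumptions: `base_url` is a placeholder, the path is /embed-v2, and the
    key travels in an x-api-key header.
    """
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/embed-v2",
        data=data,
        method="POST",
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
    )
```

The returned object can be passed to urllib.request.urlopen once the assumed values are replaced with the real ones. Reading the key from an environment variable rather than hard-coding it keeps it out of source control.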