mirror of
https://github.com/elastic/elasticsearch.git
synced 2025-04-25 07:37:19 -04:00
76 lines
No EOL
2.9 KiB
Text
76 lines
No EOL
2.9 KiB
Text
[role="xpack"]
|
|
[[put-inference-api]]
|
|
=== Create {infer} API
|
|
|
|
experimental[]
|
|
|
|
Creates an {infer} endpoint to perform an {infer} task.
|
|
|
|
[IMPORTANT]
|
|
====
|
|
* The {infer} APIs enable you to use certain services, such as built-in
|
|
{ml} models (ELSER, E5), models uploaded through Eland, Cohere, OpenAI, Mistral,
|
|
Azure OpenAI, Google AI Studio, Google Vertex AI, Anthropic or Hugging Face.
|
|
* For built-in models and models uploaded through Eland, the {infer} APIs offer an
|
|
alternative way to use and manage trained models. However, if you do not plan to
|
|
use the {infer} APIs to use these models or if you want to use non-NLP models,
|
|
use the <<ml-df-trained-models-apis>>.
|
|
====
|
|
|
|
|
|
[discrete]
|
|
[[put-inference-api-request]]
|
|
==== {api-request-title}
|
|
|
|
`PUT /_inference/<task_type>/<inference_id>`
|
|
|
|
[discrete]
|
|
[[put-inference-api-prereqs]]
|
|
==== {api-prereq-title}
|
|
|
|
* Requires the `manage_inference` <<privileges-list-cluster,cluster privilege>>
|
|
(the built-in `inference_admin` role grants this privilege)
|
|
|
|
[discrete]
|
|
[[put-inference-api-path-params]]
|
|
==== {api-path-parms-title}
|
|
|
|
`<inference_id>`::
|
|
(Required, string)
|
|
include::inference-shared.asciidoc[tag=inference-id]
|
|
|
|
`<task_type>`::
|
|
(Required, string)
|
|
include::inference-shared.asciidoc[tag=task-type]
|
|
+
|
|
--
|
|
Refer to the service list in the <<put-inference-api-desc,API description section>> for the available task types.
|
|
--
|
|
|
|
|
|
[discrete]
|
|
[[put-inference-api-desc]]
|
|
==== {api-description-title}
|
|
|
|
The create {infer} API enables you to create an {infer} endpoint and configure a {ml} model to perform a specific {infer} task.
|
|
|
|
The following services are available through the {infer} API.
|
|
You can find the available task types next to the service name.
|
|
Click the links to review the configuration details of the services:
|
|
|
|
* <<infer-service-alibabacloud-ai-search,AlibabaCloud AI Search>> (`completion`, `rerank`, `sparse_embedding`, `text_embedding`)
|
|
* <<infer-service-amazon-bedrock,Amazon Bedrock>> (`completion`, `text_embedding`)
|
|
* <<infer-service-anthropic,Anthropic>> (`completion`)
|
|
* <<infer-service-azure-ai-studio,Azure AI Studio>> (`completion`, `text_embedding`)
|
|
* <<infer-service-azure-openai,Azure OpenAI>> (`completion`, `text_embedding`)
|
|
* <<infer-service-cohere,Cohere>> (`completion`, `rerank`, `text_embedding`)
|
|
* <<infer-service-elasticsearch,Elasticsearch>> (`rerank`, `sparse_embedding`, `text_embedding` - this service is for built-in models and models uploaded through Eland)
|
|
* <<infer-service-elser,ELSER>> (`sparse_embedding`)
|
|
* <<infer-service-google-ai-studio,Google AI Studio>> (`completion`, `text_embedding`)
|
|
* <<infer-service-google-vertex-ai,Google Vertex AI>> (`rerank`, `text_embedding`)
|
|
* <<infer-service-hugging-face,Hugging Face>> (`text_embedding`)
|
|
* <<infer-service-mistral,Mistral>> (`text_embedding`)
|
|
* <<infer-service-openai,OpenAI>> (`completion`, `text_embedding`)
|
|
|
|
The {es} and ELSER services run on a {ml} node in your {es} cluster. The rest of
|
|
the services connect to external providers. |