elasticsearch/docs/reference/ml/trained-models/apis
Dimitris Athanasiou 4e67df8b05
[ML] Low priority trained model deployments (#91234)
This adds a new parameter to the start trained model deployment API,
namely `priority`. The available settings are `normal` and `low`.

For normal priority deployments the allocations get distributed so that
node processors are never oversubscribed.

Low priority deployments allow users to test model functionality even if there
are no node processors available. They are limited to 1 allocation with a single thread.
In addition, the process is executed in low priority which limits the amount of
CPU that can be used when the CPU is under pressure. The intention of this is to
limit the impact of low priority deployments on normal priority deployments.

When we rebalance model assignments we now:

  1. compute a plan just for normal priority deployments
  2. fix the resources used by normal deployments
  3. compute a plan just for low priority deployments
  4. merge the two plans

Closes #91024
2022-11-04 14:22:30 +02:00
..
clear-trained-model-deployment-cache.asciidoc [ML] Release native inference functionality as beta (#90418) 2022-09-28 11:09:02 +01:00
delete-trained-models-aliases.asciidoc [DOCS] Move trained model APIs out of dataframe analytics (#81315) 2021-12-03 09:21:09 -08:00
delete-trained-models.asciidoc [DOCS] Move trained model APIs out of dataframe analytics (#81315) 2021-12-03 09:21:09 -08:00
get-trained-models-stats.asciidoc [ML] Low priority trained model deployments (#91234) 2022-11-04 14:22:30 +02:00
get-trained-models.asciidoc [ML] add text_similarity nlp task documentation (#88994) 2022-08-02 12:17:14 -04:00
index.asciidoc [ML] Add api to update trained model deployment number_of_allocations (#90728) 2022-10-12 10:04:23 +03:00
infer-trained-model-deployment.asciidoc [ML] add new trained_models/{model_id}/_infer endpoint for all supervised models and deprecate deployment infer api (#86361) 2022-05-05 14:58:59 -04:00
infer-trained-model.asciidoc [ML] Release native inference functionality as beta (#90418) 2022-09-28 11:09:02 +01:00
ml-trained-models-apis.asciidoc [ML] Add api to update trained model deployment number_of_allocations (#90728) 2022-10-12 10:04:23 +03:00
put-trained-model-definition-part.asciidoc [ML] Release native inference functionality as beta (#90418) 2022-09-28 11:09:02 +01:00
put-trained-model-vocabulary.asciidoc [ML] Release native inference functionality as beta (#90418) 2022-09-28 11:09:02 +01:00
put-trained-models-aliases.asciidoc [DOCS] Move trained model APIs out of dataframe analytics (#81315) 2021-12-03 09:21:09 -08:00
put-trained-models.asciidoc [ML] add text_similarity nlp task documentation (#88994) 2022-08-02 12:17:14 -04:00
start-trained-model-deployment.asciidoc [ML] Low priority trained model deployments (#91234) 2022-11-04 14:22:30 +02:00
stop-trained-model-deployment.asciidoc [ML] Release native inference functionality as beta (#90418) 2022-09-28 11:09:02 +01:00
update-trained-model-deployment.asciidoc [ML] Add api to update trained model deployment number_of_allocations (#90728) 2022-10-12 10:04:23 +03:00