elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-06-30 02:13:33 -04:00

History

Dimitris Athanasiou 4e67df8b05 [ML] Low priority trained model deployments (#91234 ) This adds a new parameter to the start trained model deployment API, namely `priority`. The available settings are `normal` and `low`. For normal priority deployments the allocations get distributed so that node processors are never oversubscribed. Low priority deployments allow users to test model functionality even if there are no node processors available. They are limited to 1 allocation with a single thread. In addition, the process is executed in low priority which limits the amount of CPU that can be used when the CPU is under pressure. The intention of this is to limit the impact of low priority deployments on normal priority deployments. When we rebalance model assignments we now: 1. compute a plan just for normal priority deployments 2. fix the resources used by normal deployments 3. compute a plan just for low priority deployments 4. merge the two plans Closes #91024		2022-11-04 14:22:30 +02:00
..
clear-trained-model-deployment-cache.asciidoc	[ML] Release native inference functionality as beta (#90418 )	2022-09-28 11:09:02 +01:00
delete-trained-models-aliases.asciidoc	[DOCS] Move trained model APIs out of dataframe analytics (#81315 )	2021-12-03 09:21:09 -08:00
delete-trained-models.asciidoc	[DOCS] Move trained model APIs out of dataframe analytics (#81315 )	2021-12-03 09:21:09 -08:00
get-trained-models-stats.asciidoc	[ML] Low priority trained model deployments (#91234 )	2022-11-04 14:22:30 +02:00
get-trained-models.asciidoc	[ML] add text_similarity nlp task documentation (#88994 )	2022-08-02 12:17:14 -04:00
index.asciidoc	[ML] Add api to update trained model deployment number_of_allocations (#90728 )	2022-10-12 10:04:23 +03:00
infer-trained-model-deployment.asciidoc	[ML] add new trained_models/{model_id}/_infer endpoint for all supervised models and deprecate deployment infer api (#86361 )	2022-05-05 14:58:59 -04:00
infer-trained-model.asciidoc	[ML] Release native inference functionality as beta (#90418 )	2022-09-28 11:09:02 +01:00
ml-trained-models-apis.asciidoc	[ML] Add api to update trained model deployment number_of_allocations (#90728 )	2022-10-12 10:04:23 +03:00
put-trained-model-definition-part.asciidoc	[ML] Release native inference functionality as beta (#90418 )	2022-09-28 11:09:02 +01:00
put-trained-model-vocabulary.asciidoc	[ML] Release native inference functionality as beta (#90418 )	2022-09-28 11:09:02 +01:00
put-trained-models-aliases.asciidoc	[DOCS] Move trained model APIs out of dataframe analytics (#81315 )	2021-12-03 09:21:09 -08:00
put-trained-models.asciidoc	[ML] add text_similarity nlp task documentation (#88994 )	2022-08-02 12:17:14 -04:00
start-trained-model-deployment.asciidoc	[ML] Low priority trained model deployments (#91234 )	2022-11-04 14:22:30 +02:00
stop-trained-model-deployment.asciidoc	[ML] Release native inference functionality as beta (#90418 )	2022-09-28 11:09:02 +01:00
update-trained-model-deployment.asciidoc	[ML] Add api to update trained model deployment number_of_allocations (#90728 )	2022-10-12 10:04:23 +03:00