The inference node stats for deployed PyTorch inference models now contain two new fields: `inference_cache_hit_count` and `inference_cache_hit_count_last_minute`. These indicate how many inferences on that node were served from the C++-side response cache that was added in https://github.com/elastic/ml-cpp/pull/2305. Cache hits occur when exactly the same inference request is sent to the same node more than once. The `average_inference_time_ms` and `average_inference_time_ms_last_minute` fields now refer to the time taken to do the cache lookup, plus, if necessary, the time to do the inference. We would expect average inference time to be vastly reduced in situations where the cache hit rate is high.
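As a quick sketch of where the new counters surface, the per-node deployment stats returned by the get trained models statistics API include them alongside the existing timing fields. The model ID `my_pytorch_model` is hypothetical, and the response below is abridged and illustrative, not the full stats document:

```
GET _ml/trained_models/my_pytorch_model/_stats
```

```js
// Abridged, illustrative response; the values are made up.
{
  "trained_model_stats": [
    {
      "model_id": "my_pytorch_model",
      "deployment_stats": {
        "nodes": [
          {
            "inference_count": 100,
            "inference_cache_hit_count": 40,
            "inference_cache_hit_count_last_minute": 6,
            "average_inference_time_ms": 12.5,
            "average_inference_time_ms_last_minute": 2.1
          }
        ]
      }
    }
  ]
}
```

When the hit rate is high, the averages mostly measure cache lookups rather than full inferences, so `average_inference_time_ms_last_minute` can sit well below the uncached figure.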
Files in this directory:

- delete-trained-models-aliases.asciidoc
- delete-trained-models.asciidoc
- get-trained-models-stats.asciidoc
- get-trained-models.asciidoc
- index.asciidoc
- infer-trained-model-deployment.asciidoc
- infer-trained-model.asciidoc
- ml-trained-models-apis.asciidoc
- put-trained-model-definition-part.asciidoc
- put-trained-model-vocabulary.asciidoc
- put-trained-models-aliases.asciidoc
- put-trained-models.asciidoc
- start-trained-model-deployment.asciidoc
- stop-trained-model-deployment.asciidoc