elasticsearch/docs/reference/ml/trained-models/apis
David Roberts 15e7b06b79
[ML] Add inference cache hit count to inference node stats (#88807)
The inference node stats for deployed PyTorch inference
models now contain two new fields: `inference_cache_hit_count`
and `inference_cache_hit_count_last_minute`.

These indicate how many inferences on that node were served
from the C++-side response cache that was added in
https://github.com/elastic/ml-cpp/pull/2305. Cache hits
occur when exactly the same inference request is sent to the
same node more than once.
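
As a rough illustration of the hit-counting behaviour described above, here is a minimal Python sketch. It is not the ml-cpp implementation: the class name `ResponseCache`, the `max_entries` parameter, and the least-recently-used eviction are all assumptions made for the example.

```python
from collections import OrderedDict


class ResponseCache:
    """Hypothetical per-node response cache keyed by the exact request."""

    def __init__(self, max_entries=128):
        self._entries = OrderedDict()
        self._max_entries = max_entries  # assumed bound, for illustration only
        self.hit_count = 0

    def infer(self, request, run_inference):
        # A hit requires exactly the same request to have been seen before.
        if request in self._entries:
            self._entries.move_to_end(request)
            self.hit_count += 1
            return self._entries[request]
        # Miss: run the real inference and remember the response.
        response = run_inference(request)
        self._entries[request] = response
        if len(self._entries) > self._max_entries:
            # Evict the least recently used entry (assumed policy).
            self._entries.popitem(last=False)
        return response
```

Sending the same request twice to one `ResponseCache` instance increments `hit_count` once; a different request, or the same request sent to a different instance (node), does not.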

The `average_inference_time_ms` and
`average_inference_time_ms_last_minute` fields now refer to
the time taken to do the cache lookup, plus, if necessary,
the time to do the inference. Average inference time should
therefore drop substantially when the cache hit rate is
high.
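
The relationship between hit rate and average time can be sketched as below. This is a simplified model, not code from the change: the function names, the `lookup_ms` and `inference_ms` inputs, and the assumption that cache hits are included in the node's total `inference_count` are all hypothetical.

```python
def cache_hit_rate(inference_cache_hit_count, inference_count):
    """Fraction of inferences served from the cache (0.0 if none were run).

    Assumes inference_count is the total number of inferences on the node,
    including those answered from the cache.
    """
    if inference_count == 0:
        return 0.0
    return inference_cache_hit_count / inference_count


def expected_average_time_ms(hit_rate, lookup_ms, inference_ms):
    """Every request pays the lookup; only misses pay for the inference."""
    return lookup_ms + (1.0 - hit_rate) * inference_ms
```

Under this model, a node with 90 hits out of 100 inferences, a 0.1 ms lookup, and a 50 ms inference would average about 5.1 ms per request instead of roughly 50 ms.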
2022-07-26 17:53:43 +01:00
delete-trained-models-aliases.asciidoc [DOCS] Move trained model APIs out of dataframe analytics (#81315) 2021-12-03 09:21:09 -08:00
delete-trained-models.asciidoc [DOCS] Move trained model APIs out of dataframe analytics (#81315) 2021-12-03 09:21:09 -08:00
get-trained-models-stats.asciidoc [ML] Add inference cache hit count to inference node stats (#88807) 2022-07-26 17:53:43 +01:00
get-trained-models.asciidoc [ML] add roberta/bart docs (#85001) 2022-03-17 12:14:57 -04:00
index.asciidoc [DOCS] Removes infer trained model deployment API (#86497) 2022-05-10 09:56:36 -07:00
infer-trained-model-deployment.asciidoc [ML] add new trained_models/{model_id}/_infer endpoint for all supervised models and deprecate deployment infer api (#86361) 2022-05-05 14:58:59 -04:00
infer-trained-model.asciidoc [DOCS] Updates infer trained model API docs with inference_config (#88500) 2022-07-13 17:47:05 +02:00
ml-trained-models-apis.asciidoc [DOCS] Removes infer trained model deployment API (#86497) 2022-05-10 09:56:36 -07:00
put-trained-model-definition-part.asciidoc [DOCS] Add preview admonition to infer API (#86486) 2022-05-05 13:49:02 -07:00
put-trained-model-vocabulary.asciidoc [DOCS] Add preview admonition to infer API (#86486) 2022-05-05 13:49:02 -07:00
put-trained-models-aliases.asciidoc [DOCS] Move trained model APIs out of dataframe analytics (#81315) 2021-12-03 09:21:09 -08:00
put-trained-models.asciidoc [DOCS] Updates infer trained model API docs with inference_config (#88500) 2022-07-13 17:47:05 +02:00
start-trained-model-deployment.asciidoc [ML] add new cache_size parameter to trained_model deployments API (#88450) 2022-07-18 09:19:01 -04:00
stop-trained-model-deployment.asciidoc [DOCS] Add preview admonition to infer API (#86486) 2022-05-05 13:49:02 -07:00