elasticsearch/docs/reference/ml
Valeriy Khakhutskyy 95758e88a2
[ML] Explain anomaly score factors (#90675)
This PR surfaces new information in the anomaly record about how each factor affects the initial anomaly score:

- single bucket impact is determined by the deviation between actual and typical in the current bucket
- multi-bucket impact is determined by the deviation between actual and typical in the past 12 buckets
- anomaly characteristics are statistical properties of the current anomaly compared to the historical observations
- high variance penalty is the reduction of the anomaly score in buckets with large confidence intervals
- incomplete bucket penalty is the reduction of the anomaly score in buckets with fewer samples than historically expected

Additionally, we compute the lower and upper confidence bounds and the typical value for anomaly records. This improves explainability in cases where the model plot is not enabled, at the cost of only a slight performance overhead (1-2%).
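
As a rough illustration of how a client might consume the factors described above, here is a minimal Python sketch. The `anomaly_score_explanation` key and the field names under it are assumptions made for this example and should be checked against the anomaly record documentation; `summarize_explanation` is a hypothetical helper, not part of any Elasticsearch client.

```python
from typing import Any, Dict, List

# Assumed field names, for illustration only; verify against the record docs.
IMPACT_FIELDS = [
    "single_bucket_impact",
    "multi_bucket_impact",
    "anomaly_characteristics_impact",
]
PENALTY_FIELDS = ["high_variance_penalty", "incomplete_bucket_penalty"]


def summarize_explanation(record: Dict[str, Any]) -> List[str]:
    """Return human-readable lines for the score factors found in a record."""
    explanation = record.get("anomaly_score_explanation", {})
    lines: List[str] = []

    # Report every impact factor that is present in the record.
    for name in IMPACT_FIELDS:
        if name in explanation:
            lines.append(f"{name}: {explanation[name]}")

    # Report a penalty only when it is present and truthy.
    for name in PENALTY_FIELDS:
        if explanation.get(name):
            lines.append(f"{name}: applied")

    # Report the bounds and typical value only when both bounds are available.
    lower = explanation.get("lower_confidence_bound")
    upper = explanation.get("upper_confidence_bound")
    typical = explanation.get("typical_value")
    if lower is not None and upper is not None:
        lines.append(f"confidence bounds: [{lower}, {upper}], typical: {typical}")

    return lines
```

Records without an explanation object yield an empty list, so the helper can be applied to records produced before this change.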
2022-10-12 16:57:06 +02:00
..
anomaly-detection [ML] Explain anomaly score factors (#90675) 2022-10-12 16:57:06 +02:00
common/apis [ML] Performance improvements for categorization jobs (#89824) 2022-09-08 18:41:01 +01:00
df-analytics/apis [ML] Lift limit of max number of classes for classification to 100 (#89755) 2022-09-01 10:47:58 +03:00
images [DOCS] Updates anomaly detection alert rule type screenshot. (#89532) 2022-08-23 15:37:40 +02:00
trained-models/apis [ML] Add api to update trained model deployment number_of_allocations (#90728) 2022-10-12 10:04:23 +03:00
ml-shared.asciidoc [ML] add text_similarity nlp task documentation (#88994) 2022-08-02 12:17:14 -04:00