Commit graph

8 commits

Author SHA1 Message Date
Lisa Cawley
40f72fd75c
[DOCS] Update datafeed details in ML docs (#76854) (#76948) 2021-08-25 15:15:40 -07:00
elasticsearchmachine
29a50ae5bd
[DOCS] Fixes bulleted list in ML aggregations (#75806) (#75809)
Co-authored-by: Lisa Cawley <lcawley@elastic.co>
2021-07-28 11:49:32 -07:00
István Zoltán Szabó
c5c8ef208c
[DOCS] Removes Kibana charts-related advise about agg interval and bucket span. (#73673) (#73677) 2021-06-02 17:16:43 +02:00
Benjamin Trent
abb182d95c
[7.x] [ML] adding support for composite aggs in anomaly detection (#69970) (#71052)
* [ML] adding support for composite aggs in anomaly detection (#69970)

This commit allows for composite aggregations in datafeeds.

Composite aggs provide a much better solution for having influencers, partitions, etc. on high volume data. Instead of worrying about long scrolls in the datafeed, the calculation is distributed across cluster via the aggregations.

The restrictions for this support are as follows:

- The composite aggregation must have EXACTLY one `date_histogram` source
- The sub-aggs of the composite aggregation must have a `max` aggregation on the SAME timefield as the aforementioned `date_histogram` source
- The composite agg must be the ONLY top level agg and it cannot have a `composite` or `date_histogram` sub-agg
- If using a `date_histogram` to bucket time, it cannot have a `composite` sub-agg.
- The top-level `composite` agg cannot have a sibling pipeline agg. Pipeline aggregations are supported as a sub-agg (thus a pipeline agg INSIDE the bucket).

Some key user interaction differences:
- Speed + resources used by the cluster should be controlled by the `size` parameter in the `composite` aggregation. Previously, we said if you are using aggs, use a specific `chunking_config`. But, with composite, that is not necessary.
- Users really shouldn't use nested `terms` aggs anylonger. While this is still a "valid" configuration and MAY be desirable for some users (only wanting the top 10 of certain terms), typically when users want influencers, partition fields, etc. they want the ENTIRE population. Previously, this really wasn't possible with aggs, with `composite` it is.
- I cannot really think of a typical usecase that SHOULD ever use a multi-bucket aggregation that is NOT supported by composite.
2021-03-30 12:04:54 -04:00
István Zoltán Szabó
ee114e7c90
[DOCS] Fixes typo in Aggregating data for faster performance. (#65354) (#65356) 2020-11-23 13:03:20 +01:00
István Zoltán Szabó
53c64d594b
[DOCS] Adds UI related limitation to configuring aggs docs (#65184) (#65327)
Co-authored-by: Lisa Cawley <lcawley@elastic.co>
2020-11-23 09:40:26 +01:00
István Zoltán Szabó
ea9022551c
[DOCS] Fixes example aggregation syntax in datafeed aggregations. (#64936) (#64942) 2020-11-11 17:45:30 +01:00
Lisa Cawley
fb212269ce
[DOCS] Changes level offset of anomaly detection pages (#59911) (#59940) 2020-07-20 17:04:59 -07:00
Renamed from docs/reference/ml/anomaly-detection/aggregations.asciidoc (Browse further)