elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-06-28 17:34:17 -04:00

Author	SHA1	Message	Date
Christos Soulios	cefc6af25b	Histogram field type support for Sum aggregation (#55681 ) Implements Sum aggregation over Histogram fields by summing the value of each bucket multiplied by their count as requested in #53285	2020-04-29 11:09:25 +03:00
Zachary Tong	9f165bd44e	Aggs must specify a `field` or `script` (or both) (#52226 ) * Aggs must specify a `field` or `script` (or both) This adds a validation to VSParserHelper to ensure that a field or script or both are specified by the user. This is technically required today already, but throws an exception much deeper in the agg framework and has a very unintuitive error for the user (as well as eating more resources instead of failing early) * Fix StringStats test * Add yaml test * Skip test on older versions Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-04-23 14:26:38 -04:00
Igor Motov	6d28596ead	Add support for filters to T-Test aggregation (#54980 ) Adds support for filters to T-Test aggregation. The filters can be used to select populations based on some criteria and use values from the same or different fields. Closes #53692	2020-04-10 10:19:07 -04:00
Igor Motov	5fc9fc528d	Add Student's t-test aggregation support (#54469 ) Adds t_test metric aggregation that can perform paired and unpaired two-sample t-tests. In this PR support for filters in unpaired is still missing. It will be added in a follow-up PR. Relates to #53692	2020-04-03 11:31:13 -04:00
Gil Raphaelli	4090568797	[DOCS] Fix typos in top metrics agg docs (#54299 )	2020-03-27 10:48:01 -04:00
Paweł Krześniak	de1229cc2b	[DOCS] link fix (#53973 ) Fix bad link in top_metrics.	2020-03-23 13:28:43 -04:00
Zachary Tong	84a59f8447	Add scripting, supported-type tests to ValueCount (#53500 ) Also adds a few small notes to the documentation regarding potentially unintuitive behavior	2020-03-16 15:15:25 -04:00
Lisa Cawley	4a5feab88d	[DOCS] Add anchors for scripted metric aggregations (#53618 )	2020-03-16 12:14:01 -07:00
Nik Everett	230a9a8975	Improve top_metrics docs (#53521 ) * Removes experimental. * Replaces `"v"` (for value) with `"m"` (for metric). * Move the note about tiebreaking into the list of limitations of the sort. * Explain how you ask for `metrics`. * Clean up some wording. * Link to the docs from `top_metrics`. Closes #51813	2020-03-16 13:23:22 -04:00
Nik Everett	8410356c5b	Preserve metric types in top_metrics (#53288 ) This changes the `top_metrics` aggregation to return metrics in their original type. Since it only supports numerics, that means that dates, longs, and doubles will come back as stored, with their appropriate formatter applied.	2020-03-11 16:44:08 -04:00
Anton Dollmaier	e9c8c03fee	[DOCS] Fix parameter formatting for GeoHash grid agg docs (#53032 ) Adds missing colon (`:`) to the parameter definition list.	2020-03-09 08:17:57 -04:00
Nik Everett	56058ab6af	Support multiple metrics in `top_metrics` agg (#52965 ) This adds support for returning multiple metrics to the `top_metrics` agg. It looks like: ``` POST /test/_search?filter_path=aggregations { "aggs": { "tm": { "top_metrics": { "metrics": [ {"field": "v"}, {"field": "m"} ], "sort": {"s": "desc"} } } } } ```	2020-03-05 06:53:37 -05:00
Nik Everett	f4223b6a8f	Add size support to `top_metrics` (#52662 ) This adds support for returning the top "n" metrics instead of just the very top. Relates to #51813	2020-02-27 11:14:57 -05:00
István Zoltán Szabó	14555ca01e	[DOCS] Links transforms in aggregation docs (#52563 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-02-21 08:22:04 +01:00
Nik Everett	5b2266601b	Implement top_metrics agg (#51155 ) The `top_metrics` agg is kind of like `top_hits` but it only works on doc values so it should be faster. At this point it is fairly limited in that it only supports a single, numeric sort and a single, numeric metric. And it only fetches the "very topest" document worth of metric. We plan to support returning a configurable number of top metrics, requesting more than one metric and more than one sort. And, eventually, non-numeric sorts and metrics. The trick is doing those things fairly efficiently. Co-Authored by: Zachary Tong <zach@elastic.co>	2020-02-14 07:13:52 -05:00
Igor Motov	0898df4aac	Add histogram field type support to boxplot aggs (#52265 ) Add support for the histogram field type to boxplot aggs. Closes #52233 Relates to #33112	2020-02-13 08:59:44 -05:00
Igor Motov	c50cfa0668	Add Boxplot Aggregation (#51948 ) Adds a `boxplot` aggregation that calculates min, max, medium and the first and the third quartiles of the given data set. Closes #33112	2020-02-07 18:01:20 -05:00
Mark Tozzi	928c663ce0	Fix dangling 'either' in weighted average docs (#51748 )	2020-01-31 12:45:46 -05:00
Elvis Saravia	520da54e63	update pipeline.asciidoc typo	2020-01-24 14:03:01 +01:00
Igor Motov	23be11cf6c	Fix leftover mentions of method parameter in Percentile Aggs (#51272 ) The method parameter is not used in the percentile aggs, instead the method is determined by the presence of `hdr` or `tdigest` objects. Relates to #8324	2020-01-22 05:02:48 -10:00
Tal Levy	6c86606d2a	Adds support for geo-bounds filtering in geogrid aggregations (#50002 ) It is fairly common to filter the geo point candidates in geohash_grid and geotile_grid aggregations according to some viewable bounding box. This change introduces the option of specifying this filter directly in the tiling aggregation. This is even more relevant to `geo_shape` where the bounds will restrict the shape to be within the bounds this optional `bounds` parameter is parsed in an equivalent fashion to the bounds specified in the geo_bounding_box query.	2020-01-14 08:29:10 -08:00
Nik Everett	326d696d9a	Support offset in composite aggs (#50609 ) Adds support for the `offset` parameter to the `date_histogram` source of composite aggs. The `offset` parameter is supported by the normal `date_histogram` aggregation and is useful for folks that need to measure things from, say, 6am one day to 6am the next day. This is implemented by creating a new `Rounding` that knows how to handle offsets and delegates to other rounding implementations. That implementation doesn't fully implement the `Rounding` contract, namely `nextRoundingValue`. That method isn't used by composite aggs so I can't be sure that any implementation that I add will be correct. I propose to leave it throwing `UnsupportedOperationException` until I need it. Closes #48757	2020-01-07 14:49:09 -05:00
James Rodewig	7f35bcdfc9	[DOCS] Warn about using `geo_centroid` as sub-agg to `geohash_grid` (#50038 ) If `geo_point fields` are multi-valued, using `geo_centroid` as a sub-agg to `geohash_grid` could result in centroids outside of bucket boundaries. This adds a related warning to the geo_centroid agg docs.	2020-01-06 07:45:49 -06:00
Nik Everett	a7cc0b0159	Docs: Refine note about `after_key` (#50475 ) * Docs: Refine note about `after_key` I was curious about composite aggregations, specifically I wanted to know how to write a composite aggregation that had all of its buckets filtered out so you had to use the `after_key`. Then I saw that we've declared composite aggregations not to work with pipelines in #44180. So I'm not sure you can do that any more. Which makes the note about `after_key` inaccurate. This rejiggers that section of the docs a little so it is more obvious that you send the `after_key` back to us. And so it is more obvious that you should only use the `after_key` that we give you rather than try to work it out for yourself. * Apply suggestions from code review Co-Authored-By: James Rodewig <james.rodewig@elastic.co> Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-01-02 10:02:55 -05:00
James Rodewig	3460dc9542	[DOCS] Percentile aggs are non-deterministic (#50468 ) Percentile aggregations are non-deterministic. A percentile aggregation can produce different results even when using the same data. Based on [this discuss post][0], the non-deterministic property stems from processes in Lucene that can affect the order in which docs are provided to the aggregation. This adds a warning stating that the aggregation is non-deterministic and what that means. [0]: https://discuss.elastic.co/t/different-results-for-same-query/111757	2019-12-23 13:11:31 -05:00
Florian Kelbert	0778c34630	[DOCS] Fix typo in bucket sum aggregation docs (#50431 )	2019-12-20 08:47:24 -05:00
Lisa Cawley	6d608e6a0d	[DOCS] Move transform resource definitions into APIs (#50108 )	2019-12-17 09:01:31 -08:00
Jim Ferenczi	804a5042e7	Optimize composite aggregation based on index sorting (#48399 ) Co-authored-by: Daniel Huang <danielhuang@tencent.com> This is a spinoff of #48130 that generalizes the proposal to allow early termination with the composite aggregation when leading sources match a prefix or the entire index sort specification. In such case the composite aggregation can use the index sort natural order to early terminate the collection when it reaches a composite key that is greater than the bottom of the queue. The optimization is also applicable when a query other than match_all is provided. However the optimization is deactivated for sources that match the index sort in the following cases: * Multi-valued source, in such case early termination is not possible. * missing_bucket is set to true	2019-12-17 14:02:06 +01:00
James Rodewig	2d9ee5ddfe	[DOCS] Correct percentile rank agg example response (#50052 ) The example snippets in the percentile rank agg docs use a test dataset named `latency`, which is generated from docs/gradle.build. At some point the dataset and example snippets were updated, but the text surrounding the snippets was not. This means the text and the example snippets shown no longer match up. This corrects that by changing the snippets using /TESTRESPONSE magic comments.	2019-12-12 08:38:48 -05:00
Ignacio Vera	eade4f03f4	New Histogram field mapper that supports percentiles aggregations. (#48580 ) This commit adds a new histogram field mapper that consists in a pre-aggregated format of numerical data to be used in percentiles aggregations.	2019-11-28 13:58:20 +01:00
Przemko Robakowski	04f6b6fdb2	[DOCS] IDs for doc snippets (#49008 ) * Ids for docs snippets * Ids for tests * Ids for docs snippets * ignoring build folder from idea * Ignoring build-eclipse	2019-11-25 15:30:00 +01:00
Lisa Cawley	a4efab6ab4	[DOCS] Merge rollup config details into API (#49412 )	2019-11-22 08:31:30 -08:00
Christos Soulios	b0e12c936b	Implement stats aggregation for string terms (#47468 ) This PR adds a new metric aggregation called string_stats that operates on string terms of a document and returns the following: min_length: The length of the shortest term max_length: The length of the longest term avg_length: The average length of all terms distribution: The probability distribution of all characters appearing in all terms entropy: The total Shannon entropy value calculated for all terms This aggregation has been implemented as an analytics plugin.	2019-11-14 16:07:54 +02:00
James Rodewig	f53eba024b	[DOCS] Remove binary gendered language (#48362 )	2019-10-23 09:36:31 -05:00
Ian Danforth	24cf883792	[DOCS] Fix typo in percentile rank aggregation docs (#47247 )	2019-10-15 15:56:32 -04:00
Alan Woodward	566e1b7d33	Remove type field from DocWriteRequest and associated Response objects (#47671 ) This commit removes the type field from index, update and delete requests, and their associated responses. Relates to #41059	2019-10-11 10:23:55 +01:00
Alan Woodward	7a622f024f	Remove types from BulkRequest (#46983 ) This commit removes types entirely from BulkRequest, both as a global parameter and as individual entries on update/index/delete lines. Relates to #41059	2019-10-07 13:29:12 +01:00
Mark Tozzi	c26ce1d7f5	DocValueFormat implementation for date range fields (#47472 )	2019-10-04 16:01:28 -04:00
Mark Tozzi	57a679fbbb	Documentation notes for Range field histograms (#46890 )	2019-10-01 10:46:04 -04:00
Alan Woodward	c1f99e2d75	Remove `_type` from SearchHit (#46942 ) This commit removes the `_type` field from all search hit responses. Relates to #41059	2019-09-23 19:14:54 +01:00
Javier Ruiz	e8dac62a4a	[DOCS] Fix calendar interval typos for date histo agg (#46911 )	2019-09-20 15:22:04 -04:00
James Rodewig	370e434986	[DOCS] Correct several [source,console-result] snippets (#46930 )	2019-09-20 11:23:15 -04:00
markharwood	dc0abec595	Remove Adjacency_matrix setting in favour of Lucene Boolean query clause setting (#46327 ) Closes #46324	2019-09-19 16:48:04 +01:00
Philipp Krenn	7c5adcc7c1	Minor improvement to the nested aggregation docs (#46475 ) * Minor improvement to the nested aggregation docs * The attributes name and resellers.name were rather confusing, especially since the first one was dynamically mapped and not shown in the documentation (you had to read the test to see it). This change introduces a unique name for the nested attribute and adds the example document to the documentation. * Change the index name from "index" to something more speaking. * Update docs/reference/aggregations/bucket/nested-aggregation.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co> * Update docs/reference/aggregations/bucket/nested-aggregation.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co> * Update docs/reference/aggregations/bucket/nested-aggregation.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-11 11:23:39 -04:00
James Rodewig	e43be90e6c	[DOCS] [5 of 5] Change // TESTRESPONSE comments to [source,console-results] (#46449 )	2019-09-06 14:05:36 -04:00
James Rodewig	466c59a4a7	[DOCS] Replace "// TESTRESPONSE" magic comments with "[source,console-result] (#46295 )	2019-09-05 16:47:18 -04:00
James Rodewig	f5827ba0ae	[DOCS] Replace "// CONSOLE" comments with [source,console] (#46159 )	2019-09-04 12:51:02 -04:00
Zachary Tong	758f7999b7	Add CumulativeCard pipeline agg to pipeline index (#46279 ) The Cumulative Cardinality docs weren't linked from the pipeline index page	2019-09-03 12:10:34 -04:00
Zachary Tong	273c35f79c	Add Cumulative Cardinality agg (and Data Science plugin) (#43661 ) This adds a pipeline aggregation that calculates the cumulative cardinality of a field. It does this by iteratively merging in the HLL sketch from consecutive buckets and emitting the cardinality up to that point. This is useful for things like finding the total "new" users that have visited a website (as opposed to "repeat" visitors). This is a Basic+ aggregation and adds a new Data Science plugin to house it and future advanced analytics/data science aggregations.	2019-08-26 10:43:24 -04:00
LHearen	6dadce1112	[DOCS] Correct conditional clause in histogram agg docs (#45643 )	2019-08-19 10:09:10 -04:00

... 3 4 5 6 7 ...

573 commits