elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-04-25 15:47:23 -04:00

Author	SHA1	Message	Date
Sean Letendre	67cacde18b	Corrected an incomplete sentence. (#86542 ) * Corrected an incomplete sentence. * Update docs/reference/aggregations/metrics/avg-aggregation.asciidoc Co-authored-by: Christos Soulios <1561376+csoulios@users.noreply.github.com> Co-authored-by: David Kilfoyle <41695641+kilfoyle@users.noreply.github.com> Co-authored-by: Christos Soulios <1561376+csoulios@users.noreply.github.com>	2022-07-12 09:19:58 -04:00
Mark Tozzi	9ee6a19187	Add ability to select execution mode for cardinality aggregation (#87704 ) Plumbs through a new parameter for the cardinality aggregation, to allow configuring the execution mode. This can have significant impacts on speed and memory usage. This PR exposes three collection modes and two heuristics that we can tune going forward. All of these are treated as hints and can be silently ignored, e.g. if not applicable to the given field type. I've change the default behavior to optimize for time, which potentially uses more memory. Users can override this for the old behavior if needed.	2022-07-05 09:11:22 -04:00
Umut Uz	53461f89f1	Remove duplicate text from cardinality aggs docs (#86615 ) The same explanation is repeated twice within a section.	2022-05-19 11:51:31 -07:00
Craig Taverner	5f7ea792ac	Soft-deprecation of point/geo_point formats (#86835 ) * Soft-deprecation of point/geo_point formats Since GeoJSON and WKT are now common formats for all three types: geo_shape, geo_point and point We decided to soft-deprecate the other point formats by ordering: * GeoJSON (object with keys `type` and `coordinates`) * WKT `POINT(x y)` * Object with keys `lat` and `lon` (or `x` and `y` for point) * Array [lon,lat] * String `"lat,lon"` (or `"x,y"` in point) * String with geohash (only in `geo_point`) The geohash is last because it is only in one field type. The string version is second last because it is the most controversial being the only version to reverse the coordinate order from all other formats (for geo_point only, since the coordinates are not reversed in point). In addition we replaced many examples in both documentation and tests to prioritize WKT over the plain string format. Many remaining examples of array format or object with keys still exist and could be replaced by, for example, GeoJSON, if we feel the need. * Incorrect quote position	2022-05-17 23:46:43 +02:00
James Rodewig	74e4add3a8	[DOCS] Update sum aggregation for histograms (#84493 ) (#84496 ) Fixes an error and test snippets for the sum aggregation example for histograms. Closes #84491 Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com> (cherry picked from commit `fb45ac9dea`) Co-authored-by: Maja Grubic <maja.grubic@elastic.co>	2022-03-01 08:42:05 -05:00
James Rodewig	d31bdd6bf4	[DOCS] Remove unneeded callouts from snippets (#83798 ) These callouts aren't referenced anywhere. Leaving them in can be confusing.	2022-02-10 15:04:46 -05:00
James Rodewig	280fd2fff7	[DOCS] Fix min/max agg snippets for histograms (#83695 ) * Updates the `min` and `max` snippets for histograms. These should now run as docs integration tests. * Fixes a copy/paste error in the `max` aggregation snippet for histograms. Relates to https://github.com/elastic/elasticsearch/pull/83384	2022-02-08 19:48:15 -05:00
William Chaparro	c8e8104f66	[DOCS] Remove experimental language from HDR Histo percentiles/ranks (#81773 ) per issue 60780, decision from team to remove experimental language from HDR Histogram percentiles and ranks. Feature has been in production for quite some time. closes #60780	2021-12-15 14:35:08 -05:00
Salvatore Campagna	2b5ebba94a	[DOCS] Fix the weighed average documentation (#81307 ) The documentations states that if the `weight` field is missing, and no explicit missing configuration is provided, a default value of 1 is used. This is incorrect and does not match the implementation of the weighted average aggregator. In this specific case the document is skipped, instead.	2021-12-03 23:28:41 +01:00
James Rodewig	f56a0f4b66	[DOCS] Remove `testenv` annotations from doc snippet tests (#80023 ) Removes `testenv` annotations and related code. These annotations originally let you skip x-pack snippet tests in the docs. However, that's no longer possible. Relates to #79309, #31619	2021-11-05 18:38:50 -04:00
Christos Soulios	de93d95dcf	Fix rate agg with custom `_doc_count` (#79346 ) When running a rate aggregation without setting the field parameter, the result is computed based on the bucket doc_count. This PR adds support for a custom _doc_count field. Closes #77734	2021-10-19 13:25:54 +03:00
Benjamin Trent	100f222650	Adds support for the rate aggregation under a composite agg (#76992 ) rate aggregation should support being a sub-aggregation of a composite agg. The catch is that the composite aggregation source must be a date histogram. Other sources can be present but their must be exactly one date histogram source otherwise the rate aggregation does not know which interval to compare its unit rate to. closes https://github.com/elastic/elasticsearch/issues/76988	2021-09-01 07:29:13 -04:00
James Rodewig	fc0ac1923d	[DOCS] Correct spelling for geo terms (#76028 ) Changes: * Use "geopoint" when not referring to the literal field type * Use "geoshape" when not referring to the literal field type or query type * Use "GeoJSON" consistently	2021-08-03 09:55:48 -04:00
James Rodewig	0360ce48b4	[DOCS] Clarify supported fields for `top_metrics` agg (#73907 ) Changes: * Notes `metrics.field` supports `boolean` fields and runtime fields. * Notes `metrics.field` doesn't support array values. Closes #72889	2021-06-08 13:19:43 -04:00
Pierre Grimaud	3c44dfec60	[DOCS] Fix typos (#72227 )	2021-04-26 12:40:38 -04:00
Nik Everett	6a1220e7f3	Convert metric aggs docs runtime fields (#71260 ) This replaces the `script` docs for bucket aggregations with runtime fields. We expect runtime fields to be nicer to work with because you can also fetch them or filter on them. We expect them to be faster because their don't need this sort of `instanceof` tree: `a92a647b9f/server/src/main/java/org/elasticsearch/search/aggregations/support/values/ScriptDoubleValues.java (L42)` Relates to #69291 Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com> Co-authored-by: Adam Locke <adam.locke@elastic.co>	2021-04-05 13:08:13 -04:00
James Rodewig	693807a6d3	[DOCS] Fix double spaces (#71082 )	2021-03-31 09:57:47 -04:00
Nik Everett	1195b20a83	Docs: Add example fetching keyword in top_metrics (#69135 ) Adds an example of fetching a keyword field.	2021-02-17 12:10:34 -05:00
James Rodewig	9b88ae92e6	[DOCS] Fix typos for duplicate words (#69125 )	2021-02-17 10:34:20 -05:00
Dario Gieselaar	a28e45c0c5	[DOCS] Remove keyword/ip from list of unsupported fields in top_metrics agg (#69036 )	2021-02-17 08:41:57 -05:00
James Rodewig	ab0f4d51b2	[DOCS] Add missing newline for bulleted list in top_metrics docs (#68481 ) (#68550 ) Co-authored-by: Nathan L Smith <nathan.smith@elastic.co>	2021-02-04 14:49:02 -05:00
Adam Locke	82bfbe1195	[DOCS] Adding headers in TOC for aggregation docs. (#66604 )	2020-12-18 11:31:42 -05:00
Igor Motov	a065b6d8da	Return an error when a rate aggregation cannot calculate bucket sizes (#65429 ) In some cases when the rate aggregation is not a child of a date histogram aggregation, it is not possible to determine the actual size of the date histogram bucket. In this case the rate aggregation now throws an exception. Closes #63703	2020-11-25 10:05:51 -05:00
Tal Levy	b514d9bf2e	Add geo_line aggregation (#41612 ) A metric aggregation that aggregates a set of points as a GeoJSON LineString ordered by some sort parameter. #### specifics A `geo_line` aggregation request would specify a `geo_point` field, as well as a `sort` field. `geo_point` represents the values used in the LineString, while the `sort` values will be used as the total ordering of the points. the `sort` field would support any numeric field, including date. #### sample usage ``` { "query": { "bool": { "must": [ { "term": { "person": "004" } }, { "term": { "trajectory": "20090131002206.plt" } } ] } }, "aggs": { "make_line": { "geo_line": { "point": {"field": "location"}, "sort": { "field": "timestamp" }, "include_sort": true, "sort_order": "desc", "size": 15 } } } } ``` #### sample response ``` { "took": 21, "timed_out": false, "_shards": {...}, "hits": {...}, "aggregations": { "make_line": { "type": "LineString", "coordinates": [ [ 121.52926194481552, 38.92878997139633 ], [ 121.52922699227929, 38.92876998055726 ], ] } } } ``` #### visual response <img width="540" alt="Screen Shot 2019-04-26 at 9 40 07 AM" src="https://user-images.githubusercontent.com/388837/56834977-cf278e00-6827-11e9-9c93-005ed48433cc.png"> #### limitations Due to the cardinality of points, an initial max of 10k points will be used. This should support many use-cases. One solution to overcome this limitation is to keep a PriorityQueue of points, and simplifying the line once it hits this max. If simplifying makes sense, it may be a nice option, in general. The ability to use a parameter to specify how aggressive one wants to simplify. This parameter could be the number of points. Example algorithm one could use with a PriorityQueue: https://bost.ocks.org/mike/simplify/. This would still require O(m) space, where m is the number of points returned. And would also require heapifying triangles sorted by their areas, which would be O(log(m)) operations. Since sorting is done, anyways, simplifying would still be a O(n log(m)) operation, where n is the total number of points to filter........... something to explore closes #41649	2020-11-23 10:26:27 -08:00
Mark Tozzi	f666ccb3bc	Add supports for upper and lower values on boxplot based on the IQR value (#63617 )	2020-11-04 14:39:05 -05:00
James Rodewig	2e9f95aa73	[DOCS] Change agg titles to sentence case (#64425 )	2020-10-30 13:25:21 -04:00
Igor Motov	e6c70f6811	Add value_count mode to rate agg (#63687 ) Adds a new value count mode to the rate aggregation. Closes #63575	2020-10-15 18:00:44 -04:00
Igor Motov	34bff3f776	Add support for histogram fields to rate aggregation (#63289 ) The rate aggregation now supports histogram fields. At the moment only sum is supported. Closes #62939	2020-10-08 16:54:25 -04:00
Christos Soulios	b857768bb5	Histogram field type support for min/max aggregations (#62532 ) Implement min/max aggregations for histogram fields. Closes #60951	2020-09-19 23:34:43 +03:00
Julie Tibshirani	f29c743a47	Support the 'fields' option in inner_hits and top_hits. (#62259 ) This PR adds support for the 'fields' option in the following places: * Anytime `inner_hits` is used, for both fetching nested/ child docs and field collapsing * The `top_hits` aggregation Addresses #61949.	2020-09-14 10:08:58 -07:00
Igor Motov	f107dba741	Add rate aggregation (#61369 ) Adds a new rate aggregation that can calculate a document rate for buckets of a date_histogram. Closes #60674	2020-08-25 11:32:20 -04:00
James Rodewig	456c37b186	[DOCS] Add usage tips to `top_hits` agg (#61215 )	2020-08-17 12:42:04 -04:00
Adam Locke	fdc867e395	[DOCS] Update info about geo_shape bounding boxes (#61214 ) * Adding information about geo_shape bounding boxes. * Fixing cross link and incorporating review feedback.	2020-08-17 11:07:18 -04:00
James Rodewig	a94e5cb7c4	[DOCS] Replace Wikipedia links with attribute (#61171 )	2020-08-17 09:44:24 -04:00
James Rodewig	6b9b8c5e31	[DOCS] Move script and stored fields content to search fields page (#60826 ) Changes: * Moves `Retrieve selected fields` to its own page and adds a title abbreviation. * Adds existing script and stored fields content to `Retrieve selected fields` * Adds a xref for `Retrieve selected fields` to `Search your data` * Adds related redirects and updates existing xrefs	2020-08-06 12:45:03 -04:00
James Rodewig	929033f9dd	[DOCS] Move named query content to bool query (#60748 )	2020-08-05 13:27:10 -04:00
James Rodewig	a4dc336c16	[DOCS] Replace `twitter` dataset in search/agg docs (#60667 )	2020-08-04 13:31:52 -04:00
Alexander Reelsen	c7ac9e7073	[DOCS] http -> https, remove outdated plugin docs (#60380 ) Plugin discovery documentation contained information about installing Elasticsearch 2.0 and installing an oracle JDK, both of which is no longer valid. While noticing that the instructions used cleartext HTTP to install packages, this commit replaces HTTPs links instead of HTTP where possible. In addition a few community links have been removed, as they do not seem to exist anymore.	2020-07-31 15:58:38 -04:00
James Rodewig	d5b03f668b	[DOCS] Move search sort docs to separate page (#60123 ) Moves the search sort docs from the deprecated 'Request Body Search' page to a new subpage of 'Run a search'. No substantive changes were made to the content.	2020-07-23 12:58:57 -04:00
Howard	b8e3ba783a	[DOCS] Fix missing punctuation in agg docs (#59822 )	2020-07-21 10:17:59 -04:00
James Rodewig	2c5d6e9c95	[DOCS] Reformat agg snippets to use two-space indents (#59912 )	2020-07-20 15:08:04 -04:00
James Rodewig	8a57800f1b	[DOCS] Add performance warning for scripts (#59890 )	2020-07-20 14:04:35 -04:00
James Rodewig	aa3ddfeefb	[DOCS] Move highlighting docs to separate page (#59768 ) Moves the highlighting docs from the deprecated 'Request Body Search' chapter to the new subpage of the 'Run a search chapter' section. No substantive changes were made to the content.	2020-07-17 10:15:20 -04:00
Cris da Rocha	b5de14d3f6	Missing comma between value types (#58383 ) This applies to all versions of this document (7.7, 7.8, 7.x, current and master).	2020-06-19 23:01:25 +02:00
Tal Levy	c765993d82	add geo_shape documentation for supported aggregations (#58284 ) This commit adds documentation for geo_shape fields in aggregations Closes #55495.	2020-06-18 10:17:49 -07:00
James Rodewig	7826bbee87	[DOCS] Move search API's `docvalue_fields` examples (#57760 ) Changes: * Condenses and relocates the `docvalue_fields` example to the 'Run a search' page. * Adds docs for the `docvalue_fields` request body parameter. * Updates several related xrefs. Co-authored-by: debadair <debadair@elastic.co>	2020-06-11 10:57:15 -04:00
andrewjohnson2	a791d6723d	Added standard deviation / variance sampling to extended stats (#49782 ) Per 49554 I added standard deviation sampling and variance sampling to the extended stats interface. Closes #49554 Co-authored-by: Igor Motov <igor@motovs.org>	2020-06-10 15:00:50 -04:00
James Rodewig	51e3d5ab63	[DOCS] Fix source filtering xrefs (#57720 )	2020-06-05 08:46:26 -04:00
Christos Soulios	caf6c5ac19	Histogram field type support for ValueCount and Avg aggregations (#55933 ) Implements value_count and avg aggregations over Histogram fields as discussed in #53285 - value_count returns the sum of all counts array of the histograms - avg computes a weighted average of the values array of the histogram by multiplying each value with its associated element in the counts array	2020-05-04 10:24:35 +03:00
Christos Soulios	cefc6af25b	Histogram field type support for Sum aggregation (#55681 ) Implements Sum aggregation over Histogram fields by summing the value of each bucket multiplied by their count as requested in #53285	2020-04-29 11:09:25 +03:00

1 2 3 4

163 commits