elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-04-25 07:37:19 -04:00

Author	SHA1	Message	Date
James Rodewig	67f113314d	[DOCS] Fix acasting for agg types (#67469 )	2021-01-13 14:44:54 -05:00
Adam Locke	82bfbe1195	[DOCS] Adding headers in TOC for aggregation docs. (#66604 )	2020-12-18 11:31:42 -05:00
James Rodewig	77dc63b2de	[DOCS] Fix `search.max_buckets` default (#66311 )	2020-12-14 21:55:27 -05:00
Nik Everett	524f39f61e	Drop experimental from variable width histogram (#66055 ) Its been several months and we haven't bumped into any good reason to rework the variable width histogram. So let's drop experimental from it! Closes #58573	2020-12-08 14:15:21 -05:00
Mike Barretta	12c9ee4d80	Update inference-bucket-aggregation.asciidoc tiny change to properly align the first code example and to add a missing word	2020-12-03 11:48:45 -05:00
James Rodewig	e955f7752b	[DOCS] Fix typo in histogram agg docs (#65822 )	2020-12-03 09:55:47 -05:00
Igor Motov	a065b6d8da	Return an error when a rate aggregation cannot calculate bucket sizes (#65429 ) In some cases when the rate aggregation is not a child of a date histogram aggregation, it is not possible to determine the actual size of the date histogram bucket. In this case the rate aggregation now throws an exception. Closes #63703	2020-11-25 10:05:51 -05:00
Tal Levy	a6755c3be8	Add mention of geo_shape support in geotile and geohash grid agg docs (#61129 ) Previously, geo_shape support was only mentioned in a dedicated x-pack section. This may be misleading, as the introductory paragraph only mentions geo_point. Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>	2020-11-24 13:57:42 -08:00
Tal Levy	b514d9bf2e	Add geo_line aggregation (#41612 ) A metric aggregation that aggregates a set of points as a GeoJSON LineString ordered by some sort parameter. #### specifics A `geo_line` aggregation request would specify a `geo_point` field, as well as a `sort` field. `geo_point` represents the values used in the LineString, while the `sort` values will be used as the total ordering of the points. the `sort` field would support any numeric field, including date. #### sample usage ``` { "query": { "bool": { "must": [ { "term": { "person": "004" } }, { "term": { "trajectory": "20090131002206.plt" } } ] } }, "aggs": { "make_line": { "geo_line": { "point": {"field": "location"}, "sort": { "field": "timestamp" }, "include_sort": true, "sort_order": "desc", "size": 15 } } } } ``` #### sample response ``` { "took": 21, "timed_out": false, "_shards": {...}, "hits": {...}, "aggregations": { "make_line": { "type": "LineString", "coordinates": [ [ 121.52926194481552, 38.92878997139633 ], [ 121.52922699227929, 38.92876998055726 ], ] } } } ``` #### visual response <img width="540" alt="Screen Shot 2019-04-26 at 9 40 07 AM" src="https://user-images.githubusercontent.com/388837/56834977-cf278e00-6827-11e9-9c93-005ed48433cc.png"> #### limitations Due to the cardinality of points, an initial max of 10k points will be used. This should support many use-cases. One solution to overcome this limitation is to keep a PriorityQueue of points, and simplifying the line once it hits this max. If simplifying makes sense, it may be a nice option, in general. The ability to use a parameter to specify how aggressive one wants to simplify. This parameter could be the number of points. Example algorithm one could use with a PriorityQueue: https://bost.ocks.org/mike/simplify/. This would still require O(m) space, where m is the number of points returned. And would also require heapifying triangles sorted by their areas, which would be O(log(m)) operations. Since sorting is done, anyways, simplifying would still be a O(n log(m)) operation, where n is the total number of points to filter........... something to explore closes #41649	2020-11-23 10:26:27 -08:00
Wylie Conlon	10ee0f2878	Clarify field data cache behavior in docs (#64375 ) * Clarify that field data cache includes global ordinals * Describe that the cache should be cleared once the limit is reached * Clarify that the `_id` field does not supported aggregations anymore * Fold the `fielddata` mapping parameter page into the `text field docs * Improve cross-linking	2020-11-20 13:53:23 -08:00
Adam Locke	9fdcd79927	Explicitly defining types for sources parameter (#65006 )	2020-11-12 16:09:04 -05:00
Mark Tozzi	f666ccb3bc	Add supports for upper and lower values on boxplot based on the IQR value (#63617 )	2020-11-04 14:39:05 -05:00
James Rodewig	8bc922512c	[DOCS] Redirect moving avg aggregation (#64435 )	2020-10-30 14:12:09 -04:00
James Rodewig	2e9f95aa73	[DOCS] Change agg titles to sentence case (#64425 )	2020-10-30 13:25:21 -04:00
James Rodewig	37b6adaf91	[DOCS] Rewrite aggs overview (#64318 ) - Replaces more abstract docs about object structure and values source with task-based examples. - Relocates several sections from the current `misc.asciidoc` file. - Alphabetically sorts agg categories in the nav. - Removes the matrix agg family. Moves the stats matrix agg under the metric agg family Co-authored-by: debadair <debadair@elastic.co>	2020-10-30 08:39:38 -04:00
István Zoltán Szabó	6093518f4a	[DOCS] Changes experimental flag to beta in DFA related docs (#63992 )	2020-10-26 17:02:46 +01:00
Hugo Chargois	ff736f078b	Allow mixing set-based and regexp-based include and exclude (#63325 ) * Allow mixing set-based and regexp-based include and exclude * Coding style * Disallow having both set and regexp include (resp. exclude) * Test correctness of every combination of include/exclude	2020-10-21 10:26:42 -04:00
Aref Razavi	245663e5b7	Remove useless parentheses in bucket_key formula (#63868 )	2020-10-19 11:54:21 +02:00
Igor Motov	e6c70f6811	Add value_count mode to rate agg (#63687 ) Adds a new value count mode to the rate aggregation. Closes #63575	2020-10-15 18:00:44 -04:00
Igor Motov	34bff3f776	Add support for histogram fields to rate aggregation (#63289 ) The rate aggregation now supports histogram fields. At the moment only sum is supported. Closes #62939	2020-10-08 16:54:25 -04:00
Przemyslaw Gomulka	b38eaae47f	[doc] Rounding range query rules (#63109 ) a documentation explaining defaulting of missing fields when using date math parser. relates #62268	2020-10-02 08:59:27 +02:00
Benjamin Trent	1084aaf18a	[ML] renames /inference apis to /trained_models (#63097 ) This commit renames all `inference` CRUD APIs to `trained_models`. This aligns with internal terminology, documentation, and use-cases.	2020-10-01 12:13:49 -04:00
Lisa Cawley	ecf9e929ba	[DOCS] Add experimental tag to inference processor and bucket aggregation (#63023 )	2020-09-30 07:20:38 -07:00
James Rodewig	277709004e	[DOCS] Fix elasticsearch-croneval chunking (#63008 )	2020-09-29 09:53:20 -04:00
Christos Soulios	b857768bb5	Histogram field type support for min/max aggregations (#62532 ) Implement min/max aggregations for histogram fields. Closes #60951	2020-09-19 23:34:43 +03:00
Julie Tibshirani	f29c743a47	Support the 'fields' option in inner_hits and top_hits. (#62259 ) This PR adds support for the 'fields' option in the following places: * Anytime `inner_hits` is used, for both fetching nested/ child docs and field collapsing * The `top_hits` aggregation Addresses #61949.	2020-09-14 10:08:58 -07:00
Igor Motov	f107dba741	Add rate aggregation (#61369 ) Adds a new rate aggregation that can calculate a document rate for buckets of a date_histogram. Closes #60674	2020-08-25 11:32:20 -04:00
István Zoltán Szabó	8da6bba0fc	[DOCS] Adds example to the inference aggregation description (#61290 )	2020-08-19 11:20:42 +02:00
Nik Everett	cebd5d47e2	Redo experimental tag on vwh (#61065 ) The docs didn't have the standard experimental text. This adds it.	2020-08-18 10:00:54 -04:00
James Rodewig	456c37b186	[DOCS] Add usage tips to `top_hits` agg (#61215 )	2020-08-17 12:42:04 -04:00
Adam Locke	fdc867e395	[DOCS] Update info about geo_shape bounding boxes (#61214 ) * Adding information about geo_shape bounding boxes. * Fixing cross link and incorporating review feedback.	2020-08-17 11:07:18 -04:00
James Rodewig	a94e5cb7c4	[DOCS] Replace Wikipedia links with attribute (#61171 )	2020-08-17 09:44:24 -04:00
Gilad Gal	8534bd5ce7	Update normalize-aggregation.asciidoc The second method normalizes linearly between 0..100	2020-08-12 22:24:36 +03:00
James Rodewig	a0f4edff66	[DOCS] Fix chunking in query docs (#61053 ) Changes: * Moves "Notes" sections for the joining queries and percolate query pages to the parent page * Adds related redirects for the moved "Notes" pages * Assigns explicit anchor IDs to other "Notes" headings. This was required for the redirects to work.	2020-08-12 13:45:49 -04:00
James Rodewig	6b9b8c5e31	[DOCS] Move script and stored fields content to search fields page (#60826 ) Changes: * Moves `Retrieve selected fields` to its own page and adds a title abbreviation. * Adds existing script and stored fields content to `Retrieve selected fields` * Adds a xref for `Retrieve selected fields` to `Search your data` * Adds related redirects and updates existing xrefs	2020-08-06 12:45:03 -04:00
Mark Tozzi	65caee9163	Extensibility for Composite Agg (#59648 ) This PR adds the ability to plug new ValuesSourceType support into Composite aggregations via the ValuesSourceRegistry. This should let plugins which define new field types wire those types into composite. It also updates composite's use of ValueType to follow the conventions we're using in the rest of aggregations, namely splitting the user supplied value out from the default value.	2020-08-06 12:34:14 -04:00
James Rodewig	929033f9dd	[DOCS] Move named query content to bool query (#60748 )	2020-08-05 13:27:10 -04:00
James Rodewig	a4dc336c16	[DOCS] Replace `twitter` dataset in search/agg docs (#60667 )	2020-08-04 13:31:52 -04:00
Alexander Reelsen	c7ac9e7073	[DOCS] http -> https, remove outdated plugin docs (#60380 ) Plugin discovery documentation contained information about installing Elasticsearch 2.0 and installing an oracle JDK, both of which is no longer valid. While noticing that the instructions used cleartext HTTP to install packages, this commit replaces HTTPs links instead of HTTP where possible. In addition a few community links have been removed, as they do not seem to exist anymore.	2020-07-31 15:58:38 -04:00
James Rodewig	aec26b1a23	[DOCS] Move search pagination content to one page (#60515 )	2020-07-31 11:43:06 -04:00
Julie Tibshirani	8a89d95372	Add search `fields` parameter to support high-level field retrieval. (#60100 ) This feature adds a new `fields` parameter to the search request, which consults both the document `_source` and the mappings to fetch fields in a consistent way. The PR merges the `field-retrieval` feature branch. Addresses #49028 and #55363.	2020-07-27 13:25:55 -07:00
James Rodewig	441c3a21b1	[DOCS] Update my-index examples (#60132 ) Changes the following example index names to `my-index-000001` for consistency: * `my-index` * `my_index` * `myindex`	2020-07-27 14:46:39 -04:00
James Rodewig	74c9e56735	[DOCS] Fix default gap policy for moving fn, moving avg aggs (#60223 ) (#60230 )	2020-07-27 12:32:35 -04:00
James Rodewig	d5b03f668b	[DOCS] Move search sort docs to separate page (#60123 ) Moves the search sort docs from the deprecated 'Request Body Search' page to a new subpage of 'Run a search'. No substantive changes were made to the content.	2020-07-23 12:58:57 -04:00
James Rodewig	2774cd6938	[DOCS] Swap `[float]` for `[discrete]` (#60124 ) Changes instances of `[float]` in our docs for `[discrete]`. Asciidoctor prefers the `[discrete]` tag for floating headings: https://asciidoctor.org/docs/asciidoc-asciidoctor-diffs/#blocks	2020-07-23 11:48:22 -04:00
Howard	b8e3ba783a	[DOCS] Fix missing punctuation in agg docs (#59822 )	2020-07-21 10:17:59 -04:00
James Rodewig	2c5d6e9c95	[DOCS] Reformat agg snippets to use two-space indents (#59912 )	2020-07-20 15:08:04 -04:00
James Rodewig	8a57800f1b	[DOCS] Add performance warning for scripts (#59890 )	2020-07-20 14:04:35 -04:00
Igor Motov	6bfde550f9	Add hard_bounds documentation (#59809 ) Fixes #59774	2020-07-20 09:54:02 -04:00
Nik Everett	27efb5f3b8	Clean up a few of vwh's rough edges (#59341 ) This cleans up a few rough edged in the `variable_width_histogram`, mostly found by @wwang500: 1. Setting its tuning parameters in an unexpected order could cause the request to fail. 2. We checked that the maximum number of buckets was both less than 50000 and MAX_BUCKETS. This drops the 50000. 3. Fixes a divide by 0 that can occur of the `shard_size` is 1. 4. Fixes a divide by 0 that can occur if the `shard_size * 3` overflows a signed int. 5. Requires `shard_size * 3 / 4` to be at least `buckets`. If it is less than `buckets` we will very consistently return fewer buckets than requested. For the most part we expect folks to leave it at the default. If they change it, we expect it to be much bigger than `buckets`. 6. Allocate a smaller `mergeMap` in when initially bucketing requests that don't use the entire `shard_size * 3 / 4`. Its just a waste. 7. Default `shard_size` to `10 * buckets` rather than `100`. It looks like that was our intention the whole time. And it feels like it'd keep the algorithm humming along more smoothly. 8. Default the `initial_buffer` to `min(10 * shard_size, 50000)` like we've documented it rather than `5000`. Like the point above, this feels like the right thing to do to keep the algorithm happy. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-07-17 13:39:28 -04:00

1 2 3 4 5 ...

449 commits