elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-04-25 07:37:19 -04:00

Author	SHA1	Message	Date
Julie Tibshirani	bb04fbcd96	For constant_keyword, make sure exists query handles missing values. (#55757 ) It's possible for a constant_keyword to have a 'null' value before any documents are seen that contain a value for the field. In this case, no documents have a value for the field, and 'exists' queries should return no documents.	2020-05-04 09:06:34 -07:00
Christos Soulios	caf6c5ac19	Histogram field type support for ValueCount and Avg aggregations (#55933 ) Implements value_count and avg aggregations over Histogram fields as discussed in #53285 - value_count returns the sum of all counts array of the histograms - avg computes a weighted average of the values array of the histogram by multiplying each value with its associated element in the counts array	2020-05-04 10:24:35 +03:00
Christos Soulios	cefc6af25b	Histogram field type support for Sum aggregation (#55681 ) Implements Sum aggregation over Histogram fields by summing the value of each bucket multiplied by their count as requested in #53285	2020-04-29 11:09:25 +03:00
Ben Skelker	a085183d6e	[DOCS] Add `ip_range` datatype to core datatypes range list (#55446 )	2020-04-20 08:54:19 -04:00
markharwood	35188e0091	Remove normalizer support from wildcard field while we decide on approach for handling case insensitvity (#55294 ) Closes #55288	2020-04-17 10:47:33 +01:00
Ignacio Vera	6182db5b77	Add new point field. (#53804 ) This commit adds a new point field that is able to index arbitrary pair of values (x/y) in the cartesian space. It only supports filtering using shape queries at the moment.	2020-04-07 13:08:02 +02:00
markharwood	d83798f237	Add pre-configured “lowercase” normalizer (#53882 ) Add pre-configured “lowercase” normalizer Includes tests that user-defined "lowercase" normalizer overrides the default one. Closes #53872	2020-04-03 10:12:06 +01:00
bellengao	5865a899d8	[DOCS] Correct field name in date_nanos doc (#54556 )	2020-04-01 14:22:04 +02:00
markharwood	2537e02a7d	Wildcard field - add normalizer support (#53851 ) * Add support for normalisation to wildcard field * Tidied imports * Added docs about params * Fix outdated error message * Avoid normaliser butchering wildcard query special characters * Fix broken test expectations * Fix wrong toString method * Address review comments - common method for normalising wildcard patterns and checkCompatibility * Remove unused import	2020-03-24 09:35:29 +00:00
Julie Tibshirani	7161bd44cb	Remove the top-level 'mapping type' section. (#53374 ) It seemed confusing for users that our top-level mapping page still had a prominent section named 'Mapping Type'. This PR reworks the docs to remove this reference and adds a note about types removal (similar to the note we added to other APIs like put mapping).	2020-03-23 12:26:22 -07:00
Lisa Cawley	0ea4324e22	[DOCS] Adds data nanos transform limitation (#53826 )	2020-03-23 09:48:00 -07:00
C.J. Jameson	34feb3cde1	[DOCS] Clarify routing enforcement in docs (#53945 ) Removes a mention of the `_doc` mapping type that's no longer applicable now that mapping types are removed/deprecated.	2020-03-23 12:34:49 -04:00
markharwood	024deb41a4	Wildcard field docs formatting fix Bullet points weren't rendering correctly	2020-03-18 15:38:00 +00:00
Alan Woodward	534a4a9b32	Remove [removal-of-types] docs page, and point to 7x docs (#53670 ) Given that types have been removed entirely in 8.0, we don't need a detailed page explaining what they are and why they are going away. This commit replaces the page with a short paragraph saying that types are no longer supported, and a link to the [removal-of-types] page in 7x Relates to #41059	2020-03-18 14:13:00 +00:00
James Rodewig	f2a2dcbb8a	[DOCS] Streamline `analyzer` mapping parm def (#51874 ) Simplifies the `analyzer` mapping parameter definition to remove duplicated analysis content and examples.	2020-03-18 09:42:25 -04:00
Alan Woodward	795a92707f	Remove deprecation warning when doc scripts refer to '_type' field (#53605 ) We currently emit a warning in 8x when a script refers to the _type field for a document. However, in 8x this field no longer exists, so the deprecation warning is not required. Relates to #41059	2020-03-18 11:50:36 +00:00
markharwood	a2a4756736	New wildcard field optimised for wildcard queries (#49993 ) Indexes values using size 3 ngrams and also stores the full original as a binary doc value. Wildcard queries operate by using a cheap approximation query on the ngram field followed up by a more expensive verification query using an automaton on the binary doc values. Also supports aggregations and sorting.	2020-03-16 09:54:10 +00:00
James Rodewig	febe7af62e	[DOCS] Clarify `max_shingle_size` parm def (#53480 ) Rewrites the `search_as_you_type` field datatype's `max_shingle_size` mapping parameter to improve clarity and better communicate trade-offs regarding index size. Relates to [elastic/kibana#55161][0]. Closes #51774. [0]: https://github.com/elastic/kibana/pull/55161#discussion_r368107177	2020-03-13 04:07:40 -04:00
Ignacio Vera	dba2a6e199	remove sneaked placeholder from histogram docs (#53391 )	2020-03-11 13:47:01 +01:00
James Rodewig	9f641dc07d	[DOCS] Fix several Asciidoctor double arrow replacements (#52827 ) Per the [Asciidoctor docs][0], Asciidoctor replaces the following syntax with double arrows in the rendered HTML: * => renders as ⇒ * <= renders as ⇐ This escapes several unintended replacements, such as in the Painless docs. Where appropriate, it also replaces some double arrow instances with single arrows for consistency. [0]: https://asciidoctor.org/docs/user-manual/#replacements	2020-03-04 08:42:37 -05:00
Adrien Grand	0b2a62822c	Introduce a `constant_keyword` field. (#49713 ) This field is a specialization of the `keyword` field for the case when all documents have the same value. It typically performs more efficiently than keywords at query time by figuring out whether all or none of the documents match at rewrite time, like `term` queries on `_index`. The name is up for discussion. I liked including `keyword` in it, so that we still have room for a `singleton_numeric` in the future. However I'm unsure whether to call it `singleton`, `constant` or something else, any opinions? For this field there is a choice between 1. accepting values in `_source` when they are equal to the value configured in mappings, but rejecting mapping updates 2. rejecting values in `_source` but then allowing updates to the value that is configured in the mapping This commit implements option 1, so that it is possible to reindex from/to an index that has the field mapped as a keyword with no changes to the source.	2020-03-02 16:46:46 +01:00
James Rodewig	340c08a6a8	[DOCS] Correct guidance for `index_options` mapping parm (#52899 ) Adds a warning admonition stating that the `index_options` mapping parameter is intended only for `text` fields. Removes an outdated statement regarding default values for numeric and other datatypes.	2020-03-02 07:37:48 -05:00
Martijn van Groningen	31b29875c9	Add validation for dynamic templates (#51233 ) Tries to load a `Mapper` instance for the mapping snippet of a dynamic template. This should catch things like using an analyzer that is undefined or mapping attributes that are unused. This is best effort: * If `{{name}}` placeholder is used in the mapping snippet then validation is skipped. * If `match_mapping_type` is not specified then validation is performed for all mapping types. If parsing succeeds with a single mapping type then this the dynamic mapping is considered valid. If is detected that a dynamic template mapping snippet is invalid at mapping update time then the mapping update is failed for indices created on 8.0.0-alpha1 and later. For indices created on prior version a deprecation warning is omitted instead. In 7.x clusters the mapping update will never fail in case of an invalid dynamic template mapping snippet and a deprecation warning will always be omitted. Closes #17411 Closes #24419 Co-authored-by: Adrien Grand <jpountz@gmail.com>	2020-02-27 11:52:27 +01:00
James Rodewig	12ed6f12e9	[DOCS] Document `include_in_*` nested mapping parms (#52648 ) Adds documentation for the `include_in_parent` and `include_in_root` mapping parameters for the `nested` mapping datatype.	2020-02-25 07:12:34 -05:00
Ignacio Vera	3b004b1936	Add support for multipoint shape queries (#52564 )	2020-02-24 12:42:17 +01:00
Ignacio Vera	e2b410e15e	Add support for multipoint geoshape queries (#52133 ) Currently multi-point queries are not supported when indexing your data using BKD-backed geoshape strategy. This commit removes this limitation.	2020-02-20 08:53:01 +01:00
Valentin Crettaz	3c21eca5e9	[DOCS] Clarify that "now" cannot be used in `date_range` at index time (#52446 ) `date_range` fields do not accept `"now"` as a value of either bounds at indexing time. This corrects an error in the range data type mapping docs.	2020-02-19 12:55:06 -05:00
James Rodewig	27f45aeb9d	[DOCS] Remove 'analyzed string' references (#51946 ) The `string` field datatype was replaced by the `text` and `keyword` field datatypes in [5.0][0]. This removes several outdated references to 'analyzed string' fields. [0]:https://www.elastic.co/guide/en/elasticsearch/reference/5.0/breaking_50_mapping_changes.html#_string_fields_replaced_by_textkeyword_fields	2020-02-14 12:33:21 -05:00
Igor Motov	0898df4aac	Add histogram field type support to boxplot aggs (#52265 ) Add support for the histogram field type to boxplot aggs. Closes #52233 Relates to #33112	2020-02-13 08:59:44 -05:00
Ryan Ernst	f7c989731e	Fix incorrect date nanos docs example (#52249 ) The example of how to access the nano value of a date_nanos field has been broken since it was created. This commit fixes it to use the correct scripting methods. closes #51931	2020-02-12 15:54:39 -08:00
Marios Trivyzas	a8b39ed842	Add a cluster setting to disallow expensive queries (#51385 ) Add a new cluster setting `search.allow_expensive_queries` which by default is `true`. If set to `false`, certain queries that have usually slow performance cannot be executed and an error message is returned. - Queries that need to do linear scans to identify matches: - Script queries - Queries that have a high up-front cost: - Fuzzy queries - Regexp queries - Prefix queries (without index_prefixes enabled - Wildcard queries - Range queries on text and keyword fields - Joining queries - HasParent queries - HasChild queries - ParentId queries - Nested queries - Queries on deprecated 6.x geo shapes (using PrefixTree implementation) - Queries that may have a high per-document cost: - Script score queries - Percolate queries Closes: #29050	2020-02-12 18:06:04 +01:00
Mayya Sharipova	620996287a	Remove docs related to index time boosting (#51704 ) As there is no really index time boosting, as boost is only applied during query time, this removes mentions of index time boosting.	2020-01-31 09:01:52 -05:00
Yannick Welsch	a57a9a31c3	Stricter checks of setup and teardown in docs tests (#51430 ) Made checks stricter after backporting PR.	2020-01-28 17:53:57 +01:00
junmuz	06cec760da	[DOCS] Correct typo in `ignore_malformed` mapping parm docs (#50780 )	2020-01-13 09:49:10 -05:00
James Rodewig	d3cd4fbd62	[DOCS] Fix typo in mapping date format docs	2020-01-08 07:55:21 -06:00
arkel-s	f20367e405	[DOCS] Add example format for `date_optional_time` (#50458 ) Adds an example format for `date_optional_time` to the `format` mapping parameter docs. Closes #50457	2020-01-07 10:07:29 -06:00
James Rodewig	e8a6d4a3fb	[DOCS] Remove unneeded redirects (#50476 ) The docs/reference/redirects.asciidoc file stores a list of relocated or deleted pages for the Elasticsearch Reference documentation. This prunes several older redirects that are no longer needed and don't require work to fix broken links in other repositories.	2019-12-26 07:49:41 -05:00
Xiang Dai	432bd0e92c	Fix docs typos (#50365 ) Fixes a few typos in the docs. Signed-off-by: Xiang Dai 764524258@qq.com	2019-12-23 10:35:14 -05:00
Adrien Grand	2d627ba757	Add per-field metadata. (#49419 ) This PR adds per-field metadata that can be set in the mappings and is later returned by the field capabilities API. This metadata is completely opaque to Elasticsearch but may be used by tools that index data in Elasticsearch to communicate metadata about fields with tools that then search this data. A typical example that has been requested in the past is the ability to attach a unit to a numeric field. In order to not bloat the cluster state, Elasticsearch requires that this metadata be small: - keys can't be longer than 20 chars, - values can only be numbers or strings of no more than 50 chars - no inner arrays or objects, - the metadata can't have more than 5 keys in total. Given that metadata is opaque to Elasticsearch, field capabilities don't try to do anything smart when merging metadata about multiple indices, the union of all field metadatas is returned. Here is how the meta might look like in mappings: ```json { "properties": { "latency": { "type": "long", "meta": { "unit": "ms" } } } } ``` And then in the field capabilities response: ```json { "latency": { "long": { "searchable": true, "aggreggatable": true, "meta": { "unit": [ "ms" ] } } } } ``` When there are no conflicts, values are arrays of size 1, but when there are conflicts, Elasticsearch includes all unique values in this array, without giving ways to know which index has which metadata value: ```json { "latency": { "long": { "searchable": true, "aggreggatable": true, "meta": { "unit": [ "ms", "ns" ] } } } } ``` Closes #33267	2019-12-18 17:27:38 +01:00
James Rodewig	230d4765d3	[DOCS] Add identifier mapping tip to numeric and keyword datatype docs (#49933 ) Users often mistakenly map numeric IDs to numeric datatypes. However, this is often slow for the `term` and other term-level queries. The "Tune for search speed" docs includes advice for mapping numeric IDs to `keyword` fields. However, this tip is not included in the `numeric` or `keyword` field datatype doc pages. This rewords the tip in the "Tune for search speed" docs, relocates it to the `numeric` field docs, and reuses it using tagged regions.	2019-12-17 09:31:07 -05:00
Ignacio Vera	7c559be31c	"CONTAINS" support for BKD-backed geo_shape and shape fields (#50141 ) Lucene 8.4 added support for "CONTAINS", therefore in this commit those changes are integrated in Elasticsearch. This commit contains as well a bug fix when querying with a geometry collection with "DISJOINT" relation.	2019-12-16 07:43:42 +01:00
Adrien Grand	1329acc094	Upgrade to lucene 8.4.0-snapshot-662c455. (#50016 ) Lucene 8.4 is about to be released so we should check it doesn't cause problems with Elasticsearch.	2019-12-10 17:09:36 +01:00
Ignacio Vera	eade4f03f4	New Histogram field mapper that supports percentiles aggregations. (#48580 ) This commit adds a new histogram field mapper that consists in a pre-aggregated format of numerical data to be used in percentiles aggregations.	2019-11-28 13:58:20 +01:00
Jim Ferenczi	c2deb287f1	Add a cluster setting to disallow loading fielddata on _id field (#49166 ) This change adds a dynamic cluster setting named `indices.id_field_data.enabled`. When set to `false` any attempt to load the fielddata for the `_id` field will fail with an exception. The default value in this change is set to `false` in order to prevent fielddata usage on this field for future versions but it will be set to `true` when backporting to 7x. When the setting is set to true (manually or by default in 7x) the loading will also issue a deprecation warning since we want to disallow fielddata entirely when https://github.com/elastic/elasticsearch/issues/26472 is implemented. Closes #43599	2019-11-27 13:38:09 +01:00
Julie Tibshirani	548fcd09fb	Stop ignoring types warnings in REST tests. (#49333 ) In 7.x we added logic to the REST test harness to ignore warnings related to types removal. This allowed us to continue to run mixed-cluster tests that included 6.x nodes. Now that master is on 8.x, we've no longer need to include 6.x nodes in testing and have removed almost all typed calls. The logic to ignore warnings can therefore be removed.	2019-11-20 08:23:56 -08:00
Mayya Sharipova	19b6b3693a	Increase the number of vector dims to 2048 (#46895 )	2019-11-20 07:43:14 -05:00
Julie Tibshirani	dd6f0a35e4	Remove the 'experimental' marking from vector fields. (#49120 ) We wrapped up the API changes we wanted to make, and vector fields can now be considered GA.	2019-11-18 11:57:18 -08:00
Antoine Garcia	529ebea2b9	[Docs] Specify field types not supporting doc values (#49041 ) The `string` type (with option `analyzed`) has been replaced by `text` after `6.0`, also the `annonated_text` field do not support doc values and should be mentioned.	2019-11-18 16:37:51 +01:00
Julie Tibshirani	460d545921	Remove support for sparse vectors. (#48781 ) Follow up to #48368. This PR removes support for `sparse_vector` on new indices. On 7.x indices a `sparse_vector` can still be defined, but it is not possible to index or search on it.	2019-11-14 16:54:48 -05:00
Julie Tibshirani	7d500bacee	Ensure parameters are updated when merging flattened mappings. (#48971 ) This PR makes the following two fixes around updating flattened fields: * Make sure that the new value for ignore_above is immediately taken into affect. Previously we recorded the new value but did not use it when parsing documents. * Allow depth_limit to be updated dynamically. It seems plausible that a user might want to tweak this setting as they encounter more data.	2019-11-12 13:44:53 -08:00

1 2 3 4 5 ...

541 commits