elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-06-28 17:34:17 -04:00

Author	SHA1	Message	Date
Jim Ferenczi	54af815ad9	Refactor SourceProvider creation to consistently use MappingLookup (#128213 ) This change updates the code to always create SourceProvider instances via MappingLookup, avoiding direct exposure to the underlying source format (synthetic or stored). It also aligns source filtering behaviour between SourceProvider and SourceLoader, ensuring consistent application of filters. This change is needed to enable source filtering to occur earlier in the fetch phase, for example, when constructing a synthetic source.	2025-05-22 14:45:13 +01:00
Rene Groeschke	ba61f8c7f7	Update Gradle wrapper to 8.12 (#118683 ) This updates the gradle wrapper to 8.12 We addressed deprecation warnings due to the update that includes: - Fix change in TestOutputEvent api - Fix deprecation in groovy syntax - Use latest ospackage plugin containing our fix - Remove project usages at execution time - Fix deprecated project references in repository-old-versions	2024-12-30 15:34:24 +01:00
Armin Braun	e94f145350	Fix a bunch of non-final static fields (#119185 ) Fixing almost all missing `final` spots, who knows maybe we get a small speedup from some constant folding here and there.	2024-12-26 19:14:36 +01:00
Oleksandr Kolomiiets	2b8e4e727c	Migrate mapper-related modules to internal-*-rest-test (#117298 )	2024-11-23 00:35:24 +00:00
Rene Groeschke	f6ac6e1c3b	[Build] Remove deprecated BuildParams (#116984 )	2024-11-22 16:30:57 +01:00
Rene Groeschke	13c8aaeffa	[Gradle] Remove static use of BuildParams (#115122 ) Static fields dont do well in Gradle with configuration cache enabled. - Use buildParams extension in build scripts - Keep BuildParams.ci for now for easy serverless migration - Tweak testing doc	2024-11-15 17:58:57 +01:00
Kostas Krikellas	4573ab8ec1	[TEST] Replace _source.mode with index.mapping.source.mode in integration tests - take 2 (#116072 ) * Reapply "[TEST] Replace _source.mode with index.mapping.source.mode in integra…" (#116069) This reverts commit `e8bf344a28`. * [TEST] Replace _source.mode with index.mapping.source.mode in integration tests * add reason * add reason * spotless * revert unneeded	2024-11-04 09:39:34 +02:00
Kostas Krikellas	e8bf344a28	Revert "[TEST] Replace _source.mode with index.mapping.source.mode in integra…" (#116069 ) This reverts commit `a360757968`.	2024-11-01 10:53:08 +02:00
Kostas Krikellas	a360757968	[TEST] Replace _source.mode with index.mapping.source.mode in integration tests (#115926 ) * Replace _source.mode with index.mapping.source.mode in integration tests * fix tests * revert 40_source_mode_setting.yml	2024-11-01 09:46:06 +02:00
Mark Vieira	a59c182f9f	Add AGPLv3 as a supported license	2024-09-13 15:29:46 -07:00
Kostas Krikellas	f3bc281978	Refactor build params for FieldMapper, adding SourceKeepMode (#112455 ) * Refactor build params for FieldMapper * more mappers and tests * more mappers * more mappers * spotless * spotless * stored by default * Revert "stored by default" This reverts commit `bbd247d64b`. * restore storeIgnored * sync * list valid values for SourceKeepMode * small refactoring * spotless	2024-09-06 14:16:17 +03:00
Luca Cavanna	915e4a50c5	Rename Mapper#name to Mapper#fullPath (#110040 ) This addresses a long standing TODO that caused quite a few bugs over time, in that the mapper name does not include its full path, while the MappedFieldType name does. We have renamed Mapper.Builder#name to leafName (#109971) and Mapper#simpleName to leafName (#110030). This commit renames Mapper#name to fullPath for clarity This required some adjustments in FieldAliasMapper to avoid confusion between the existing path method and fullPath. I renamed path to targetPath for clarity. ObjectMapper already had a fullPath method that returned name, and was effectively a copy of name, so it could be removed.	2024-06-21 22:47:27 +02:00
Luca Cavanna	54e7b4d93b	Rename Mapper#simpleName to Mapper#leafName (#110030 ) This addresses a long standing TODO that caused quite a few bugs over time, in that the mapper name does not include its full path, while the MappedFieldType name does. We have method called simpleName to signal that, but leafName signals that more clearly and aligns with the name we have recently introduced in Mapper.Builder (renamed from name to leafName). Relates to #109971	2024-06-21 14:28:36 +02:00
Luca Cavanna	15c7abe111	Rename Mapper#name to Mapper#leafName (#109971 ) This addresses a long standing TODO that caused quite a few bugs over time, in that the mapper name does not include its full path, while the MappedFieldType name does.	2024-06-21 11:48:17 +02:00
Oleksandr Kolomiiets	6c82f87074	Add test for docvalue_fields retrieval of murmur3 (#107880 )	2024-04-30 08:07:48 -07:00
Adrien Grand	62f19e3a0c	Disable dynamic pruning on unindexed fields. (#107194 ) In order to know whether it can apply dynamic pruning using the points index, Lucene simply looks at whether a field has points. Unfortunately, this doesn't work well with our support for archive indexes, where numeric/date fields report that they have points, but they only support metadata operations on these points (min/max values, doc count), with the goal of quickly filtering out such archive indexes during the `can_match` phase. In order to address this discrepancy, dynamic pruning is now disabled when mappings report that a field is not indexed. This works because archive indexes automatically set `index: false` to make sure that filters run on doc values and not points. However, this is not a great fix as this increases our reliance on disabling dynamic pruning, which is currently marked as deprecated and scheduled for removal in the next Lucene major. So we'll need to either add it back to Lucene or find another approach. Closes #107168	2024-04-09 17:01:32 +02:00
Felix Barnsteiner	5920c917aa	Encapsulate Mapper.Builder#name and make it private (#105648 ) This is in preparation to make the field mutable, which is needed in the context of https://github.com/elastic/elasticsearch/pull/103542	2024-02-20 15:53:14 +01:00
Armin Braun	1452658e35	Remove dead code from mappers codebase (#103273 ) Just some random findings from investigating unrelated things.	2023-12-11 17:57:19 +01:00
Armin Braun	574fb05946	Deduplicate org.apache.lucene.document.FieldType instances across mappers (#99361 ) We mostly have a handful of `FieldType` values here across all mappers and none of them contain attributes. There's only so many combinations here, lets deduplicate these to save some heap and set up subsequent mapper heap savings.	2023-09-08 22:18:35 +02:00
Armin Braun	f1a376c317	Remove CopyTo.Builder (#99368 ) The copyTo builder is really hard to reason about when it comes to mapper merging, because the `reset` method would actually mutate an existing mapper. That seems dangerous and the whole thing is quite inefficient as well. -> this PR just removes it and uses a copy constructor for copy on write, avoiding instance creation on mapper merges here and there and leaving no doubt about these things being immutable.	2023-09-08 13:24:31 -04:00
Benjamin Trent	8b52d85a37	Add more testing for murmur3 mapper (#96745 ) The key here is that doc value fields for the murmur3 mapper are the hash, where as the `fields` API pulls from `_source` which contains the unmodified string. closes: https://github.com/elastic/elasticsearch/issues/96742	2023-06-13 09:21:37 -04:00
Simon Cooper	56d53da381	Migrate LuceneDocument.getFields(String) to a List (#94830 )	2023-03-29 11:08:36 +01:00
Mark Vieira	c2eda511de	Add JUnit rule based integration test cluster orchestration framework (#92379 ) This commit adds a new test framework for configuring and orchestrating test clusters for both Java and YAML REST testing. This will eventually replace the existing "test-clusters" Gradle plugin and the build-time cluster orchestration.	2022-12-21 15:33:46 -08:00
Nik Everett	bc49392bfb	Support malformed numbers in synthetic _source (#90428 ) This adds support for `ignore_malformed` to numeric fields other than `scaled_float` in synthetic `_source`. Their values are saved to a stored field and loaded to render the `_source`.	2022-10-04 12:17:30 -04:00
Nik Everett	f4fad2548f	Always support ignore_malformed in the same way (#90565 ) This makes sure that all field types that support `ignore_malfored` do so in the same way. Production changes: * All mapper has an `ignoreMalformed` method that must return `true` if the field accepts the `ignore_malformed` mapping parameter was configured. It defaults to `false` because many fields either don't have a concept of "malformed" value or don't have the ability to ignore malformed values. * Fix the `scaled_float` field to store it's field name in `_ignored` if it ignores any malfored values. This is how all other field mappers work. Test changes: * `MapperTestCase` forces subclasses to declare if their `supportIgnoreMalformed` or not. * If `MapperTestCase` subclasses `supportIgnoreMalfored` they must define some `exampleMalformedValues`. * `MapperTestCase` always grows three new tests: * One that creates a field without setting `ignore_malformed` and verifies that all `exampleMalformedValues` throw expected errors * On that explicitly configured `ignore_malformed` to false and, if `supportIgnoreMalformed` it verifies the errors again. If not `supportIgnoreMalformed` it verifies that the parameter is unknown. * On that explicitly configured `ignore_malformed` to true and, if `supportIgnoreMalformed` it verifies that parsing doesn't produce errors and correctly produces `_ignored`. If not `supportIgnoreMalformed` it verifies that the parameter is unknown. * Moved some subclasesses of `MapperTestCase` from `internalClusterTests` to `tests`. This isn't strictly required but that's the right place for them.	2022-10-03 06:18:02 -04:00
Alan Woodward	bc8ebbf540	Add FieldDataContext (#88779 ) MappedFieldType#fieldDataBuilder() currently takes two parameters, a fully qualified index name and a supplier for a SearchLookup. We expect to add more parameters here as we add support for loading fielddata from source. Rather than telescoping the parameter list, this commit instead introduces a new FieldDataContext carrier object which will allow us to add to these context parameters more easily.	2022-07-26 14:47:50 +01:00
Armin Braun	7a25453dec	Speed up FieldMapper construction/parsing/serialization (#86860 ) Speeding this up some more as it's now 50% of the bootstrap time of the many shards benchmarks. Iterating an array here in all cases is quite a bit faster than iterating various kinds of lists and doesn't complicate the code. Also removes a redundant call to `getValue()` for each parameter during serialization.	2022-05-23 12:09:00 +02:00
Nik Everett	a589456b81	Synthetic source (#85649 ) This attempts to shrink the index by implementing a "synthetic _source" field. You configure it by in the mapping: ``` { "mappings": { "_source": { "synthetic": true } } } ``` And we just stop storing the `_source` field - kind of. When you go to access the `_source` we regenerate it on the fly by loading doc values. Doc values don't preserve the original structure of the source you sent so we have to make some educated guesses. And we have a rule: the source we generate would result in the same index if you sent it back to us. That way you can use it for things like `_reindex`. Fetching the `_source` from doc values does slow down loading somewhat. See numbers further down. ## Supported fields This only works for the following fields: * `boolean` * `byte` * `date` * `double` * `float` * `geo_point` (with precision loss) * `half_float` * `integer` * `ip` * `keyword` * `long` * `scaled_float` * `short` * `text` (when there is a `keyword` sub-field that is compatible with this feature) ## Educated guesses The synthetic source generator makes `_source` fields that are: * sorted alphabetically * as "objecty" as possible * pushes all arrays to the "leaf" fields * sorts most array values * removes duplicate text and keyword values These are mostly artifacts of how doc values are stored. ### sorted alphabetically ``` { "b": 1, "c": 2, "a": 3 } ``` becomes ``` { "a": 3, "b": 1, "c": 2 } ``` ### as "objecty" as possible ``` { "a.b": "foo" } ``` becomes ``` { "a": { "b": "foo" } } ``` ### pushes all arrays to the "leaf" fields ``` { "a": [ { "b": "foo", "c": "bar" }, { "c": "bort" }, { "b": "snort" } } ``` becomes ``` { "a" { "b": ["foo", "snort"], "c": ["bar", "bort"] } } ``` ### sorts most array values ``` { "a": [2, 3, 1] } ``` becomes ``` { "a": [1, 2, 3] } ``` ### removes duplicate text and keyword values ``` { "a": ["bar", "baz", "baz", "baz", "foo", "foo"] } ``` becomes ``` { "a": ["bar", "baz", "foo"] } ``` ## `_recovery_source` Elasticsearch's shard "recovery" process needs `_source` sometimes. So does cross cluster replication. If you disable source or filter it somehow we store a `_recovery_source` field for as long as the recovery process might need it. When everything is running smoothly that's generally a few seconds or minutes. Then the fields is removed on merge. This synthetic source feature continues to produce `_recovery_source` and relies on it for recovery. It's possible to synthesize `_source` during recovery but we don't do it. That means that synethic source doesn't speed up writing the index. But in the future we might be able to turn this on to trade writing less data at index time for slower recovery and cross cluster replication. That's an area of future improvement. ## perf numbers I loaded the entire tsdb data set with this change and the size: ``` standard -> synthetic store size 31.0 GB -> 7.0 GB (77.5% reduction) _source 24695.7 MB -> 47.6 MB (99.8% reduction - synthetic is in _recovery_source) ``` A second _forcemerge a few minutes after rally finishes should removes the remaining 47.6MB of _recovery_source. With this fetching source for 1,000 documents seems to take about 500ms. I spot checked a lot of different areas and haven't seen any different hit. I expect this performance impact is based on the number of doc values fields in the index and how sparse they are.	2022-05-10 07:46:58 -04:00
Przemyslaw Gomulka	037261356e	Convert 'id' and '_id' values in REST API tests to strings (#82681 ) Follow-up from #77144 (comment) with converting id/_id to always be strings instead of integers. This makes the type value in the Elasticsearch specification be only string instead of string \| number. this change was generated using following command on ubuntu find . -type f -name ".yml" -print0 \| xargs -0 sed -i -r 's/([^a-zA-Z0-9_\.]id\|[^a-zA-Z0-9_]_id):(\s)([0-9]+)/\1:\2"\3"/g'	2022-02-10 09:14:17 +01:00
weizijun	b6e8b59880	TSDB: fix reindex failed tests without feature flag (#81967 ) fix as the #80945 do. register a settings update consumer for the end_time for the tsdb index even when the end_time setting wasn't registered. Pass the feature flag to reindex yaml tests. Co-authored-by: Igor Motov <igor@motovs.org>	2022-01-06 14:45:08 -05:00
Stuart Tettemer	c937a099af	Script: fields API for x-pack version, doc version, seq no, mumur3 (#81476 ) Adds scripting fields API support the rest of the long fields: * [`_version`](https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-index_.html#index-versioning) - `VersionDocValuesField` * [`_seq_no`](https://www.elastic.co/guide/en/elasticsearch/reference/master/optimistic-concurrency-control.html) - `SeqNoDocValuesField` * [`murmur3`](https://www.elastic.co/guide/en/elasticsearch/plugins/current/mapper-murmur3-usage.html) - `Murmur3DocValueField` * Added Painless support to the murmur3 mapper plugin. All `SortedNumericDocValues` that are interpreted as longs are now subclasses of `AbstractLongDocValuesField`, including murmur, doc version and seq no above as well as `LongDocValuesField` and `UnsignedLongDocValuesField` Also adds: * [x-pack's version](https://www.elastic.co/guide/en/elasticsearch/reference/master/version.html) - `VersionStringDocValuesField` * Created new `Version` value type as a location for future helpers for comparing versions. * Implements `toString` for the expected representation of the version * Implements `asString(String)` and `asString(int, String)`, `asStrings()` converters on field. Refs: #79105	2022-01-05 17:04:38 -06:00
Jack Conradson	1adb59c041	Split off the values supplier for ScriptDocValues (#80635 ) This change makes all ScriptDocValues purely a wrapper around a supplier. (Similar to what FieldValues was.) However, there are some important differences: * This is meant to be transitory. As more DocValuesFields are completed, more of the simple suppliers (ones that aren't DocValuesFields) can be removed. * ScriptDocValues is the wrapper rather than the supplier. DocValuesFields are eventually the target suppliers which makes it really easy to remove the simple suppliers once they are no longer necessary. * ScriptDocValues can be easily deprecated and removed without having to move their code to DocValuesFields. Once ScriptDocValues is removed we can remove the supplier code from DocValuesFields. * DelegateDocValuesField ensures that any ScriptDocValues field are not supplied by another DocValuesField with an assert statement. This helps us to identify bugs during testing. * ScriptDocValues no longer have setNextDocId. This helps us identify bugs during compilation. * Conversions will not share/wrap suppliers since the suppliers are transitory.	2021-11-29 09:41:03 -08:00
Jack Conradson	449d8e406b	Add plumbing for the scripting fields api that returns values based on a mapped type (#80286 ) This change adds a ToScriptField class with the expectation it will be subclassed based on the needs of each mapped type to produce a DocValuesField used by the scripting fields api. This is intended to replace the more generic return of ScriptDocValues. The change made here only targets classes implementing the LeafNumericFieldData interface to keep the initial change smaller, but is also an example for how this would work for other types of LeafFieldData as well. It starts with the fielddataBuilder method of each MappedFieldType (where the appropriate subclass of ToScriptField is specified) then passes through the IndexFieldData.Builder to the IndexData.load method. From here the generated LeafFieldData uses the ToScriptField.getScriptField method to generate the appropriate type of DocValuesField as required by the new scripting fields api. This design seems like the best way to meet the requirements for the scripting fields api by allowing enough information to pass all the way to the LeafFieldData, but without directly coupling the LeafFieldData to a mapped type so that the separation remains. There is also a precedent already set for this design in the keyword field family that uses a scriptFunction to generate a ScriptDocValues of the appropriate type. ToScriptField would eventually replace scriptFunction.	2021-11-09 09:50:57 -08:00
Mark Vieira	12ad399c48	Reformat Elasticsearch source	2021-10-27 08:19:51 -07:00
Chris Hegarty	20c9f756d2	Fix split package org.elasticsearch.common.xcontent (#78831 ) Fix the split package org.elasticsearch.common.xcontent, between server and the x-content lib. Move the x-content lib exported package from org.elasticsearch.common.xcontent to org.elasticsearch.xcontent ( following the naming convention of similar libraries ). Removing split packages is a prerequisite to modularization.	2021-10-08 17:14:26 +01:00
Alan Woodward	9312eba5ed	Change Mapper.build() to take a context object (#77108 ) Mapper.build() currently takes a ContentPath object that it can use to generate field type names that will include its parent names. We would like to expand field types to include more information about their parents, and ContentPath does not hold this information. This commit replaces the ContentPath parameter with a new MapperBuilderContext, which currently holds only the content path information but can be expanded in future to hold parent relationship information. Relates to #75474	2021-09-08 16:34:14 +01:00
Rene Groeschke	35ec6f348c	Introduce simple public yaml-rest-test plugin (#76554 ) This introduces a basic public yaml rest test plugin that is supposed to be used by external elasticsearch plugin authors. This is driven by #76215 - Rename yaml-rest-test to intern-yaml-rest-test - Use public yaml plugin in example plugins Co-authored-by: Mark Vieira <portugee@gmail.com>	2021-08-31 08:45:52 +02:00
Luca Cavanna	c6641bf00c	Rename ParseContext to DocumentParserContext (#74963 ) ParseContext is used to parse documents. It was easily confused with ParserContext (now renamed to MappingParserContext) which is instead used to parse mappings. To remove any confusion, this commit renames ParseContext to DocumentParserContext and adapts its subclasses accordingly.	2021-07-06 09:15:59 -04:00
Alan Woodward	b27eaa38dc	Remove 'external values', and replace with swapped out XContentParsers (#72203 ) The majority of field mappers read a single value from their positioned XContentParser, and do not need to call nextToken. There is a general assumption that the same holds for any multifields defined on them, and so the XContentParser is passed down to their multifields builder as-is. This assumption does not hold for mappers that accept json objects, and so we have a second mechanism for passing values around called 'external values', where a mapper can set a specific value on its context and child mappers can then check for these external values before reading from xcontent. The disadvantage of this is that every field mapper now needs to check its context for external values. Because the values are defined by their java class, we can also know that in the vast majority of cases this functionality is unused. We have only two mappers that actually make use of this, CompletionFieldMapper and GeoPointFieldMapper. This commit removes external values entirely, and replaces it with the ability to pass a modified XContentParser to multifields. FieldMappers can just check the parser attached to their context for data and don't need to worry about multiple sources. Plugins implementing field mappers will need to take the removal of external values into account. Implementations that are passing structured objects as external values should instead use ParseContext.switchParser and wrap the objects using MapXContentParser.wrapObject(). GeoPointFieldMapper passes on a fake parser that just wraps its input data formatted as a geohash; CompletionFieldMapper has a slightly more complicated parser that in general wraps its metadata, but if textOrNull() is called without the parser being advanced just returns its text input. Relates to #56063	2021-04-29 09:17:18 +01:00
Jake Landis	b1ef1fd800	Introduce yamlRestCompatTests for :plugins projects (#71440 )	2021-04-08 16:11:50 -05:00
Mark Vieira	6339691fe3	Consolidate REST API specifications and publish under Apache 2.0 license (#70036 )	2021-03-26 16:20:14 -07:00
Nik Everett	91c700bd99	Super randomized tests for fetch fields API (#70278 ) We've had a few bugs in the fields API where is doesn't behave like we'd expect. Typically this happens because it isn't obvious what we expct. So we'll try and use randomized testing to ferret out what we want. This adds a test for most field types that asserts that `fields` works similarly to `docvalues_fields`. We expect this to be true for most fields. It does so by forcing all subclasses of `MapperTestCase` to define a method that makes random values. It declares a few other hooks that subclasses can override to further randomize the test. We skip the test for a few field types that don't have doc values: * `annotated_text` * `completion` * `search_as_you_type` * `text` We should come up with some way to test these without doc values, even if it isn't as nice. But that is a problem for another time, I think. We skip the test for a few more types just because I wanted to cut this PR in half so we could get to reviewing it earlier. We'll get to those in a follow up change. I've filed a few bugs for things that are inconsistent with `docvalues_fields`. Typically that means that we have to limit the random values that we generate to those that do round trip properly.	2021-03-24 14:16:27 -04:00
Mark Vieira	a92a647b9f	Update sources with new SSPL+Elastic-2.0 license headers As per the new licensing change for Elasticsearch and Kibana this commit moves existing Apache 2.0 licensed source code to the new dual license SSPL+Elastic license 2.0. In addition, existing x-pack code now uses the new version 2.0 of the Elastic license. Full changes include: - Updating LICENSE and NOTICE files throughout the code base, as well as those packaged in our published artifacts - Update IDE integration to now use the new license header on newly created source files - Remove references to the "OSS" distribution from our documentation - Update build time verification checks to no longer allow Apache 2.0 license header in Elasticsearch source code - Replace all existing Apache 2.0 license headers for non-xpack code with updated header (vendored code with Apache 2.0 headers obviously remains the same). - Replace all Elastic license 1.0 headers with new 2.0 header in xpack.	2021-02-02 16:10:53 -08:00
Julie Tibshirani	5852fbedf5	Rename QueryShardContext -> SearchExecutionContext. (#67490 ) We decided to rename `QueryShardContext` to clarify that it supports all parts of search request execution. Before there was confusion over whether it should only be used for building queries, or maybe only used in the query phase. This PR also updates the javadocs. Closes #64740.	2021-01-14 09:11:59 -08:00
Julie Tibshirani	f4a462d05e	Simplify how source is passed to fetch subphases. (#65292 ) This PR simplifies how the document source is passed to each fetch subphase. A summary of the strategy: * For each document, we try to eagerly load the source and store it on `HitContext`. Most subphases that access source, like source filtering and highlighting, use `HitContext`. For nested hits, we filter the parent source and also store this source on `HitContext`. * Only for non-nested documents, we also store the loaded source on `QueryShardContext#lookup`. This allows subphases that access source through `SearchLookup` to use the pre-loaded source when possible. This is now a common occurrence, since runtime fields are supported in the 'fields' option and may soon be supported in highlighting. There is no longer a special `SearchLookup` just for the fetch phase. This was not necessary and was mostly caused by a misunderstanding of how `QueryShardContext` should be used. Addresses #62511.	2020-11-20 14:09:41 -08:00
Alan Woodward	0fd70ae383	Remove Mapper.BuilderContext (#64625 ) Mapper.BuilderContext is a simple wrapper around two objects, some IndexSettings and a ContentPath. The IndexSettings are the same as those provided in the ParserContext, so we can simplify things here by removing them and just passing ContentPath directly to Mapper.Builder#build()	2020-11-05 10:48:39 +00:00
Luca Cavanna	344ad33a16	Remove ValueFetcher depedendency from MapperService (#64524 ) The signature of MappedFieldType#valueFetcher requires MapperService as an argument which is unfortunate as that is one of the reasons why FetchContext exposes the whole MapperService. Such use of MapperService can be replaced with exposing the QueryShardContext which encapsulates the MapperService.	2020-11-04 12:08:34 +01:00
Alan Woodward	a5168572d5	Collapse ParametrizedFieldMapper into FieldMapper (#64365 ) Now that all our FieldMapper implementations extend ParametrizedFieldMapper, we can collapse the two classes together, and remove a load of cruft from FieldMapper that is unused. In particular: * we no longer need the lucene FieldType field on FieldMapper * we no longer use clone() for merging, so we can remove it from all impls * the serialization code in FieldMapper that assumes we're looking at text fields can go	2020-11-02 15:07:52 +00:00
Luca Cavanna	f491422e1e	Ensure field types consistency on supporting text queries (#63487 ) Some supported field types don't support term queries, and throw exception in their termQuery method. That exception is either an IllegalArgumentException or a QueryShardException. There is logic in MatchQuery that skips the field or not depending on the exception that is thrown. Also, such field types should hold a TextSearchInfo.NONE while that is not always the case. With this commit we make the following changes: - streamline using TextSearchInfo.NONE in all field types that don't support text queries - standardize the exception being thrown when a field type does not support term queries to be IllegalArgumentException. Note that this is not a breaking change as both exceptions previously returned translated to 400 status code. - Adapt the MatchQuery logic to skip fields that don't support term queries. There is no need to call termQuery passing an empty string and catch exceptions potentially thrown. We can rather check the TextSearchInfo which tells already whether the field supports text queries or not. - add a test method to MapperTestCase that verifies the consistency of a field type by verifying that it is not searchable whenever it uses TextSearchInfo.NONE, while it is otherwise. This is what triggered all of the above changes.	2020-10-13 11:05:43 +02:00
Julie Tibshirani	8c56bbc3e6	Add factory methods for common value fetchers. (#63438 ) This PR adds factory methods for the most common implementations: * `SourceValueFetcher.identity` to pass through the source value untouched. * `SourceValueFetcher.toString` to simply convert the source value to a string.	2020-10-08 11:58:36 -07:00

1 2 3 4

152 commits