This commit changes the signature of InternalAggregation#buildAggregations(long[]) to
InternalAggregation#buildAggregations(LongArray) to avoid allocations of humongous arrays.
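A rough before/after sketch of the change (simplified; the real method carries more context than shown here):

```java
// before: one contiguous long[] covering every owning bucket ordinal,
// which for high-cardinality aggregations became a humongous allocation
// InternalAggregation[] buildAggregations(long[] owningBucketOrds) throws IOException;

// after: LongArray can be paged by BigArrays, so a large set of ordinals
// no longer requires a single huge contiguous heap allocation
InternalAggregation[] buildAggregations(LongArray owningBucketOrds) throws IOException;
```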
* Refactor build params for FieldMapper
* more mappers and tests
* more mappers
* more mappers
* spotless
* spotless
* stored by default
* Revert "stored by default"
This reverts commit bbd247d64b.
* restore storeIgnored
* sync
* list valid values for SourceKeepMode
* small refactoring
* spotless
This addresses a long-standing TODO that caused quite a few bugs over time, in that the mapper name does not include its full path, while the MappedFieldType name does.
We have renamed Mapper.Builder#name to leafName (#109971) and Mapper#simpleName to leafName (#110030). This commit renames Mapper#name to fullPath for clarity.
This required some adjustments in FieldAliasMapper to avoid confusion between the existing path method and fullPath. I renamed path to targetPath for clarity.
ObjectMapper already had a fullPath method that simply returned name, so it could be removed.
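To illustrate the naming after these renames (hypothetical field, not taken from the commit):

```java
// For a mapper nested at "user.address.city":
//   mapper.leafName()   -> "city"                (the last path element)
//   mapper.fullPath()   -> "user.address.city"   (what Mapper#name used to return ambiguously)
//   fieldType.name()    -> "user.address.city"   (MappedFieldType always used the full path)
```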
This addresses a long-standing TODO that caused quite a few bugs over time, in that the mapper name does not include its full path, while
the MappedFieldType name does. We have a method called simpleName to signal that, but leafName signals it more clearly and aligns with
the name we recently introduced in Mapper.Builder (renamed from name to leafName).
Relates to #109971
This addresses a long-standing TODO that caused quite a few bugs over time, in that the mapper name does not include its full path, while
the MappedFieldType name does.
During the fetch phase, there are a number of stored fields that are requested explicitly or loaded by default. That information is included in the `StoredFieldsSpec` that each fetch sub phase exposes.
We attempt to provide stored fields that are already loaded to the fields lookup that scripts as well as value fetchers use to load field values (via `SearchLookup`). This is done in `PreloadedFieldLookupProvider`. The current logic makes values available for fields that have been found, so that scripts or value fetchers that request them don't load them again ad hoc. Stored fields that don't have a value for a specific doc, however, are treated like any other field that was not requested and are loaded again, even though no value will be found, which causes overhead.
This change makes available to `PreloadedFieldLookupProvider` the list of required stored fields, so that it can better distinguish between fields that we already attempted to load (although we may not have found a value for them) and those that need to be loaded ad-hoc (for instance because a script is requesting them for the first time).
This is an existing issue that has become evident as we moved fetching of metadata fields to `FetchFieldsPhase`, which relies on value fetchers, and hence on `SearchLookup`. We end up attempting to load default metadata fields (`_ignored` and `_routing`) twice when they are not present in a document, which makes us call `LeafReader#storedFields` additional times for the same document, providing a `SingleFieldVisitor` that will never find a value.
Another existing issue that this PR fixes is for the `FetchFieldsPhase` to extend the `StoredFieldsSpec` that it exposes to include the metadata fields that the phase is now responsible for loading. That results in `_ignored` being included in the output of the debug stored fields section when profiling is enabled. The fact that it was previously missing is an existing bug (it was missing in `StoredFieldLoader#fieldsToLoad`).
Yet another existing issue that this PR fixes is that `_id` has until now always been loaded on demand when requested via fetch fields or a script. That is because it is not part of the preloaded stored fields that the fetch phase passes over to the `PreloadedFieldLookupProvider`. This causes overhead, as the field has already been loaded and should not be loaded again when explicitly requested.
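A minimal sketch of the distinction this enables; the field and method names here are assumptions for illustration, not the actual implementation:

```java
// Sketch: separate "attempted but absent" from "never attempted".
Object getStoredValue(String field) {
    if (preloadedValues.containsKey(field)) {
        return preloadedValues.get(field);   // loaded and found during the fetch phase
    }
    if (requiredStoredFields.contains(field)) {
        return null;                         // already attempted; simply absent in this doc
    }
    return loadAdHoc(field);                 // e.g. a script requesting it for the first time
}
```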
Here we extract the logic to populate metadata fields such as `_ignored`, `_routing`, `_size` and the deprecated `_type` into `FetchFieldsPhase` so that we can use the `ValueFetcher` interface to retrieve field values. This allows us to fetch values regardless of whether the `Mapper` uses stored fields or doc values.
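A usage sketch of the `ValueFetcher` abstraction this relies on (simplified, and the exact signatures vary across versions):

```java
// ValueFetcher hides whether a value comes from stored fields or doc values.
ValueFetcher fetcher = fieldType.valueFetcher(searchExecutionContext, null);
fetcher.setNextReader(leafReaderContext);
List<Object> values = fetcher.fetchValues(source, docId, new ArrayList<>());
```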
To simplify the migration away from version based skip checks in YAML specs,
this PR adds a synthetic version feature `gte_vX.Y.Z` for any version at or before 8.14.0.
New test specs for 8.14 or later are expected to use respective new cluster features,
or a test-only feature supplied via ESRestTestCase#createAdditionalFeatureSpecifications
if sufficient.
Supporting non-keyword fields requires including non-keyword fields from
the routing path in routing calculations. Routing is performed on
coordinating nodes, which lack mappings (or the mappings haven't been
created yet, for dynamically-defined dimensions), so the routing hash they
calculate is passed to data nodes and stored in a new field, namely
`_ts_routing_hash`. This hash is included in the `_id` field, in turn, so
that get-by-id and delete-by-id operations can consistently reach the
right shard.
A few interesting points:
- The hash is passed from the coordinating to data nodes using the `routing` field in `IndexRequest`; adding another field to the latter requires updating dozens of classes.
- We explicitly skip (double-)storing the hash in the routing field, as the latter is not optimized for storage with the TSDB codec.
- The routing hash may not be available in Translog operations; in that case, it can be retrieved from the `id` prefix, as the sketch below shows.
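Roughly how that fallback works; the helper names and id layout below are assumptions for illustration:

```java
// Sketch: the routing hash is written as a fixed-length prefix of _id at
// index time, so it can be recovered when a translog operation doesn't
// carry it explicitly.
String id = encodeRoutingHash(routingHash) + tsidAndTimestampPart;  // index time
int recoveredHash = decodeRoutingHash(id);                          // reads the prefix back
```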
Related to https://github.com/elastic/elasticsearch/issues/103567
This commit enables inter-segment search concurrency for terms aggs when the cardinality of the field being aggregated on is lower than the shard size. This avoids the precision errors that would be caused by parallelizing execution across slices for fields with high cardinality.
For terms aggs that are ordered by key, we still take cardinality into account, parallelizing only low-cardinality fields to avoid the performance overhead of parallelizing high-cardinality fields.
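The decision boils down to something like the following (an illustrative simplification, not the actual code):

```java
// Parallelize across slices only when a slice can hold every distinct term,
// so no slice's local top-N can silently drop a term another slice counted.
boolean supportsParallelCollection(long fieldCardinality, int shardSize) {
    // a negative cardinality means it is unknown, so stay sequential
    return fieldCardinality >= 0 && fieldCardinality <= shardSize;
}
```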
Same as #101175, shorten `client().prepareIndex(index)` and
`client().prepareIndex().setIndex(index)` via a test utility.
Saves lots of code now and sets up some follow-up simplifications.
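The shape of the shortcut (a sketch; the actual helper lives in the shared test infrastructure):

```java
// In the test base class:
protected static IndexRequestBuilder prepareIndex(String index) {
    return client().prepareIndex(index);
}

// before: client().prepareIndex("test").setId("1").setSource("f", "v").get();
// after:  prepareIndex("test").setId("1").setSource("f", "v").get();
```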
Cleaning this up a little even though it's still quite horrible.
`.get()` in this API actually means `actionGet()` so to speak.
I think a good first step to cleaning this up is to at least reduce
the duplication though and save 1k lines.
`ExternalTestCluster` doesn't really make sense now that the transport
client is removed. We only use it in the ML integ test suite and it'd be
good to avoid expanding its usage further, so this commit deprecates it
and removes the functionality in `ESIntegTestCase` that might quietly
switch to using it in a new test suite if running with certain system
properties.
Relates #49582
Remove `assertSearchResponse`, which was just an alias for
`assertNoFailures`, and then clean up many spots as a result
by combining the hit-count and no-failures assertions into a
single method.
follow-up to #100966
Follow up to #100966 introducing new combined assertion `assertSearchHitsWithoutFailures`
to combine no-failure, count, and id assertions into one block.
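A usage sketch of the combined assertion (the exact argument order is an assumption, and the usual static imports are elided):

```java
// One call replaces assertNoFailures + assertHitCount + assertSearchHits,
// with the expected count implied by the number of ids.
assertSearchHitsWithoutFailures(
    prepareSearch("index").setQuery(termQuery("field", "value")),
    "doc-1", "doc-2"
);
```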
We'd like to make `SearchResponse` reference counted and pooled but there are around 6k
instances of tests that create a `SearchResponse` local variable that would need to be
released manually to avoid leaks in the tests.
This does away with about 10% of these spots by adding an overload of `assertHitCount`
that handles the actual execution of the search request and its release automatically,
making use of it in all spots where the `.get()` on the request builder could be inlined
semi-automatically and in a straightforward fashion without other code changes.
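Roughly what the new `assertHitCount` variant does (a sketch; the real implementation may differ in detail):

```java
// Executes the request and guarantees the response is released, so the test
// never needs to hold (and leak) a SearchResponse local variable.
public static void assertHitCount(SearchRequestBuilder requestBuilder, long expectedHitCount) {
    SearchResponse response = requestBuilder.get();
    try {
        assertHitCount(response, expectedHitCount);
    } finally {
        response.decRef();
    }
}
```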
Another round of automated fixes to this, marking things that can be
made static as static. Saves some JIT cycles but also turns some lambdas
from capturing to non-capturing and makes the "utilityness" of some
classes visible.
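An example of the capturing-to-non-capturing effect (illustrative):

```java
import java.util.function.Supplier;

class Example {
    private String label(long v) { return "v=" + v; }
    private static String staticLabel(long v) { return "v=" + v; }

    void demo() {
        Supplier<String> capturing = () -> label(42);           // captures `this`: a new lambda object per enclosing instance
        Supplier<String> nonCapturing = () -> staticLabel(42);  // captures nothing: the JVM can reuse a single instance
    }
}
```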
As part of our effort to increase the supportability of Elasticsearch,
this PR changes many aggregations errors from being 500 class (which is
the default for `AggregationExecutionException`) to 400 class (which is
the default for `IllegalArgumentException`). All of these cases are
errors that should not be retried, as the failures relate directly
to the content of the request and/or the state of the index.
There are definitely more cases where we are returning an incorrect
error code, but for this PR I focused on just changing the low hanging
fruit.
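The typical shape of such a change (the error message here is made up):

```java
// before: AggregationExecutionException defaults to a 500-class status,
// blaming the server for what is really a bad request
throw new AggregationExecutionException("invalid order path [" + path + "]");

// after: IllegalArgumentException defaults to a 400-class status,
// telling the client not to retry the same request
throw new IllegalArgumentException("invalid order path [" + path + "]");
```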
Data-stream mappings require a @timestamp field to be present and configured
as a date with a specific set of parameters. The index-wide setting of
ignore_malformed can cause problems here if it is set to true, because it needs
to be false for the @timestamp field.
This commit detects whether a set of mappings is configured for a data stream by checking
for the presence of a DataStreamTimestampFieldMapper metadata field, and passes
that information on during Mapper construction as part of the MapperBuilderContext.
DateFieldMapper.Builder now checks to see if it is specifically for a data stream timestamp
field, and if it is, sets ignore_malformed to false.
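Conceptually, the new check amounts to the following (method names are assumptions):

```java
// In DateFieldMapper.Builder (sketch): the data stream timestamp field must
// reject malformed dates even when index.mapping.ignore_malformed is true.
boolean ignoreMalformed = indexWideIgnoreMalformed;
if (context.isDataStream() && "@timestamp".equals(leafName())) {
    ignoreMalformed = false;
}
```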
Relates to #96051
Constants for TransportVersion currently live alongside the class
definition. This has been fine since there was only one set of
constants. However, to support serverless, some constants will need to
be defined elsewhere.
This commit moves the existing constants to a new holder class,
TransportVersions. It is almost entirely mechanical, using IntelliJ move
members. The only non-mechanical part was slightly shifting how CURRENT
is found: a LATEST is defined in TransportVersions that is automatically
calculated (since we already have it, there is no need to define it manually).
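A condensed sketch of the resulting layout (the constant names and helpers are illustrative):

```java
public class TransportVersions {
    // constants moved here mechanically from TransportVersion
    public static final TransportVersion V_8_500_020 = def(8_500_020);
    public static final TransportVersion V_8_500_061 = def(8_500_061);

    // LATEST is computed from the constants above rather than declared by
    // hand, so it cannot drift out of date as new constants are added
    static final TransportVersion LATEST = findLatest(TransportVersions.class);
}
```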
Hide the creation of the index searcher from the implementers by changing the signature of
AggregatorTestCase#searchAndReduce and AggregatorTestCase#createAggregationContext to take
an IndexReader instead of an IndexSearcher.
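At a call site, the change reads roughly as follows (parameter lists abbreviated):

```java
// before: tests created and configured the searcher themselves
// var result = searchAndReduce(newIndexSearcher(reader), query, aggBuilder, fieldType);

// after: hand over the reader; the searcher is created inside the test framework
var result = searchAndReduce(reader, query, aggBuilder, fieldType);
```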
When the `subobjects` property is set to false and we encounter an object
while parsing, we need a way to understand whether its FieldMapper is able
to parse an object. If that's the case, we can provide the entire object to
the FieldMapper; otherwise its name becomes part of the dotted field
name of each internal value.
This has been achieved by adding the `supportsParsingObject()` method
to the `FieldMapper` class. This method defaults to `false`, since the
majority of FieldMappers do not support parsing objects, and is
overridden to return `true` by the ones that do support objects.
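The hook as described, sketched as the default plus an override:

```java
// In FieldMapper: the default, since most mappers only understand leaf values.
public boolean supportsParsingObject() {
    return false;
}

// In a mapper that can consume a whole object:
@Override
public boolean supportsParsingObject() {
    return true;
}
```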
Lots of spots where we did weird things around streams: redundant stream creation, redundant collecting
before adding all the collected elements to another collection, redundant streams for joining strings,
using the less efficient `Collectors.toList`, and in a few cases incorrectly relying on the result being mutable.
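Examples of the patterns being cleaned up (illustrative):

```java
// redundant collect-then-addAll
result.addAll(items.stream().map(Item::name).collect(Collectors.toList()));
// simpler: add directly
items.stream().map(Item::name).forEach(result::add);

// redundant stream for joining strings
String joined = names.stream().collect(Collectors.joining(","));
// simpler:
String joined2 = String.join(",", names);

// Stream#toList() is cheaper than Collectors.toList(), but returns an
// unmodifiable list, so callers must not rely on mutating the result
List<String> list = items.stream().map(Item::name).toList();
```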
Replacing the remaining usages that I could automatically replace
and a couple that I did by hand in this PR.
Also, added the same shortcut to the single node tests to save some
duplication there.