This commit updates the low-level bounds checks in JDKVectorLibrary and adds a benchmark, so that we can more easily benchmark the low-level operations.
Note: I added the mr-jar gradle plugin to the benchmarks so that we can compile with preview features in Java 21, namely MemorySegment.
Keep better track of shard contexts using RefCounted, so they can be released more aggressively during operator processing. For example, during TopN, we can potentially release some contexts if they don't pass the limit filter.
This is done in preparation for the TopN fetch optimization, which will delay the fetching of additional columns to the data node coordinator, instead of doing it in each individual worker, thereby reducing IO. Since the node coordinator would need to maintain the shard contexts for a potentially longer duration, it is important that we try to release what we can earlier.
An even more advanced optimization is to delay fetching to the main cluster coordinator, but that would be more involved, since we need to first figure out how to transport the shard contexts between nodes.
Summary of main changes:
- DocVector now maintains a RefCounted instance per shard (see the sketch after this list).
- Things which can build or release DocVectors (e.g., LuceneSourceOperator, TopNOperator) can also hold RefCounted instances, so they can pass them to DocVector and also ensure contexts aren't released while they might still be used later.
- Driver's main loop iteration (runSingleLoopIteration) now closes its finished operators even between processing different operators. This is extra aggressive, and was mostly done to improve testability.
- Added a couple of tests to TopNOperator and a new integration test, EsqlTopNShardManagementIT, which uses the pausable plugin framework to check that TopNOperator releases shard contexts as early as possible.
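To make the first item concrete, here is a minimal sketch of the ref-counting pattern, assuming Elasticsearch's RefCounted interface; the holder class and its methods are illustrative, not the actual DocVector API:
```
import org.elasticsearch.core.RefCounted;

// Illustrative holder: anything that builds or copies a DocVector bumps the
// shard context's count, and the context is only released once every holder
// has decRef'd it.
class ShardContextHolder {
    private final RefCounted shardContext;

    ShardContextHolder(RefCounted shardContext) {
        shardContext.incRef();            // keep the shard context alive while we hold it
        this.shardContext = shardContext;
    }

    void close() {
        shardContext.decRef();            // the final decRef actually releases the context
    }
}
```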
- Add a new `LongTopNBlockHash` implementation taking care of skipping unused values.
- Add a `TopNUniqueSet` to take care of storing the top N values (without nulls); a sketch of the idea follows after this list.
- Add a `TopNMultivalueDedupeLong` class helping with it (an adaptation of the existing `MultivalueDedupeLong`).
- Add some tests to `HashAggregationOperator`. It wasn't changed much, but the tests help a bit with the E2E coverage.
- Add MicroBenchmarks for TopN groupings, to ensure we're actually improving things with this.
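As referenced above, a minimal sketch of the "keep only the top N unique values" idea behind `TopNUniqueSet`, using a JDK `TreeSet` for clarity (the real class works on primitive longs with breaker-tracked memory, not boxed values):
```
import java.util.TreeSet;

class TopNLongs {
    private final TreeSet<Long> values = new TreeSet<>();
    private final int limit;

    TopNLongs(int limit) {
        this.limit = limit;
    }

    /** Returns true if the value made it into the ascending top N. */
    boolean add(long v) {
        if (values.size() < limit) {
            return values.add(v);         // still filling up; TreeSet dedupes for us
        }
        if (v >= values.last()) {
            return false;                 // can't beat the current worst value: skip it
        }
        if (values.add(v)) {
            values.pollLast();            // evict the worst value to stay at N entries
            return true;
        }
        return false;                     // duplicate of a value we already kept
    }
}
```
Something like the `v >= values.last()` check is what allows skipping values that can never make the top N.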
When parsing documents, we receive the document as UTF-8 encoded data, which
we then parse, converting the fields to Java-native UTF-16 encoded Strings.
We then convert these strings back to UTF-8 for storage in Lucene.
This patch skips the redundant conversion, instead passing Lucene a
direct reference to the received UTF-8 bytes when possible.
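A sketch of the idea, assuming a Lucene `StringField` built from a `BytesRef` (the actual change touches the document parsing internals; this just shows the transcoding being skipped):
```
import org.apache.lucene.document.Field;
import org.apache.lucene.document.StringField;
import org.apache.lucene.util.BytesRef;

class DirectUtf8 {
    // Before: new String(utf8, UTF_8), after which Lucene re-encodes to UTF-8 internally.
    // After: wrap the received bytes directly; no UTF-16 round trip.
    static Field keywordField(String name, byte[] utf8, int offset, int length) {
        return new StringField(name, new BytesRef(utf8, offset, length), Field.Store.NO);
    }
}
```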
This introduces an optimization to push down to Lucene those language constructs that aim at case-insensitive regular expression matching, used with the `LIKE` and `RLIKE` operators, such as:
* `| WHERE TO_LOWER(field) LIKE "abc*"`
* `| WHERE TO_UPPER(field) RLIKE "ABC.*"`
These are now pushed down to Lucene as case-insensitive `wildcard` and `regexp` queries, respectively.
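For reference, roughly the equivalent queries expressed with the Java query builders (a sketch; the ESQL planner constructs the Lucene queries directly during pushdown):
```
import org.elasticsearch.index.query.QueryBuilder;
import org.elasticsearch.index.query.QueryBuilders;

class CaseInsensitivePushdown {
    // | WHERE TO_LOWER(field) LIKE "abc*"
    static QueryBuilder forLike() {
        return QueryBuilders.wildcardQuery("field", "abc*").caseInsensitive(true);
    }

    // | WHERE TO_UPPER(field) RLIKE "ABC.*"
    static QueryBuilder forRlike() {
        return QueryBuilders.regexpQuery("field", "ABC.*").caseInsensitive(true);
    }
}
```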
Closes #127479
Creates a `ROUND_TO` function that rounds its input to one of the
provided values. Like so:
```
ROUND_TO(v, 0, 5000, 10000, 20000, 40000, 100000)
v | ROUND_TO
0 | 0
100 | 0
6000 | 5000
45001 | 40000
999999 | 100000
```
For some sequences of numbers you could do this with the `/` operator -
but for arbitrary sequences of numbers you needed `CASE`, which is quite
slow. And hard to read!
Rewriting the example above would look like:
```
CASE (
v < 5000, 0,
v < 10000, 5000,
v < 20000, 10000,
v < 40000, 20000,
v < 100000, 40000,
100000
)
```
Even better, this is *fast*:
```
(operation) Mode Cnt Score Error Units
round_to_4_via_case avgt 7 138.124 ± 0.738 ns/op
round_to_4 avgt 7 0.805 ± 0.011 ns/op
round_to_3 avgt 7 0.739 ± 0.011 ns/op
round_to_2 avgt 7 0.651 ± 0.009 ns/op
date_trunc avgt 7 2.425 ± 0.018 ns/op
```
I've included a comparison to `DATE_TRUNC` above because we should be
able to rewrite `DATE_TRUNC` into `ROUND_TO` when we know the date range
of the index. This doesn't do it now, but it should be possible.
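One plausible way to evaluate `ROUND_TO` cheaply is a binary search over the sorted points - a sketch, not the actual evaluator:
```
import java.util.Arrays;

class RoundTo {
    private final long[] points;

    RoundTo(long... points) {
        this.points = points.clone();
        Arrays.sort(this.points);         // sort once, reuse for every row
    }

    /** Rounds v down to the greatest point <= v, clamped to the smallest point. */
    long round(long v) {
        int idx = Arrays.binarySearch(points, v);
        if (idx >= 0) {
            return points[idx];           // exact match
        }
        int insertion = -idx - 1;         // index of the first point greater than v
        return points[Math.max(insertion - 1, 0)];
    }
}
```
With `new RoundTo(0, 5000, 10000, 20000, 40000, 100000)`, `round(45001)` returns `40000`, matching the table above, in `O(log points)` per row instead of `CASE`'s chain of per-row comparisons.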
This change updates the code to always create SourceProvider instances via MappingLookup, avoiding direct exposure to the underlying source format (synthetic or stored).
It also aligns source filtering behaviour between SourceProvider and SourceLoader, ensuring consistent application of filters.
This change is needed to enable source filtering to occur earlier in the fetch phase, for example, when constructing a synthetic source.
Currently, time-series aggregations use the `values` aggregation to
collect dimension values. While we might introduce a specialized
aggregation for this in the future, for now, we are using `values`, and
the inputs are likely ordinal blocks. This change speeds up the `values`
aggregation when the inputs are ordinal-based.
Execution time reduced from 461ms to 192ms for 1000 groups.
```
ValuesAggregatorBenchmark.run BytesRef 10000 avgt 7 461.938 ± 6.089 ms/op   (before)
ValuesAggregatorBenchmark.run BytesRef 10000 avgt 7 192.898 ± 1.781 ms/op   (after)
```
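A sketch of why ordinals help, using Lucene's `BytesRefHash` as a stand-in for the aggregator's own hash (the names here are illustrative): hash each unique dictionary entry once, then translate every row through the small ids array instead of re-hashing a full `BytesRef` per row.
```
import java.util.function.IntConsumer;
import org.apache.lucene.util.BytesRef;
import org.apache.lucene.util.BytesRefHash;

class OrdinalValuesSketch {
    static void collect(BytesRefHash hash, BytesRef[] dictionary, int[] ordinals, IntConsumer perRow) {
        int[] ids = new int[dictionary.length];
        for (int i = 0; i < dictionary.length; i++) {
            int id = hash.add(dictionary[i]);   // O(unique_values) hash work
            ids[i] = id < 0 ? -1 - id : id;     // BytesRefHash returns -(id+1) if already present
        }
        for (int ord : ordinals) {
            perRow.accept(ids[ord]);            // O(1) array lookup per row
        }
    }
}
```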
This removes all non-test usage of
`Metadata.Builder.put(IndexMetadata, boolean)`
and replaces it with appropriate calls to the equivalent method on
`ProjectMetadata.Builder`.
In most cases this _does not_ make the code project aware, but it does
reduce the number of deprecated methods in use.
This speeds up loading from stored fields by opting more blocks into the
"sequential" strategy. This really kicks in when loading stored fields
like `text`, and when you need less than 100% of documents but more than,
say, 10%. It's most useful when you need something like 99.9% of the
documents. Here are the perf numbers:
```
%100.0 {"took": 403 -> 401,"documents_found":1000000}
%099.9 {"took":3990 -> 436,"documents_found": 999000}
%099.0 {"took":4069 -> 440,"documents_found": 990000}
%090.0 {"took":3468 -> 421,"documents_found": 900000}
%030.0 {"took":1213 -> 152,"documents_found": 300000}
%020.0 {"took": 766 -> 104,"documents_found": 200000}
%010.0 {"took": 397 -> 55,"documents_found": 100000}
%009.0 {"took": 352 -> 375,"documents_found": 90000}
%008.0 {"took": 304 -> 317,"documents_found": 80000}
%007.0 {"took": 273 -> 287,"documents_found": 70000}
%005.0 {"took": 199 -> 204,"documents_found": 50000}
%001.0 {"took": 46 -> 46,"documents_found": 10000}
```
Let's explain this with an example. First, jump to `main` and load a
million documents:
```
rm -f /tmp/bulk
for a in {1..1000}; do
echo '{"index":{}}' >> /tmp/bulk
echo '{"text":"text '$(printf %04d $a)'"}' >> /tmp/bulk
done
curl -s -uelastic:password -HContent-Type:application/json -XDELETE localhost:9200/test
for a in {1..1000}; do
echo -n $a:
curl -s -uelastic:password -HContent-Type:application/json -XPOST localhost:9200/test/_bulk?pretty --data-binary @/tmp/bulk | grep errors
done
curl -s -uelastic:password -HContent-Type:application/json -XPOST localhost:9200/test/_forcemerge?max_num_segments=1
curl -s -uelastic:password -HContent-Type:application/json -XPOST localhost:9200/test/_refresh
echo
```
Now query them all. Run this a few times until it's stable:
```
echo -n "%100.0 "
curl -s -uelastic:password -HContent-Type:application/json -XPOST 'localhost:9200/_query?pretty' -d'{
"query": "FROM test | STATS SUM(LENGTH(text))",
"pragma": {
"data_partitioning": "shard"
}
}' | jq -c '{took, documents_found}'
```
Now fetch 99.9% of documents:
```
echo -n "%099.9 "
curl -s -uelastic:password -HContent-Type:application/json -XPOST 'localhost:9200/_query?pretty' -d'{
"query": "FROM test | WHERE NOT text.keyword IN (\"text 0998\") | STATS SUM(LENGTH(text))",
"pragma": {
"data_partitioning": "shard"
}
}' | jq -c '{took, documents_found}'
```
This should spit out something like:
```
%100.0 {"took":403,"documents_found":1000000}
%099.9 {"took":4098,"documents_found":999000}
```
We're loading *fewer* documents but it's slower! What in the world?!
If you dig into the profile you'll see that it's value loading:
```
$ curl -s -uelastic:password -HContent-Type:application/json -XPOST 'localhost:9200/_query?pretty' -d'{
"query": "FROM test | STATS SUM(LENGTH(text))",
"pragma": {
"data_partitioning": "shard"
},
"profile": true
}' | jq '.profile.drivers[].operators[] | select(.operator | contains("ValuesSourceReaderOperator"))'
{
"operator": "ValuesSourceReaderOperator[fields = [text]]",
"status": {
"readers_built": {
"stored_fields[requires_source:true, fields:0, sequential: true]": 222,
"text:column_at_a_time:null": 222,
"text:row_stride:BlockSourceReader.Bytes": 1
},
"values_loaded": 1000000,
"process_nanos": 370687157,
"pages_processed": 222,
"rows_received": 1000000,
"rows_emitted": 1000000
}
}
$ curl -s -uelastic:password -HContent-Type:application/json -XPOST 'localhost:9200/_query?pretty' -d'{
"query": "FROM test | WHERE NOT text.keyword IN (\"text 0998\") | STATS SUM(LENGTH(text))",
"pragma": {
"data_partitioning": "shard"
},
"profile": true
}' | jq '.profile.drivers[].operators[] | select(.operator | contains("ValuesSourceReaderOperator"))'
{
"operator": "ValuesSourceReaderOperator[fields = [text]]",
"status": {
"readers_built": {
"stored_fields[requires_source:true, fields:0, sequential: false]": 222,
"text:column_at_a_time:null": 222,
"text:row_stride:BlockSourceReader.Bytes": 1
},
"values_loaded": 999000,
"process_nanos": 3965803793,
"pages_processed": 222,
"rows_received": 999000,
"rows_emitted": 999000
}
}
```
It jumps from 370ms to almost four seconds! Loading fewer values! The
second big difference is in the `stored_fields` marker. In the second one
it's `sequential: false` and in the first it's `sequential: true`.
`sequential: true` uses Lucene's "merge" stored fields reader instead of
the default one. It's much more optimized for decoding sequences of
documents.
Previously we only enabled this reader when loading compact sequences of
documents - when the entire block looks like
```
1, 2, 3, 4, 5, ... 1230, 1231
```
If there were any gaps we wouldn't enable it. That was a very
conservative choice we made long ago without doing any experiments. We
knew it was faster without any gaps, but not otherwise. It turns out
it's a lot faster in a lot more cases. I've measured it as faster with
99% gaps, at least on simple documents. I'm a bit worried that this is
too aggressive, so I've made it configurable, with the default being to
use the "merge" loader when at least 10% of the documents in the block's
range are being loaded. So we'd use the merge loader with a block like:
```
1, 11, 21, 31, ..., 1231, 1241
```
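A sketch of the density check (the setting name and the exact placement live in the stored-field reader selection; this is just the shape of the decision):
```
class SequentialReaderChoice {
    /** docs are the ascending doc ids a block needs; minDensity defaults to 0.1. */
    static boolean useSequentialReader(int[] docs, double minDensity) {
        if (docs.length == 0) {
            return false;
        }
        long range = (long) docs[docs.length - 1] - docs[0] + 1;
        return docs.length >= range * minDensity;   // dense enough: use the "merge" reader
    }
}
```
For the block above (`1, 11, 21, ..., 1241`), 125 docs span a range of 1241 - a density of about 10% - so the merge reader kicks in.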
Adds a simple script to run benchmarks for ESQL and collect their
results. The script has a `--test` mode which takes about ten minutes.
Running without `--test` takes about four hours and fifteen minutes.
To speed up `--test` I reworked the "self test" that each benchmark runs
to be optional and disabled in `--test` mode.
Build jump table (disi) while iterating over SortedNumericDocValues for encoding the values, instead of separately iterating over SortedNumericDocValues just to build the jump table.
When index sorting is active, this requires an additional merge sort.
Follow-up from #125403 to use the single shot time benchmark mode,
which makes more sense for benchmarking force merge. The sample time mode would invoke the benchmark methods many times, which in the case of force merge is a no-op.
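In JMH terms the change is just the benchmark mode annotation (the class name here is hypothetical):
```
import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.BenchmarkMode;
import org.openjdk.jmh.annotations.Mode;

@BenchmarkMode(Mode.SingleShotTime)       // one invocation per iteration
public class ForceMergeBenchmark {        // hypothetical benchmark class
    @Benchmark
    public void forceMerge() {
        // ... trigger the force merge under test; a repeat call would be a no-op ...
    }
}
```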
This removes all non-test usage of
`Metadata.Builder.put(IndexMetadata.Builder)`
and replaces it with appropriate calls to the equivalent method on
`ProjectMetadata.Builder`.
In most cases this _does not_ make the code project aware, but it does
reduce the number of deprecated methods in use.
The doc values codec iterates a few times over the doc values instance that needs to be written to disk. When merging with index sorting enabled, this is much more expensive, as each time the doc values instance is iterated a merge sort is performed (in order to get the doc ids of the new segment in index-sort order).
There are several reasons why the doc values instance is iterated multiple times:
* To compute stats (num values, number of docs with a value) required for writing values to disk.
* To write the bitset that indicates which documents have a value (indexed disi, jump table).
* To write the actual values to disk.
* To write the addresses to disk (in case docs have multiple values).
This applies to numeric doc values, but also to the ordinals of sorted (set) doc values.
This PR addresses the first reason the doc values instance needs to be iterated. This is done only when merging, and only when the segments being merged also use the es87 doc values codec, the codec version is the same, and there are no deletes. Note that this optimized merge is behind a feature flag for now.
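For the first reason above, a sketch of what gathering the stats in a single pass looks like against the Lucene doc values API (illustrative; the real change wires this into the codec's merge path):
```
import java.io.IOException;
import org.apache.lucene.index.SortedNumericDocValues;
import org.apache.lucene.search.DocIdSetIterator;

class OnePassStats {
    /** Returns {numValues, numDocsWithValue} from the same pass that writes the values. */
    static long[] collect(SortedNumericDocValues values) throws IOException {
        long numValues = 0;
        long numDocsWithValue = 0;
        for (int doc = values.nextDoc(); doc != DocIdSetIterator.NO_MORE_DOCS; doc = values.nextDoc()) {
            numDocsWithValue++;
            int count = values.docValueCount();
            numValues += count;
            for (int i = 0; i < count; i++) {
                long value = values.nextValue();
                // ... write value (and the disi/jump-table entries) here, in the same pass ...
            }
        }
        return new long[] { numValues, numDocsWithValue };
    }
}
```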
Speed up the TO_IP method by converting directly from utf-8 encoded
strings to the ip encoding. Previously we did:
```
utf-8 -> String -> INetAddress -> ip encoding
```
In a step towards solving #125460, this creates three IP parsing
functions: one that rejects leading zeros, one that interprets leading
zeros as decimal numbers, and one that interprets leading zeros as octal
numbers. IPs have historically been parsed in all three of those ways.
This plugs the "rejects leading zeros" parser into `TO_IP` because
that's the behavior it had before.
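A sketch of the three leading-zero policies for a single dotted-quad octet (illustrative; the real parsers work directly on the UTF-8 bytes):
```
class LeadingZeros {
    static int parseOctet(String octet, boolean rejectLeadingZeros, boolean octal) {
        boolean hasLeadingZero = octet.length() > 1 && octet.charAt(0) == '0';
        if (rejectLeadingZeros && hasLeadingZero) {
            throw new IllegalArgumentException("leading zeros not allowed: " + octet);
        }
        int radix = octal && hasLeadingZero ? 8 : 10;    // "010" -> 8 under the octal policy
        int v = Integer.parseInt(octet, radix);          // "010" -> 10 under the decimal policy
        if (v > 255) {
            throw new IllegalArgumentException("octet out of range: " + octet);
        }
        return v;
    }
}
```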
Here is the performance:
```
Benchmark Score Error Units
leadingZerosAreDecimal 14.007 ± 0.093 ns/op
leadingZerosAreOctal 15.020 ± 0.373 ns/op
leadingZerosRejected 14.176 ± 3.861 ns/op
original 32.950 ± 1.062 ns/op
```
So the new parser takes roughly 45% of the time the original did - more than twice as fast.
Make the conversion functions that process `BytesRef`s into `BytesRefs`
keep the `OrdinalBytesRefVector`s when processing. Let's use `TO_LOWER`
as an example. First, the performance numbers:
```
(operation) Mode Score Error -> Score Error Units
to_lower 30.662 ± 6.163 -> 30.048 ± 0.479 ns/op
to_lower_ords 30.773 ± 0.370 -> 0.025 ± 0.001 ns/op
to_upper 33.552 ± 0.529 -> 35.775 ± 1.799 ns/op
to_upper_ords 35.791 ± 0.658 -> 0.027 ± 0.001 ns/op
```
The test has 8192 positions containing alternating `foo` and `bar`.
Running `TO_LOWER` via ordinals is super duper fast. No longer
`O(positions)`; now `O(unique_values)`.
Let's paint some pictures! `OrdinalBytesRefVector` is a lookup table.
Like this:
```
+-------+----------+
| bytes | ordinals |
| ----- | -------- |
| FOO | 0 |
| BAR | 1 |
| BAZ | 2 |
+-------+ 1 |
| 1 |
| 0 |
+----------+
```
That lookup table is one block. When you read it you look up the
`ordinal` and match it to the `bytes`. Previously `TO_LOWER` would
process each value one at a time and make:
```
bytes
-----
foo
bar
baz
bar
bar
foo
```
So it'd run `TO_LOWER` once per position and build a flat,
non-lookup table. With this change `TO_LOWER` will now make:
```
+-------+----------+
| bytes | ordinals |
| ----- | -------- |
| foo | 0 |
| bar | 1 |
| baz | 2 |
+-------+ 1 |
| 1 |
| 0 |
+----------+
```
We don't even have to copy the `ordinals` - we can reuse those from the
input and just bump the reference count. That's why this goes from
`O(positions)` to `O(unique_values)`.
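Here's the trick with plain arrays (the real code operates on `OrdinalBytesRefVector` and bumps a reference count rather than copying; the record here is just for illustration):
```
import java.util.Locale;

record OrdinalBlock(String[] dictionary, int[] ordinals) {}

class ToLowerOrds {
    static OrdinalBlock toLower(OrdinalBlock in) {
        String[] lowered = new String[in.dictionary().length];
        for (int i = 0; i < lowered.length; i++) {
            lowered[i] = in.dictionary()[i].toLowerCase(Locale.ROOT);   // O(unique_values) work
        }
        return new OrdinalBlock(lowered, in.ordinals());                // ordinals reused, not copied
    }
}
```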
Fix the benchmark for `EVAL`, which was failing because of a strange
logging error. The benchmarks really didn't want to run when we used
commons-logging. That's fine - we can use the ES logging facade thing. I
also added a test to the benchmarks which should run the self-tests for
`EVAL` on `gradle check`.
Speeds up the VALUES agg when collecting from many buckets.
Specifically, this speeds up the algorithm used to `finish` the
aggregation. Most specifically, this makes the algorithm more tolerant
of large numbers of groups being collected. The old algorithm was
`O(n^2)` in the number of groups. The new one is `O(n)`:
```
(groups)
1 219.683 ± 1.069 -> 223.477 ± 1.990 ms/op
1000 426.323 ± 75.963 -> 463.670 ± 7.275 ms/op
100000 36690.871 ± 4656.350 -> 7800.332 ± 2775.869 ms/op
200000 89422.113 ± 2972.606 -> 21920.288 ± 3427.962 ms/op
400000 timed out at 10 minutes -> 40051.524 ± 2011.706 ms/op
```
The `1` group version was not changed at all. That's just noise in the
measurement. The small bump in the `1000` case is almost certainly worth
it and real. The huge drop in the `100000` case is quite real.
To avoid having AggregateMapper find aggregators by name with reflection, I'm making some changes:
- Make the suppliers have methods returning the intermediate states (see the sketch after this list)
- To allow this, the suppliers' constructors won't receive the channels as params. Instead, their methods will ask for them
- Most changes in this PR are because of this
- After those changes, I'm leaving AggregateMapper in place, as it still converts AggregateFunctions to their NamedExpressions
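A sketch of the reshaped supplier contract - the type and method names below are stand-ins for illustration, not the exact compute-engine API:
```
import java.util.List;

// Stub types standing in for the real compute-engine interfaces.
interface AggregatorFunction {}
interface GroupingAggregatorFunction {}
interface DriverContext {}
record IntermediateStateDesc(String name) {}

// Channels move from the constructor to the factory methods, and the
// intermediate states can be listed without reflection or instantiation.
interface AggregatorFunctionSupplier {
    List<IntermediateStateDesc> intermediateStateDesc();
    AggregatorFunction aggregator(DriverContext ctx, List<Integer> channels);
    GroupingAggregatorFunction groupingAggregator(DriverContext ctx, List<Integer> channels);
}
```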
```
before after
(operation) Score Error Score Error Units
coalesce_2_noop 75.949 ± 3.961 -> 0.010 ± 0.001 ns/op 99.9%
coalesce_2_eager 99.299 ± 6.959 -> 4.292 ± 0.227 ns/op 95.7%
coalesce_2_lazy 113.118 ± 5.747 -> 26.746 ± 0.954 ns/op 76.4%
```
We tend to advise folks that "COALESCE is faster than CASE", but, as of
8.16.0/https://github.com/elastic/elasticsearch/pull/112295, that wasn't true. I was working with someone a few
days ago to port a scripted_metric aggregation to ESQL and we saw
COALESCE taking ~60% of the time. That won't do.
The trouble is that CASE and COALESCE have to be *lazy*, meaning that
operations like:
```
COALESCE(a, 1 / b)
```
should never emit a warning if `a` is not `null`, even if `b` is `0`. In
8.16/https://github.com/elastic/elasticsearch/pull/112295, CASE grew an optimization where it could operate non-lazily
if it was flagged as "safe". This brings a similar optimization to
COALESCE; see it above as `coalesce_2_eager`, a 95.7% improvement.
It also brings an arguably more important optimization: entire-block
execution for COALESCE. The short version is that, if the first
parameter of COALESCE returns no nulls, we can return it without doing
anything lazily. There are a few more cases, but the upshot is that
COALESCE is pretty much *free* in cases where long strings of results
are `null` or not `null`. That's the `coalesce_2_noop` line.
Finally, when there are mixed null and non-null values we were using a
single builder with some fairly inefficient paths. This specializes them
per type and skips some slow null-checking where possible. That's the
`coalesce_2_lazy` result, a more modest 76.4%.
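A plain-array sketch of the two fast paths (the real evaluators work on Blocks with reference counting, and the eager path only applies when the fallback expression is "safe", i.e. cannot warn):
```
class CoalesceSketch {
    static Long[] coalesce(Long[] first, Long[] second) {
        boolean anyNull = false;
        for (Long v : first) {
            if (v == null) {
                anyNull = true;
                break;
            }
        }
        if (anyNull == false) {
            return first;                 // the "noop" path: the whole block passes through
        }
        Long[] out = new Long[first.length];
        for (int p = 0; p < first.length; p++) {
            out[p] = first[p] != null ? first[p] : second[p];   // the "eager" per-position pick
        }
        return out;
    }
}
```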
NOTE: These percentages are improvements in COALESCE itself, or COALESCE with some low-overhead operations like `+`. If COALESCE isn't taking a *ton* of time in your query, don't get particularly excited about this. It's fun though.
Closes #119953