elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-04-19 20:57:27 -04:00

Author	SHA1	Message	Date
Lorenzo Dematté	115062c643	Fix vec_caps to test for OS support too (on x64) (#126911 ) On x64, we are testing if we support vector capabilities (1 = "basic" = AVX2, 2 = "advanced" = AVX-512) in order to enable and choose a native implementation for some vector functions, using CPUID. However, under some circumstances, this is not sufficient: the OS on which we are running also needs to support AVX/AVX2 etc; basically, it needs to acknowledge it knows about the additional register and that it is able to handle them e.g. in context switches. To do that we need to a) test if the CPU has xsave feature and b) use the xgetbv to test if the OS set it (declaring it supports AVX/AVX2/etc). In most cases this is not needed, as all modern OSes do that, but for some virtualized situations (hypervisors, emulators, etc.) all the component along the chain must support it, and in some cases this is not a given. This PR introduces a change to the x64 version of vec_caps to check for OS support too, and a warning on the Java side in case the CPU supports vector capabilities but those are not enabled at OS level. Tested by passing noxsave to my linux box kernel boot options, and ensuring that the avx flags "disappear" from /proc/cpuinfo, and we fall back to the "no native vector" case. Fixes #126809	2025-04-16 16:06:46 +02:00
Simon Cooper	1f249c74eb	Tweak the delta used for vector scorer tests (#126849 ) New panama operations in Lucene 10.2 change the results we get from vector operations slightly	2025-04-15 15:46:23 +01:00
Ignacio Vera	ffdfcec334	Upgrade to Lucene 10.2.0 (#126594 ) This commit upgrade Elasticsearch to lucene 10.2.0	2025-04-14 13:50:52 +02:00
Simon Cooper	7f1203e472	Add panama implementations of byte-bit and float-bit script operations (#124722 )	2025-03-25 13:59:11 +00:00
Simon Cooper	da9ed5ae41	Re-enable SIMD operations on JDK 24 (#125484 )	2025-03-24 13:13:15 +00:00
Simon Cooper	2ba9e9f8ed	Panama implementation of painless float-byte vector ops (#123270 )	2025-03-24 10:30:52 +00:00
Simon Cooper	82668b40f4	Add basic implementations of float-byte script comparisons (#122381 ) Add implementations of `cosineSimilarity` and `dotProduct` to query byte vector fields using float vectors	2025-03-03 09:38:37 +00:00
Rene Groeschke	ba61f8c7f7	Update Gradle wrapper to 8.12 (#118683 ) This updates the gradle wrapper to 8.12 We addressed deprecation warnings due to the update that includes: - Fix change in TestOutputEvent api - Fix deprecation in groovy syntax - Use latest ospackage plugin containing our fix - Remove project usages at execution time - Fix deprecated project references in repository-old-versions	2024-12-30 15:34:24 +01:00
Benjamin Trent	e10fc3c90d	Speed up bit compared with floats or bytes script operations (#117199 ) Instead of doing an "if" statement, which doesn't lend itself to vectorization, I switched to expand to the bits and multiply the 1s and 0s. This led to a marginal speed improvement on ARM. I expect that Panama vector could be used here to be even faster, but I didn't want to spend anymore time on this for the time being. ``` Benchmark (dims) Mode Cnt Score Error Units IpBitVectorScorerBenchmark.dotProductByteIfStatement 768 thrpt 5 2.952 ± 0.026 ops/us IpBitVectorScorerBenchmark.dotProductByteUnwrap 768 thrpt 5 4.017 ± 0.068 ops/us IpBitVectorScorerBenchmark.dotProductFloatIfStatement 768 thrpt 5 2.987 ± 0.124 ops/us IpBitVectorScorerBenchmark.dotProductFloatUnwrap 768 thrpt 5 4.726 ± 0.136 ops/us ``` Benchmark I used. https://gist.github.com/benwtrent/b0edb3975d2f03356c1a5ea84c72abc9	2024-12-03 04:19:03 +11:00
Benjamin Trent	374c88a832	Correct bit * byte and bit * float script comparisons (#117404 ) I goofed on the bit * byte and bit * float comparisons. Naturally, these should be bigendian and compare the dimensions with the binary ones appropriately. Additionally, I added a test to ensure that this is handled correctly.	2024-11-26 03:38:06 +11:00
Rene Groeschke	f6ac6e1c3b	[Build] Remove deprecated BuildParams (#116984 )	2024-11-22 16:30:57 +01:00
Rene Groeschke	13c8aaeffa	[Gradle] Remove static use of BuildParams (#115122 ) Static fields dont do well in Gradle with configuration cache enabled. - Use buildParams extension in build scripts - Keep BuildParams.ci for now for easy serverless migration - Tweak testing doc	2024-11-15 17:58:57 +01:00
Benjamin Trent	d33a03ce6b	Add support for bitwise inner-product in painless (#116082 ) This adds bitwise inner product to painless. The idea here is: - For two bit arrays, which we determine to be a byte array whose dimensions match `dense_vector.dim/8`, we simply return bitwise `&` - For a stored bit array (remember, with `dense_vector.dim/8` bytes), sum up the provided byte or float array using the bit array as a mask. This is effectively supporting asynchronous quantization. A prime example of how this works is: https://github.com/cohere-ai/BinaryVectorDB Basically, you do your initial search against the binary space and then rerank with a differently quantized vector allowing for more information without additional storage space. closes: https://github.com/elastic/elasticsearch/issues/111232	2024-11-06 09:22:04 +11:00
Ryan Ernst	e5d5c17c99	Use directory name as project name for libs (#115720 ) The libs projects are configured to all begin with `elasticsearch-`. While this is desireable for the artifacts to contain this consistent prefix, it means the project names don't match up with their directories. Additionally, it creates complexities for subproject naming that must be manually adjusted. This commit adjusts the project names for those under libs to be their directory names. The resulting artifacts for these libs are kept the same, all beginning with `elasticsearch-`.	2024-10-29 13:02:28 -07:00
Luca Cavanna	8efd08b019	Upgrade to Lucene 10 (#114741 ) The most relevant ES changes that upgrading to Lucene 10 requires are: - use the appropriate IOContext - Scorer / ScorerSupplier breaking changes - Regex automaton are no longer determinized by default - minimize moved to test classes - introduce Elasticsearch900Codec - adjust slicing code according to the added support for intra-segment concurrency - disable intra-segment concurrency in tests - adjust accessor methods for many Lucene classes that became a record - adapt to breaking changes in the analysis area Co-authored-by: Christoph Büscher <christophbuescher@posteo.de> Co-authored-by: Mayya Sharipova <mayya.sharipova@elastic.co> Co-authored-by: ChrisHegarty <chegar999@gmail.com> Co-authored-by: Brian Seeders <brian.seeders@elastic.co> Co-authored-by: Armin Braun <me@obrown.io> Co-authored-by: Panagiotis Bailis <pmpailis@gmail.com> Co-authored-by: Benjamin Trent <4357155+benwtrent@users.noreply.github.com>	2024-10-21 13:38:23 +02:00
ChrisHegarty	2a0d5ffc02	Fix simdvec gradle runtime java check	2024-10-08 09:39:21 +01:00
Rene Groeschke	d47ca34b16	Fix Gradle configuration in idea for :libs:simdvec (#114251 )	2024-10-07 18:51:15 +02:00
Chris Hegarty	7decd52132	Allow incubating Panama Vector in simdvec, and add vectorized ipByteBin (#112933 ) Add support for vectorized ipByteBin. The structure of the implementation and loading framework mirror that of Lucene, but is simplified by avoiding reflective loading since ES has support for a MRJar section for 21. For now, we just disable warnings-as-errors in this small sourceset, since -Xlint:-incubating is only support since JDK 22. The number of source files is small here. Will investigate how to assert that just the single incubating warning is emitted by javac, at a later point.	2024-10-07 15:08:23 +01:00
Chris Hegarty	32dde26e49	Upgrade to Lucene 9.12.0 (#113333 ) This commit upgrades to Lucene 9.12.0. Co-authored-by: Adrien Grand <jpountz@gmail.com> Co-authored-by: Armin Braun <me@obrown.io> Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com> Co-authored-by: Chris Hegarty <chegar999@gmail.com> Co-authored-by: John Wagster <john.wagster@elastic.co> Co-authored-by: Luca Cavanna <javanna@apache.org> Co-authored-by: Mayya Sharipova <mayya.sharipova@elastic.co>	2024-10-01 08:39:27 +01:00
Mark Vieira	a59c182f9f	Add AGPLv3 as a supported license	2024-09-13 15:29:46 -07:00
Lorenzo Dematté	0bc2b19ead	Add AVX-512 optimised vector distance functions for int7 on x64 (#109084 ) * Add vec_caps and inner implementation for AVX-512-F (without VNNI) * select FNNI function name based on vec_caps; native templated implementation for manual unrolling * Switched compiler to clang for x64, as gcc has a bug	2024-06-28 11:15:35 +02:00
Chris Hegarty	fa364bfcaf	Rename the vec module to better reflect that it provides SIMD optimized vector scorers (#109661 ) This commit renames the vector module to better reflect its intent - to provide SIMD optimized vector scorer implementations.	2024-06-17 11:10:02 +01:00

22 commits