elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-06-28 17:34:17 -04:00

Author	SHA1	Message	Date
Jan Kuipers	7f12f8009b	fix MachineDependentHeapTests (#129699 )	2025-06-19 14:07:39 +02:00
Jan Kuipers	087747b363	Include direct memory and non-heap memory in ML memory calculations (take #2 ) (#128742 ) * Include direct memory and non-heap memory in ML memory calculations. * Reduce ML_ONLY heap size, so that direct memory is accounted for. * [CI] Auto commit changes from spotless * changelog * improve docs * Reuse direct memory to heap factor * feature flag --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>	2025-06-10 13:36:42 +02:00
Chris Hegarty	19550a838f	Add dense vector off-heap stats to Node stats and Index stats APIs (#126704 ) This change enhances the dense_vector section of the Nodes stats and Index stats APIs so that they report the desired size of off-heap memory for all indexed vectors. The dense_vector section of the Custer stats API remains unchanged. The retrieval mechanism and structure of the new stats is the same across the various three stats APIs, but more fine-grained information is disclosed as when moving from Cluster -> Node -> Index API. For Node stats, we aggregate the total byte sizes for all vectors, categorised by the data type. For example: "dense_vector" : { "value_count" : 5, "off_heap" : { "total_size_in_bytes" : 27, "total_veb_size_in_bytes" : 3, "total_vec_size_in_bytes" : 23, "total_veq_size_in_bytes" : 0, "total_vex_size_in_bytes" : 1 } } Index stats: same as Node stats with included field break down . For example: "dense_vector" : { "value_count" : 5, "off_heap" : { "total_size_in_bytes" : 27, "total_veb_size_in_bytes" : 3, "total_vec_size_in_bytes" : 23, "total_veq_size_in_bytes" : 0, "total_vex_size_in_bytes" : 1, "fielddata" : { "bar" : { "veb_size_in_bytes" : 3, "vec_size_in_bytes" : 14, "vex_size_in_bytes" : 1 }, "foo" : { "vec_size_in_bytes" : 9 } } } The implementation accesses the actual statistics through reflection. This will be completely removed when Lucene exposes this, which is expected in Lucene 10.3	2025-04-23 15:04:44 +01:00
Ryan Ernst	b5e92db171	Remove security manager from tests (#127087 ) Now that entitlements are always used, there is no need to run tests with security manager (a future enhancement will run tests with entitlements). This commit removes setting up security manager from tests.	2025-04-22 18:08:09 +02:00
Ryan Ernst	42dc870ece	Ensure logs dir exists before using as working dir (#126566 ) With the change to using the logs dir as the working dir of the Elasticsearch process we need to ensure the logs dir exists within the CLI instead of later during startup. relates #124966	2025-04-17 12:59:47 -07:00
Ryan Ernst	3bac50e818	Use logs dir as working directory (#124966 ) In the unexpected case that Elasticsearch dies due to a segfault or other similar native issue, a core dump is useful in diagnosing the problem. Yet core dumps are written to the working directory, which is read-only for most installations of Elasticsearch. This commit changes the working directory to the logs dir which should always be writeable.	2025-04-09 07:07:11 -07:00
Lorenzo Dematté	a4d7297944	Permanently switch from SecurityManager to Entitlements (#124865 ) (#125117 ) The JDK team has completely disabled the Java SecurityManager from Java 24. Elasticsearch has always used the Java SecurityManager as an additional protection mechanism; in order to retain this second line of defense, the Elasticsearch Core/Infra team has been working on the Entitlements project. Similar to SecurityManager, Entitlements only allow calling specific methods in the JDK when the caller has a matching policy attached. In other words, if some code (in the main Elasticsearch codebase, in a plugin/module, or in a script) attempts to perform a "privileged" operation and it is not entitled to do so, a NotEntitledException will be thrown. This PR includes the minimal set of changes to always use Entitlements, regardless of system properties or Java version. Relates to ES-10921	2025-03-18 18:38:45 +02:00
Ryan Ernst	11961730da	Cleanup command line setting errors (#124963 ) This commit improves the error cases when command line settings are found that are duplicates or conflict with special system properties.	2025-03-18 09:38:14 -07:00
Ryan Ernst	789eb2fadb	Log system information early during startup (#124926 ) Information about the system, and in particular the running JVM, is logged relatively late in startup. When problems occur in low level initialization, it is often useful to have this information. This commit moves logging of system information to right after logging has been initialized. Note that it no longer utilizes the JvmInfo class as most was already easily available and the static init of JvmInfo does a lot.	2025-03-15 14:55:02 +00:00
Ryan Ernst	47454df24e	Ensure we always use vectorization in Lucene (#124914 ) By default Lucene only uses vectorization up to the JDK version available at the time of its release. When testing against EA versions, we want to still use vectorization. This commit adds a sysprop to force lucene to use the current version for vectorization.	2025-03-14 20:08:15 +00:00
Mark Vieira	d61b86477d	Add jdk.management.agent module to server boot layer on start (#123938 )	2025-03-04 11:32:42 -08:00
Arpad Kiraly	196e083502	Remove @UpdateForV9 usages that are related to security manager removal (#123690 ) * Remove @UpdateForV9 annotation in RuntimeVersionFeature Re ES-10338 the Core/Infra team agreed that this is not blocked on security manager removal. We should go ahead and remove these 2 instances of the @UpdateForV9 annotation, the code will be cleaned up soon with the removal of Security Manager. * Remove @UpdateForV9 annotation from SystemJvmOptions re ES-10338 the Core/Infra team agreed this is not blocked on security manager removal. We should go ahead and remove this instance of the @UpdateForV9 annotation.	2025-03-03 11:34:46 +01:00
Ryan Ernst	b5e482bc91	Enable entitlements by default (#122907 ) Entitlements are almost complete. This commit enables them by default, in preparation for 8.18/9.0.	2025-02-25 09:58:39 -08:00
Lorenzo Dematté	6d042511a1	Create test entitlement libs dir and jar to pass validation (#123013 )	2025-02-20 13:34:19 -05:00
Lorenzo Dematté	a4c7d29675	[Entitlements] Instrumentation of NIO file channels (#122816 )	2025-02-19 23:18:07 +01:00
Ryan Ernst	0cf42f2388	Rename environment dir accessors (#121803 ) The node environment has many paths. The accessors for these currently use a "file" suffix, but they are always directories. This commit renames the accessors to make it clear these paths are directories.	2025-02-05 14:03:28 -08:00
Moritz Mack	6648a03bab	Revert "Revert "Remove deprecated tracing.apm.* settings for v9 (#119926 )"" (#120895 ) This reverts commit `5f44911ef8`. Reverts revert in #120268 after blockers have been resolved. Relates to ES-10293	2025-01-27 16:04:36 +01:00
Lorenzo Dematté	81a9348431	[Entitlements] Enable native access based on policies (#120638 )	2025-01-24 08:29:38 +01:00
Lorenzo Dematté	cbb7c24ada	[Entitlements] Network access checks for miscellanea (#120262 )	2025-01-17 09:48:22 +01:00
Moritz Mack	5f44911ef8	Revert "Remove deprecated tracing.apm.* settings for v9 (#119926 )" (#120268 ) Temporarily reverts elastic/elasticsearch#119926 due to unforeseen usage. Relates to ES-10293	2025-01-16 22:47:49 +11:00
Moritz Mack	5123b948a9	Remove deprecated tracing.apm.* settings for v9 (#119926 )	2025-01-14 08:41:11 +01:00
Ryan Ernst	38b2fc53a7	Default to entitlements only for JDK 24+ (#119885 ) Since entitlements are still being developed, we are not yet ready to enable them for all JDK versions. But we must use them for JDK 24+ since the security manager is not useable there. This commit tweaks the logic for determining whether to use entitlements to take into account the runtime version.	2025-01-10 09:19:39 -08:00
Lorenzo Dematté	28ce53f0aa	[Entitlements] Fix "No SecurityManager when entitlements are enabled" (#119742 )	2025-01-08 15:17:41 +01:00
Lorenzo Dematté	3464adb3ae	Adjust Bootstrap and JVM options to ensure the SM is never used when entitlements are enabled (#119689 )	2025-01-08 10:24:03 +01:00
Patrick Doyle	6484f94614	Entitlements for JDK-wide global state changes (#119592 ) * Refactor: separate check method name vs signature parsing * Cosmetic: change checker comment format * Entitlements for JDK-wide global state * [CI] Auto commit changes from spotless * Comment explaining entitlement add-exports * @SuppressForbidden * Refactor: rename dummy subclases --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>	2025-01-08 01:12:27 +00:00
Lorenzo Dematté	ba9e0cef4d	Do not try to enable SecurityManager on JDK 24 (#117999 )	2024-12-10 18:00:58 +01:00
Patrick Doyle	338c0538b7	Dynamic entitlement agent (#116125 ) * Refactor: treat "maybe" JVM options uniformly * WIP * Get entitlement running with bridge all the way through, with qualified exports * Cosmetic changes to SystemJvmOptions * Disable entitlements by default * Bridge module comments * Fixup forbidden APIs * spotless * Rename EntitlementChecker * Fixup InstrumenterTests * exclude recursive dep * Fix some compliance stuff * Rename asm-provider * Stop using bridge in InstrumenterTests * Generalize readme for asm-provider * InstrumenterTests doesn't need EntitlementCheckerHandle * Better javadoc * Call parseBoolean * Add entitlement to internal module list * Docs as requested by Lorenzo * Changes from Jack * Rename ElasticsearchEntitlementChecker * Remove logging javadoc * exportInitializationToAgent should reference EntitlementInitialization, not EntitlementBootstrap. They're currently in the same module, but if that ever changes, this code would have become wrong. * Some suggestions from Mark --------- Co-authored-by: Ryan Ernst <ryan@iernst.net>	2024-11-06 00:07:52 +01:00
Ryan Ernst	e5d5c17c99	Use directory name as project name for libs (#115720 ) The libs projects are configured to all begin with `elasticsearch-`. While this is desireable for the artifacts to contain this consistent prefix, it means the project names don't match up with their directories. Additionally, it creates complexities for subproject naming that must be manually adjusted. This commit adjusts the project names for those under libs to be their directory names. The resulting artifacts for these libs are kept the same, all beginning with `elasticsearch-`.	2024-10-29 13:02:28 -07:00
Simon Cooper	f9aa6f40cd	Always use CLDR locale on ES v9 (#113184 ) Regardless of JDK version, ES should always use CLDR locale database from 9.0.0. This also removes IsoCalendarDataProvider used to override week-date calculations for the root locale only.	2024-09-23 11:05:08 +01:00
Mark Vieira	a59c182f9f	Add AGPLv3 as a supported license	2024-09-13 15:29:46 -07:00
Simon Cooper	a36d90cf34	Use CLDR locale provider on JDK 23+ (#110222 ) JDK 23 removes the COMPAT locale provider, leaving CLDR as the only option. This commit configures Elasticsearch to use the CLDR provider when on JDK 23, but still use the existing COMPAT provider when on JDK 22 and below. This causes some differences in locale behaviour; this also adapts various tests to still work whether run on COMPAT or CLDR.	2024-09-04 13:42:40 +01:00
Ryan Ernst	84bd31eb8e	Fix shutdown race condition in server start (#112300 ) When the server CLI is preparing to start Elasticsearch, it may receive a SIGTERM. That signal causes the close method of the CLI to be invoked, which checks for a non-null server process, and stops it if it exists. Since shutdown occurs in a different thread, it is possible close is invoked, sees the server process is null, so skips calling close, but then before close finishes the server process is started. Normally the above case is ok; the close method will cause Java to exit, and the child process will be sent a SIGTERM. However, in the case of serverless, close is wrapped so that the CLI can wait on uploading possible heapdumps. This presents the possibility that close thinks the child process has been stopped, but it hasn't, so the wrapped close waits indefinitely on the child process that was started up during the race. This commit guards against the startup race by locking the critical sections that create the child process and stop it. Either the child process is created first, and will then be stopped immediately, or close is invoked first, marking that the CLI is shutting down, and creating the child process will be skipped.	2024-08-29 10:20:43 -07:00
Ryan Ernst	0aa4758f02	Stop setting java.library.path (#112119 ) Native libraries in Java are loaded by calling System.loadLibrary. This method inspects paths in the java.library.path to find the requested library. Elasticsearch previously used this to find libsystemd, but now the only remaining use is to set the additional platform directory in which Elasticsearch keeps its own native libraries. One issue with setting java.library.path is that its not set for the cli process, which makes loading the native library infrastructure from clis difficult. This commit reworks how Elasticsearch native libraries are found in order to avoid needing to set java.library.path. There are two cases. The simplest is production, where the working directory is the Elasticsearch installation directory, so the platform specific directory can be constructed. The second case is for tests where we don't have an installtion. We already pass in java.library.path there, so this change renames the system property to be a test specific property that the new loading infrastructure looks for.	2024-08-23 11:16:18 -07:00
Ryan Ernst	8417d3f141	Move preallocate functionality to native access (#110678 ) This commit moves the file preallocation functionality into NativeAccess. The code is basically the same. One small tweak is that instead of breaking Java access boundaries in order to get an open file handle, the new code uses posix open directly. relates #104876	2024-07-11 09:42:44 -07:00
Chris Hegarty	6b62c5129d	Replace use of reflection with MemorySegmentAccessInput (#109061 ) This commit replaces the use of reflection with the newly added MemorySegmentAccessInput.	2024-05-27 17:49:49 +01:00
elasticsearchmachine	0ce5dadc6b	Merge remote-tracking branch 'origin/main' into lucene_snapshot	2024-05-15 10:02:23 +00:00
Ryan Ernst	db70623442	Fix G1 JDK bug workaround (#108641 ) The workaround requires to JDK args, but SystemJvmOptions actually operates on individual JDK args. This commit adjusts SystemJvmOptions to allow adding sets of jdk args together.	2024-05-14 19:42:35 -04:00
Ryan Ernst	b2bc951ac5	Workaround G1 bug for JDK 22 and 22.0.1 (#108571 ) See https://bugs.openjdk.org/browse/JDK-8329528. The applied workaround was suggested on the linked issue, and was tested and confirmed to avoid the G1 bug.	2024-05-14 06:17:14 -07:00
elasticsearchmachine	1d9cbbcded	Merge remote-tracking branch 'origin/main' into lucene_snapshot	2024-05-08 10:01:34 +00:00
Moritz Mack	4c263e2ab4	CLI launcher / terminal improvements (#106470 ) This removes getWriter() and getErrorWriter() from Terminal in order to prepare for using log4j in server-cli. Additionally, ensure IO failures on the pump thread are handled once completed (closed) and catch throwables in Command#main to print them (formatted) via Terminal.	2024-05-07 16:09:52 +02:00
elasticsearchmachine	e42f38c5c7	Merge remote-tracking branch 'origin/main' into lucene_snapshot	2024-05-02 10:01:20 +00:00
Ryan Ernst	a6e93c8fc6	Minor cleanups to auto heap (#107961 ) Just a couple minor things: expose constants to subclass of auto heap and removed unused test method	2024-05-01 18:54:52 -07:00
Chris Hegarty	1e233a993c	Enable native access for Lucene madvise	2024-04-29 11:35:32 +01:00
Ryan Ernst	d7e524fcf9	Make auto heap configuration configurable in server cli subclasses (#107919 ) This commit makes auto heap configuration extendible so that serverless can tweak the configuration based on project settings.	2024-04-26 08:08:24 -07:00
Ryan Ernst	7402f93839	Expose server process to server cli subclasses (#107729 ) This commit gives protected access to subclasses of ServerCli (ie for tests or serverless).	2024-04-23 17:17:34 -07:00
Chris Hegarty	6b52d7837b	Add an optimised int8 vector distance function for aarch64. (#106133 ) This commit adds an optimised int8 vector distance implementation for aarch64. Additional platforms like, say, x64, will be added as a follow-up. The vector distance implementation outperforms Lucene's Pamana Vector implementation for binary comparisons by approx 5x (depending on the number of dimensions). It does so by means of compiler intrinsics built into a separate native library and link by Panama's FFI. Comparisons are performed on off-heap mmap'ed vector data. The implementation is currently only used during merging of scalar quantized segments, through a custom format ES814HnswScalarQuantizedVectorsFormat, but its usage will likely be expanded over time. Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com> Co-authored-by: Lorenzo Dematté <lorenzo.dematte@elastic.co> Co-authored-by: Mark Vieira <portugee@gmail.com> Co-authored-by: Ryan Ernst <ryan@iernst.net>	2024-04-12 08:44:21 +01:00
Ryan Ernst	4114eea1a2	Allow force stopping server process (#107170 ) This commit allows the cli access to sending SIGKILL to the underlying jvm process.	2024-04-10 20:38:22 -04:00
Ryan Ernst	f5a7d25f06	Handle x86_64 os.arch for native libraries (#107289 ) On some systems Java appears to return amd64 (even if not an amd processor), but on others it returns x86_64. This commit handles the latter case to correctly associate the arch with the appropriate platform dir.	2024-04-10 14:51:52 -07:00
Ryan Ernst	e39fd58c5d	Fix jvm ergonomics tests (#106969 ) closes #106554	2024-04-02 13:40:33 -07:00
David Turner	751742422a	AwaitsFix for #106554	2024-03-20 14:22:38 +00:00

1 2 3

111 commits