Increase node up timeout in AbstractLocalClusterFactory (#129639)

Over the last two months, many tests were converted to use the newer REST test framework. Some tests start 1 node, others start 3 nodes, and others even more. The framework runs tests in parallel, but it does not know how many nodes each test needs, so running 3 tests in parallel, for example, produces a very different load depending on whether they use single-node or 3-node clusters. During these runs we saw 3x more CPU load than we would ideally want.

Currently there is no good solution for this: if we dial down the concurrency, we will use the nodes inefficiently, but if we keep the concurrency where it is, we risk longer startup times. Since the startup time of Elasticsearch is not what these tests are checking, we choose to increase the timeout to reduce the noise.
Mary Gouseti 2025-06-19 17:37:37 +03:00 committed by GitHub
parent 34ccaba56d
commit ee5d652411

@@ -73,7 +73,7 @@ public abstract class AbstractLocalClusterFactory<S extends LocalClusterSpec, H
implements
LocalClusterFactory<S, H> {
private static final Logger LOGGER = LogManager.getLogger(AbstractLocalClusterFactory.class);
-private static final Duration NODE_UP_TIMEOUT = Duration.ofMinutes(3);
+private static final Duration NODE_UP_TIMEOUT = Duration.ofMinutes(6);
private static final Map<Pair<Version, DistributionType>, DistributionDescriptor> TEST_DISTRIBUTIONS = new ConcurrentHashMap<>();
private static final String TESTS_CLUSTER_MODULES_PATH_SYSPROP = "tests.cluster.modules.path";
private static final String TESTS_CLUSTER_PLUGINS_PATH_SYSPROP = "tests.cluster.plugins.path";
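
For readers unfamiliar with how a constant like `NODE_UP_TIMEOUT` usually takes effect, here is a minimal sketch of a deadline-based polling wait. It is not the code in `AbstractLocalClusterFactory`: the class `NodeUpWaitSketch`, the method `waitUntil`, the poll interval, and the simulated readiness check are all hypothetical, and only the 6-minute value comes from this commit.

```java
import java.time.Duration;
import java.util.function.BooleanSupplier;

// Hypothetical illustration; not the actual Elasticsearch test-cluster code.
public class NodeUpWaitSketch {
    // Mirrors the value introduced by this commit.
    static final Duration NODE_UP_TIMEOUT = Duration.ofMinutes(6);

    /**
     * Polls the readiness check until it returns true or the timeout elapses.
     * Returns true if the check succeeded before the deadline, false otherwise.
     */
    static boolean waitUntil(BooleanSupplier isUp, Duration timeout, Duration pollInterval)
        throws InterruptedException {
        long deadline = System.nanoTime() + timeout.toNanos();
        while (true) {
            if (isUp.getAsBoolean()) {
                return true;
            }
            long remainingNanos = deadline - System.nanoTime();
            if (remainingNanos <= 0) {
                return false;
            }
            // Sleep for the poll interval, but never past the deadline.
            long remainingMillis = Math.max(1, remainingNanos / 1_000_000);
            Thread.sleep(Math.min(pollInterval.toMillis(), remainingMillis));
        }
    }

    public static void main(String[] args) throws InterruptedException {
        long start = System.nanoTime();
        // Simulated node (an assumption) that reports ready after roughly 200 ms.
        BooleanSupplier simulatedNode = () -> System.nanoTime() - start > 200_000_000L;
        boolean up = waitUntil(simulatedNode, NODE_UP_TIMEOUT, Duration.ofMillis(50));
        System.out.println("node up: " + up);
    }
}
```

In a loop of this shape, a larger timeout only moves the deadline; a node that starts quickly is still detected on the next poll, so raising the limit from 3 to 6 minutes would not slow down runs on a lightly loaded machine.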