fix: do not let _resolve/cluster hang if remote is unresponsive (#119516) (#119528)

* fix: do not let `_resolve/cluster` hang if remote is unresponsive Previously, `_resolve/cluster` would wait for a response from a remote as part of the connection strategy. If the remote were to be unresponsive, this API would wait until `netty` would terminate the connection with a handshake exception. The threshold for terminating the connection is `10s`. This means that the API would wait for `10s` before determining that the remote is unresponsive. This strategy is now replaced with a fail fast where a response is sent back to the user immediately rather than waiting for a connection termination. * Update docs/changelog/119516.yaml
2025-07-15 10:13:33 -04:00 · 2025-01-03 17:45:12 +00:00 · 2025-01-03 17:45:12 +00:00 · d41813c99e
commit d41813c99e
parent 46ec08f2de
2 changed files with 6 additions and 1 deletions
--- a/docs/changelog/119516.yaml
+++ b/docs/changelog/119516.yaml
@ -0,0 +1,5 @@
 pr: 119516
 summary: "Fix: do not let `_resolve/cluster` hang if remote is unresponsive"
 area: Search
 type: bug
 issues: []
--- a/server/src/main/java/org/elasticsearch/action/admin/indices/resolve/TransportResolveClusterAction.java
+++ b/server/src/main/java/org/elasticsearch/action/admin/indices/resolve/TransportResolveClusterAction.java
@ -141,7 +141,7 @@ public class TransportResolveClusterAction extends HandledTransportAction<Resolv
                RemoteClusterClient remoteClusterClient = remoteClusterService.getRemoteClusterClient(
                    clusterAlias,
                    searchCoordinationExecutor,
-                    RemoteClusterService.DisconnectedStrategy.RECONNECT_IF_DISCONNECTED
+                    RemoteClusterService.DisconnectedStrategy.FAIL_IF_DISCONNECTED
                );
                var remoteRequest = new ResolveClusterActionRequest(originalIndices.indices(), request.indicesOptions());
                // allow cancellation requests to propagate to remote clusters