27 commits

Author SHA1 Message Date
Ievgen Degtiarenko
12299b89d0
Troubleshooting unbalanced cluster docs (#105774)
This adds an initial page explaining the balancing approach,
with steps to troubleshoot an unbalanced cluster.
2024-03-14 14:10:13 +01:00
Niels Bauman
64891011d3
Extend repository_integrity health indicator for unknown and invalid repos (#104614)
This PR extends the repository integrity health indicator to also cover unknown and invalid repositories. Because these errors are local to a node, we extend the `LocalHealthMonitor` to monitor the repositories and report the changes in their health regarding the unknown or invalid status.
To simplify this extension in the future, we introduce the `HealthTracker` abstract class that can be used to create new local health checks.
Furthermore, we change the severity of the health status when the repository integrity indicator reports unhealthy from `RED` to `YELLOW` because even though this is a serious issue, there is no user impact yet.
2024-02-07 15:18:55 +01:00
David Turner
a3c2b2d611
Add stable master indicator troubleshooting links (#104395)
Adds links from the stable master health indicator to the relevant
troubleshooting docs, as well as making the "contact support" link a
versioned link directly to the right subsection of the troubleshooting
docs page.
2024-01-16 15:22:33 -05:00
James Rodewig
255c9a7f95
[DOCS] Move x-pack docs to docs/reference dir (#99209)
**Problem:**
For historical reasons, source files for the Elasticsearch Guide's security, watcher, and Logstash API docs are housed in the `x-pack/docs` directory. This can confuse new contributors who expect Elasticsearch Guide docs to be located in `docs/reference`. 

**Solution:**
- Move the security, watcher, and Logstash API doc source files to the `docs/reference` directory
- Update doc snippet tests to use security

Rel: https://github.com/elastic/platform-docs-team/issues/208
2023-09-12 14:53:41 -04:00
Abdon Pijpelink
af76a3a436
[DOCS] Add 'Troubleshooting an unstable cluster' to nav (#99287)
* [DOCS] Add 'Troubleshooting an unstable cluster' to nav

* Adjust docs links in code

* Revert "Adjust docs links in code"

This reverts commit f3846b1d78.

---------

Co-authored-by: David Turner <david.turner@elastic.co>
2023-09-08 13:42:50 +02:00
Abdon Pijpelink
0421c4fe9b
[DOCS] Remote cluster troubleshooting guide (#99128)
* [DOCS] Remote cluster troubleshooting guide

* Fix test failures

* Apply suggestions from code review

Co-authored-by: Yang Wang <ywangd@gmail.com>

* Review feedback

* Group issues under 'common' and 'API key'

* Apply suggestions from code review

Co-authored-by: Yang Wang <ywangd@gmail.com>

---------

Co-authored-by: Yang Wang <ywangd@gmail.com>
2023-09-05 15:10:45 +02:00
David Turner
7a517cb4a0
Add note on jstack frequency for troubleshooting (#95764)
Suggest calling `jstack` every 15s to ensure that at least one capture
shows a stuck thread. Also adds a link to this guide to the list on the
troubleshooting overview page.
2023-05-03 10:04:13 +01:00
Pablo Alcantar Morales
253fe6325d
Add shards capacity troubleshooting guide (#95208)
2023-04-19 09:24:07 +02:00
Abdon Pijpelink
2808512397
[DOCS] Improve watermark troubleshooting documentation (#94222)
2023-03-01 14:34:14 +01:00
Mary Gouseti
cfd23d512f
Disk indicator troubleshooting guides (#90504)
2022-10-14 15:24:21 +02:00
James Baiera
db73aa0498
Add repeated snapshot failure troubleshooting guide (#89762)
This troubleshooting guide is what will be returned from the SLM health indicator
when an SLM policy has suffered too many repeated failures without a successful
execution.
2022-09-15 17:01:32 -04:00
Abdon Pijpelink
346f7848e6
[DOCS] Add troubleshooting searches guide (#89583)
* [DOCS] Adds troubleshooting searches guide

* Additional troubleshooting steps

* Apply review suggestions

* Replace separate _cat aliases/indices requests with one get indices call

* Reorder steps to move field caps forward

* Add note about ignore_unavailable

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2022-09-08 14:30:21 +02:00
Leaf-Lin
942e5fd9fc
Adding specific items into troubleshooting guide (#88105)
* Update troubleshooting.asciidoc

Adding items into the troubleshooting guide

* Resolve conflicts

* Reorganizes troubleshooting links

Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>
2022-08-03 17:00:34 +02:00
David Turner
7103053f03
Add troubleshooting docs about data corruption (#88760)
Adds some docs giving more detailed background about what data
corruption really means and some suggestions about how to narrow down
the root cause.

Co-authored-by: Henning Andersen <33268011+henningandersen@users.noreply.github.com>
2022-07-28 11:23:23 +01:00
Mary Gouseti
89903bbe23
Troubleshooting docs for ACTION_RESTORE_FROM_SNAPSHOT (#87692)
Troubleshooting guide to restore indices and data streams that have
missing data from a snapshot.

This will be associated with the user action
`ACTION_RESTORE_FROM_SNAPSHOT`.

Preview link:
https://elasticsearch_87692.docs-preview.app.elstc.co/guide/en/elasticsearch/reference/master/restore-from-snapshot.html
2022-07-27 23:37:08 +09:30
Andrei Dan
f3431e1bff
Add troubleshooting guide for corrupt repository (#88391)
Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2022-07-14 13:37:02 +01:00
Leaf-Lin
945cb27782
[DOCS] Adding discovery troubleshooting link in the master get help page (#87344)
* Adding discovery troubleshooting link

* Add tags to pull in discovery troubleshooting content

* Move discovery troubleshooting to separate page and add redirects

Co-authored-by: Adam Locke <adam.locke@elastic.co>
2022-07-06 15:51:43 -04:00
Andrei Dan
6e98072db5
Add start slm user action (#87854)
This creates a user action for the SLM health indicator that will help
the user to start SLM.
2022-06-23 11:04:45 +01:00
Andrei Dan
a4e7064b0e
Create ILM not running user action (#87852)
This creates a user action for the ILM health indicator that will help
the user to start ILM.
2022-06-23 09:54:31 +01:00
Andrei Dan
2ec4a9e006
Add troubleshooting doc for missing tier (#87526)
Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2022-06-17 12:24:33 +01:00
Leaf-Lin
ed0fb8b819
Move fix common cluster issues to troubleshooting (#87440)
* Move fix common cluster issues to troubleshooting

* Include fix common cluster issues in the troubleshooting doc

* [DOCS] Remove extra include from How-To

Co-authored-by: Deb Adair <debadair@elastic.co>
2022-06-13 17:16:17 -07:00
Keith Massey
6caf39c109
How to increase node capacity docs (#87188)
This adds troubleshooting documentation for the case when the ShardsAvailabilityHealthIndicatorService
reports that there are not enough nodes in the data tier (user action "increase_node_capacity_for_allocations" or
"increase_tier_capacity_for_allocations_"). This covers both cloud and self-managed environments. For
cloud, we first recommend increasing the number of availability zones (because you cannot directly add nodes), and
decreasing index.number_of_replicas if that is not possible. For self-managed, we first recommend adding nodes,
and decreasing index.number_of_replicas if that is not possible.
2022-06-08 14:06:47 -05:00
Leaf-Lin
ad0228fd2f
[DOCS] Adding Getting Help section to troubleshooting docs (#87095)
* Adding Getting Help section

Add a getting help section to the troubleshooting guide that the health API can point to when issues are too complicated to address.

This is taken from https://www.elastic.co/guide/en/cloud/current/ec-get-help.html, someone might want to elaborate it a bit more?

* Fix broken partintro, modify headings, and update wording

Co-authored-by: Adam Locke <adam.locke@elastic.co>
2022-05-25 15:58:41 -04:00
Andrei Dan
08b323131f
Troubleshooting guides for disabled allocations (#86789)
This adds the troubleshooting guides for when index and cluster allocations are
disabled.

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2022-05-24 10:27:15 +01:00
Andrei Dan
20802a9f66
Add migrate to tiers troubleshooting doc (#86738)
This adds a troubleshooting doc for indices that mix index filtering allocation
with data tiers routing.

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2022-05-24 10:12:28 +01:00
Andrei Dan
490f417efd
Troubleshooting guide for diagnosing unassigned shards (#86996)
Co-authored-by: Leaf-Lin <39002973+Leaf-Lin@users.noreply.github.com>
Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2022-05-24 09:56:23 +01:00
Andrei Dan
21785c9a77
How-to docs for increasing the total number of shards per node (#86214)
Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
Co-authored-by: Leaf-Lin <39002973+Leaf-Lin@users.noreply.github.com>
2022-05-10 09:13:27 +01:00