elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-04-25 15:47:23 -04:00

Author	SHA1	Message	Date
David Turner	460a19fc27	Expand docs about max-shards-per-node (#105607 ) (#105643 ) Adds a little more detail on what sorts of problems may occur if you exceed the default limits.	2024-02-20 04:01:16 -05:00
Abdon Pijpelink	7b37d4242e	[DOCS] Mention that vector quantization increases disk usage (#104509 )	2024-01-18 14:01:07 +01:00
Abdon Pijpelink	ea4b6fd3ea	[DOCS] Change order on 'tune knn' page (#104036 )	2024-01-08 12:04:39 +01:00
Abdon Pijpelink	7d1c342883	[DOCS] Stop recommending dot_product over cosine similarity (#103856 )	2024-01-03 14:37:21 +01:00
Benjamin Trent	f00364aefd	Add byte quantization for float vectors in HNSW (#102093 ) Adds new `quantization_options` to `dense_vector`. This allows for vectors to be automatically quantized to `byte` when indexed. Example: ``` PUT vectors { "mappings": { "properties": { "my_vector": { "type": "dense_vector", "index": true, "index_options": { "type": "int8_hnsw" } } } } } ``` When querying, the query vector is automatically quantized and used when querying the HNSW graph. This reduces the memory required to only `25%` of what was previously required for `float` vectors at a slight loss of accuracy. This is currently only available when `index: true` and when using `hnsw`	2023-11-29 12:29:55 -05:00
James Rodewig	3a91763d27	[DOCS] Deprecate rollups (#101265 )	2023-10-25 16:52:25 -04:00
Mayya Sharipova	b582276dd6	Update kNN search guide with knn parallelization (#100705 ) Relates to PR #98204	2023-10-11 15:31:03 -04:00
Benjamin Trent	83b70e37ef	Revert "Auto-normalize dot_product vectors at index & query (#98944 )" (#99421 ) This reverts commit `7b9c367aeb`.	2023-09-11 09:33:17 -04:00
Benjamin Trent	7b9c367aeb	Auto-normalize dot_product vectors at index & query (#98944 ) `dot_product` requires vectors to be unit-length. Previously, we would check that vectors were unit-length and throw if they were not. Instead, we will now auto-normalize vectors as they are indexed. `cosine` will continue to behave as usual, not normalizing the vectors. closes: https://github.com/elastic/elasticsearch/issues/98935	2023-08-30 09:50:49 -04:00
David Turner	60935c68cc	Adjust sizing guidance re. doc count (#97831 ) In #87246 we describe some reasons why it's a good idea to limit the doc count of a shard, and we started to do so in #94065, so this commit adjusts the sizing guidance docs to match.	2023-07-20 14:56:52 +01:00
David Turner	ddd4ba5e30	Fix docs for explaining unassigned shards (#97538 ) Today the `current_node` parameter is given in several sample requests illustrating how to explain an unassigned shard using the cluster allocation explain API. This doesn't make sense, an unassigned shard has no `current_node`. This commit removes the misleading parameter in these cases.	2023-07-11 08:01:12 +01:00
Mayya Sharipova	b366935df8	Add file extensions for vector search for preload (#96955 ) In this tuning guide we mentioned preload to warm up the filesystem cache, but we did not provide file extensions used in vector search. This adds these extensions.	2023-06-20 13:52:51 -04:00
David Turner	846d640ddf	Suggest capturing a heap dump to diagnose high heap (#96526 ) The `high-jvm-memory-pressure.html` troubleshooting docs give some suggestions, but vitally they omit the advice to capture a heap dump which is what we really need users to do if they want to understand their high heap usage. This commit adds a note to the docs to that effect.	2023-06-02 09:43:52 -04:00
debadair	777598d602	[DOCS] Remove redirect pages (#88738 ) * [DOCS] Remove manual redirects * [DOCS] Removed refs to modules-discovery-hosts-providers * [DOCS] Fixed broken internal refs * Fixing bad cross links in ES book, and adding redirects.asciidoc[] back into docs/reference/index.asciidoc. * Update docs/reference/search/point-in-time-api.asciidoc Co-authored-by: James Rodewig <james.rodewig@elastic.co> * Update docs/reference/setup/restart-cluster.asciidoc Co-authored-by: James Rodewig <james.rodewig@elastic.co> * Update docs/reference/sql/endpoints/translate.asciidoc Co-authored-by: James Rodewig <james.rodewig@elastic.co> * Update docs/reference/snapshot-restore/restore-snapshot.asciidoc Co-authored-by: James Rodewig <james.rodewig@elastic.co> * Update repository-azure.asciidoc * Update node-tool.asciidoc * Update repository-azure.asciidoc --------- Co-authored-by: amyjtechwriter <61687663+amyjtechwriter@users.noreply.github.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: Amy Jonsson <amy.jonsson@elastic.co> Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2023-05-24 12:32:46 +01:00
Stef Nestor	4c5a3fb4da	[+Doc] Troubleshooting / Hot Spotting (#95429 ) * [+Doc] Troubleshooting / Hot Spotting --------- Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>	2023-04-26 12:29:47 -06:00
Jim Ferenczi	57cbbb3fcd	Minor ann docs update (#94783 ) Replace the link to the deprecated knn search API and added a link to the nightly benchmarks in Rally.	2023-03-31 17:59:25 +01:00
Benjamin Trent	e8c5ed46c6	Fixing our docs for vector sizing calculation (#93703 )	2023-02-13 07:52:53 -05:00
Luca Belluccini	7c5b6483a1	[DOCS] Typo in Search speed (#91934 ) * [DOCS] Typo in Search speed The PR https://github.com/elastic/elasticsearch/pull/89782 introduced some broken tags to leak in the text * Fix tags * Make all headings discrete Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>	2022-11-28 13:55:47 +01:00
Julie Tibshirani	1b249639f1	Remove experimental marking from kNN search (#91065 ) This commit removes the experimental tag from kNN search docs and makes some docs improvements: * Add a prominent warning about memory usage in the kNN search guide * Link to the performance tuning guide from the main guide * Clarify the memory requirements section in the tuning guide	2022-10-27 18:00:56 +02:00
Julie Tibshirani	f4038b3f15	Add guide for tuning kNN search (#89782 ) This 'how to' guide explains performance considerations specific to kNN search. It takes inspiration from the 'tune for search speed' guide.	2022-10-12 14:53:53 -07:00
Ievgen Degtiarenko	4d6d979e0e	Deprecate state field in `/_cluster/reroute` response (#90399 )	2022-10-05 08:18:27 +02:00
Iraklis Psaroudakis	ad8d064de5	Redefine section on sizing data nodes (#90274 ) Now that we have the estimated field mappings heap overhead in nodes stats, we can refer to them in the guide for sizing data nodes appropriately. Relates to #86639	2022-09-30 12:37:21 +03:00
Iraklis Psaroudakis	3ed7a04d22	Introduce node mappings stats (#89807 ) So that they are visible in NodeIndicesStats only at the node and index (but not shard) levels. Also visible in the _cat/nodes table. And make an exact count yaml REST test.	2022-09-19 15:47:47 +03:00
Iraklis Psaroudakis	34471b1cd2	Introduce max headroom for disk watermark stages (#88639 ) Introduce max headroom settings for the low, high, and flood disk watermark stages, similar to the existing max headroom setting for the flood stage of the frozen tier. Introduce new max headrooms in HealthMetadata and in ReactiveStorageDeciderService. Add multiple tests in DiskThresholdDeciderUnitTests, DiskThresholdDeciderTests and DiskThresholdMonitorTests. Moreover, addition & subtraction for ByteSizeValue, and min.	2022-09-19 14:59:18 +03:00
Abdon Pijpelink	56edb88fed	Update disk-usage.asciidoc (#89709 ) (#89874 ) added missing word (cherry picked from commit `3e35455511`) Co-authored-by: Brady Vidovic <bradvido@users.noreply.github.com>	2022-09-07 23:28:44 +09:30
David Turner	546a2e2898	Add note on per-segment field name overhead (#89152 ) We encountered a case where a substantial fraction of the heap usage was due to per-segment-per-field `FieldInfo` objects, particularly `FieldInfo#name`. This commit adds a note to the sizing docs about this overhead.	2022-08-10 08:17:55 +01:00
David Turner	c81f907ad8	Refine size-your-shards wording (#89081 ) Clarify that the limits in the docs are absolute maxima that will avoid things just breaking but won't necessarily give great performance.	2022-08-08 18:36:32 +09:30
Dimitrios Liappis	5056b666de	[DOCS] Warn about impact of large readahead on search (#88007 ) When using LVM or software raid on Linux the kernel, or specific distribution rules, may use higher ergonomic defaults for the readahead of resulting block device(s). This can adversely affect search performance due to high page cache thrashing, in search heavy scenarios when mmap is involved. Add a clarification section in the docs raising awareness about this value and preferring the lower default.	2022-06-27 13:00:44 +03:00
Elasticsearch addict	336df7a266	Update disabling _source doc mentioning highlight (#87582 ) Closes #87311	2022-06-13 09:11:25 -04:00
Armin Braun	2a5d65c17f	Remove shards per gb of heap guidance (#86223 ) This guidance does not apply any longer. The overhead per shard has been significantly reduced in recent versions and removed rule of thumb will be too pessimistic in many if not most cases and might be too optimistic in other specific ones. => Replace guidance with rule of thumb per field count on data nodes and rule of thumb by index count (which is far more relevant nowadays than shards) for master nodes. relates #77466 Co-authored-by: David Turner <david.turner@elastic.co> Co-authored-by: Henning Andersen <33268011+henningandersen@users.noreply.github.com>	2022-06-09 15:31:01 +02:00
Leaf-Lin	7bd4708886	Revert "Move fix common issues into troubleshooting" This reverts commit `4a563e9bfb`.	2022-06-07 17:14:38 +10:00
Leaf-Lin	4a563e9bfb	Move fix common issues into troubleshooting	2022-06-07 17:07:03 +10:00
debadair	5f06a7f9c2	[DOCS] Remove references to X-Pack Basic License (#86822 )	2022-05-20 13:34:46 -07:00
vincetrumental	05b7664272	correct way of getting node heap size (#85045 ) * correct way of getting node heap size in [[shard-count-recommendation]], we explain that the number of shards should be at most 20 shards per GB of heap. but the command to get relevant heap size should be _cat/nodes?v=true&h=heap.max and not _cat/nodes?v=true&h=heap.current . The latter gives the current memory consumption, which is alway moving. Here we need to consider the max allocated heap size (-Xmx) * Adds heap.max to valid columns Co-authored-by: Adam Locke <adam.locke@elastic.co>	2022-05-11 09:59:34 -04:00
David Turner	ff742fcb27	More balanced docs about NFS etc (#85060 ) Today we don't really say anything about the requirements for the data path in terms of correctness, and we specifically say to avoid NFS for performance reasons. This isn't wholly accurate: some NFS implementations work just fine. This commit documents a more balanced position on local vs remote storage.	2022-03-18 13:01:59 +00:00
Tobias Stadler	e3deacf547	[DOCS] Fix typos (#83895 )	2022-02-15 12:42:17 -05:00
edh-oss	5ef77ef370	Add/update source block delimeters (#83624 ) Asciidoc source blocks are to be delimited with four dashes. This adds missing delimiters, and updates some that contained only three dashes. It matters for parsing purposes.	2022-02-11 14:19:30 -05:00
David Turner	7d69f1a974	Oversharding is also indices and fields (#81511 ) Today the _Size your shards_ docs focus on shard size and count, but in fact index count and field count are also important. This commit expands these docs a bit to cover this observation too.	2021-12-09 08:51:36 +00:00
David Turner	ca65718923	Clarify `unassigned.reason` docs (#81017 ) Today we indicate that the `unassigned.reason` field in various APIs indicates the reason why a shard is unassigned. This isn't really true, it tells you some information about the event that caused the shard to _become_ unassigned (or which most recently changed its routing table entry while remaining unassigned) but tells you almost nothing about why the shard _is now_ unassigned and how to fix it. That's what the allocation explain API is for. This commit clarifies this point in the docs. Closes #80892 Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2021-11-29 18:47:01 +00:00
Adam Locke	247d124666	[DOCS] Update ES quick start for security ON by default (#80735 ) * [DOCS] Update ES quick start for security ON by default * Remove code.asciidoc, which is part of the overall doc build now * Update node names for cleanup * Add note with links to tools * Add --net elastic network Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2021-11-17 17:48:07 -05:00
Stef Nestor	bc7c82c6b2	[+DOC] Tasks' Queue backup (#80447 ) * Add Tasks queue backup troubleshooting Co-authored-by: Deb Adair <debadair@elastic.co>	2021-11-10 08:47:16 -07:00
James Rodewig	f4bfdee5db	[DOCS] Fix cluster get settings API refs	2021-11-05 17:20:17 -04:00
James Rodewig	58abbe941f	[DOCS] Fix cluster update settings refs (#79580 ) The API is named 'cluster update settings,' not 'update cluster settings.'	2021-10-20 13:16:35 -04:00
Nikola Grcevski	055c770083	Deprecation of transient cluster settings (#78794 ) This PR changes uses of transient cluster settings to persistent cluster settings. The PR also deprecates the transient settings usage. Relates to #49540	2021-10-15 13:00:52 -04:00
James Rodewig	9e0299f551	[DOCS] Troubleshoot the flood-stage watermark error (#78519 ) Adds troubleshooting steps for the flood-stage watermark error. Closes #77906.	2021-10-01 08:32:53 -04:00
Stef Nestor	e1062803bb	[DOCS] Use dedicated hosts for ES (#77582 ) In production, we recommend you run {es} on a dedicated host or as a primary service. This adds that best practice to our setup documentation. Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>	2021-09-21 17:50:21 -04:00
James Rodewig	201a328d0c	[DOCS] Remove shrink snippet from 'Size your shards' (#77593 ) The current shrink API snippet doesn't show you how to remove replicas or reduce primary shards. Rather than duplicate those instructions from the shrink API docs, this removes the snippet. A link to the shrink API and shrink ILM action docs is already provided. It also updates a delete index API snippet to avoid wildcards. Wildcard expansion for the delete index API is disabled by default in 8.0.	2021-09-14 08:59:41 -04:00
James Rodewig	434843e66c	[DOCS] Fix typo	2021-09-10 10:56:37 -04:00
Stef Nestor	95a8c80f3d	[DOCS] Add max open shards error to 'Size your shards' (#77287 ) * [+DOC] ERROR: maximum shards open Appending common error into our Shard sizing docs along w/extra resources commonly viewed from [this Elastic Discuss](https://discuss.elastic.co/t/how-to-fix-hitting-maximum-shards-open-error/200502/2). Top 4 viewed error last 30d on Elastic Discuss. Kindly assist - fixing resource links - I'm debating including [the cluster setting you can temporarily override](https://www.elastic.co/guide/en/elasticsearch/reference/7.14/modules-cluster.html#cluster-shard-limit), but have left it off so far. Would love your thoughts! * reorg + edits * review feedback Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>	2021-09-10 09:41:30 -04:00
James Rodewig	e246e1ce53	[DOCS] Remove 'step' from headings (#76753 )	2021-08-20 08:52:04 -04:00

1 2 3 4

161 commits