9962 commits

Author SHA1 Message Date
Keith Massey
2566cd1738
Adding DiskHealthIndicatorService (#90041)
Adds a new health indicator that reports problems if indexes have a block placed on them, or if
any nodes in the cluster are running low on disk space.
2022-09-20 15:27:26 -05:00
Iraklis Psaroudakis
56dd7c74da
Forward port 8.4.2 documentation to main (#90170)
Forward port 8.4.2 documentation to main
2022-09-20 18:42:47 +03:00
Julie Tibshirani
b1acb3603d
Clarify that knn does not use postfiltering (#89897)
This PR expands the approximate kNN docs to clarify the filter is applied during
the kNN search, not after. It explains the downsides of postfiltering.
2022-09-19 16:47:17 -07:00
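The difference between filtering during the kNN search and postfiltering (#89897) can be illustrated with a small sketch. It uses an exact brute-force search over one-dimensional points as a stand-in for the approximate search; the class name, method names, and data are all hypothetical and are not taken from Elasticsearch.

```java
import java.util.Comparator;
import java.util.List;
import java.util.function.IntPredicate;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

public class KnnFilterOrder {
    // Ids of the k candidates closest to the query (exact search, for illustration only).
    static List<Integer> nearest(double[] points, double query, int k, IntPredicate candidate) {
        return IntStream.range(0, points.length)
            .filter(candidate)
            .boxed()
            .sorted(Comparator.comparingDouble((Integer i) -> Math.abs(points[i] - query)))
            .limit(k)
            .collect(Collectors.toList());
    }

    // Postfiltering: pick the k nearest first, then drop the ones that fail the filter.
    static List<Integer> postFilter(double[] points, boolean[] matches, double query, int k) {
        return nearest(points, query, k, i -> true).stream()
            .filter(i -> matches[i])
            .collect(Collectors.toList());
    }

    // Filtering during the search: only documents that pass the filter are candidates.
    static List<Integer> filterDuringSearch(double[] points, boolean[] matches, double query, int k) {
        return nearest(points, query, k, i -> matches[i]);
    }

    public static void main(String[] args) {
        double[] points = {0.1, 0.2, 0.3, 5.0, 6.0};           // hypothetical vectors
        boolean[] matches = {false, false, true, true, true};  // hypothetical filter results
        System.out.println(postFilter(points, matches, 0.0, 2));         // prints []
        System.out.println(filterDuringSearch(points, matches, 0.0, 2)); // prints [2, 3]
    }
}
```

In this sketch the two nearest points both fail the filter, so postfiltering returns nothing even though matching documents exist, while filtering during the search still returns k results.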
Joe Gallo
21356162eb
ILM don't rollover empty indices (#89557) 2022-09-19 18:37:41 -04:00
István Zoltán Szabó
45646b78e2
[DOCS] Adds missing_bucket setting to transform APIs (#90111) 2022-09-19 15:22:48 +02:00
Iraklis Psaroudakis
3ed7a04d22
Introduce node mappings stats (#89807)
The mapping stats are visible in NodeIndicesStats at the node and index levels only (not the shard level), and also in the _cat/nodes table. Adds a YAML REST test that checks the exact count.
2022-09-19 15:47:47 +03:00
Iraklis Psaroudakis
34471b1cd2
Introduce max headroom for disk watermark stages (#88639)
Introduce max headroom settings for the low, high, and flood disk watermark stages, similar to the existing max headroom setting for the flood stage of the frozen tier. Introduce new max headrooms in HealthMetadata and in ReactiveStorageDeciderService. Add multiple tests in DiskThresholdDeciderUnitTests, DiskThresholdDeciderTests and DiskThresholdMonitorTests. Also add addition, subtraction, and min operations to ByteSizeValue.
2022-09-19 14:59:18 +03:00
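One way to read the max headroom idea in #88639 is as a cap on the free space that a percentage-based watermark can demand on very large disks. The sketch below works under that assumption; the class, the method, and the numbers are hypothetical, and the actual logic lives in the Elasticsearch classes named in the commit message.

```java
public class WatermarkHeadroom {
    // Free bytes a node must keep to stay below a watermark, assuming the
    // watermark is a used-space ratio and the max headroom caps the result.
    static long requiredFreeBytes(long totalBytes, double maxUsedRatio, long maxHeadroomBytes) {
        long fromRatio = (long) Math.ceil(totalBytes * (1.0 - maxUsedRatio));
        return Math.min(fromRatio, maxHeadroomBytes);
    }

    public static void main(String[] args) {
        long gib = 1024L * 1024 * 1024;
        long headroom = 200 * gib; // illustrative max headroom value

        // Small disk: the ratio decides, because 15% of 100 GiB is below the cap.
        System.out.println(requiredFreeBytes(100 * gib, 0.85, headroom) / gib + " GiB");

        // Large disk: 15% of 10 TiB would be about 1.5 TiB, so the cap decides.
        System.out.println(requiredFreeBytes(10 * 1024 * gib, 0.85, headroom) / gib + " GiB");
    }
}
```

Without a cap, a ratio-only watermark leaves a growing amount of space unused as disks get larger, which is the situation the max headroom settings appear intended to address.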
James Baiera
db73aa0498
Add repeated snapshot failure troubleshooting guide (#89762)
This troubleshooting guide is what the SLM health indicator returns
when an SLM policy has failed too many times in a row without a
successful execution.
2022-09-15 17:01:32 -04:00
David Turner
c7da07068c
Reinstate prerelease upgrade warning (#90093)
This warning was lost in #83489, but it's important we have it in these
docs since users keep on trying this kind of invalid upgrade. This
commit reinstates the lost warning.
2022-09-16 00:06:08 +09:30
Keith Massey
8f591e72f8
Adding node name to cluster_formation section of stable master health API results (#89982) 2022-09-14 13:31:03 -05:00
Joe Gallo
f262f36564
Add master_timeout to the snapshot delete docs (#90032) 2022-09-14 12:23:46 -04:00
Nik Everett
17967a98d3
Remove synthetic _source from tech preview (#90042)
I've been hacking on synthetic source for a while now and not seen any
need to break backwards compatibility or any major bugs. I think it's
time to remove the `preview` marker from it so folks can use it without
fear.
2022-09-13 16:33:10 -04:00
Iraklis Psaroudakis
d83ed3315a
Re-registering corrupt repository unblocks it (#89719)
Fixes #89130
2022-09-12 20:21:35 +03:00
didendron
84eb60ee16
SQL: Implement DATE_FORMAT function (#88388) 2022-09-12 09:33:34 +02:00
James Baiera
c570d3e88e
Add advice on restoring ILM indices into unmanaged state (#89140) 2022-09-11 23:29:54 -04:00
Alan Woodward
224f48e637
[DOCS] document that date and date_nanos fields support synthetic source (#89968) 2022-09-09 17:21:43 +01:00
Christos Soulios
1a709caa65
[TSDB] Removed summary and histogram metric types (#89937)
It seems that for now we don't have a good use for the histogram and summary metric types. 
They had been left as place holders for a while, but at this point there is no concrete plan forward for them.

This PR removes the histogram and summary metric types. We may add them back in the future.

Also, this PR completely removes the time_series_metric mapping parameter from the histogram field type and only allows the gauge metric type for aggregate_metric_double fields.
2022-09-09 15:04:30 +03:00
Ed Savage
fd20027751
[ML] Performance improvements for categorization jobs (#89824)
Categorization of strings which break down to a huge number of tokens can cause the C++ backend process to choke - see elastic/ml-cpp#2403.

This PR adds a limit filter to the default categorization analyzer which caps the number of tokens passed to the backend at 100.

Unfortunately this isn't a complete panacea to all the issues surrounding categorization of many tokened / large messages as verification checks on the frontend can also fail due to calls to the datafeed _preview API returning an excessive amount of data.
2022-09-08 18:41:01 +01:00
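The effect of the token cap described in #89824 can be sketched as a simple truncation step. This is only an illustration: the whitespace tokenization, the class name, and the sample message are hypothetical, and the real change is a limit filter inside the categorization analyzer rather than code like this.

```java
import java.util.Arrays;
import java.util.List;

public class TokenLimit {
    static final int MAX_TOKENS = 100; // the cap stated in the commit message

    // Keep at most maxTokenCount tokens and drop the rest.
    static List<String> limit(List<String> tokens, int maxTokenCount) {
        return tokens.subList(0, Math.min(tokens.size(), maxTokenCount));
    }

    public static void main(String[] args) {
        // Hypothetical oversized log message, tokenized on whitespace for simplicity.
        String message = "node-7 failed to connect ".repeat(200);
        List<String> tokens = Arrays.asList(message.trim().split("\\s+"));
        System.out.println(tokens.size() + " tokens in, "
            + limit(tokens, MAX_TOKENS).size() + " tokens out");
    }
}
```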
Nikola Grcevski
f16110da9b
[DOCS] Revert changes for ES_JAVA_OPTS (#89931)
ES_JAVA_OPTS is still the correct way to pass options to
the Elasticsearch process, CLI_JAVA_OPTS affects only the
command line tool. CLI_JAVA_OPTS is the correct way to pass
options for plugin installation or other tools.
2022-09-08 13:37:39 -04:00
Nik Everett
c4a77d572d
Synthetic _source: support dense_vector (#89840)
This adds support for synthetic _source to `dense_vector` fields.

![image](https://user-images.githubusercontent.com/215970/188734496-0f0772c7-4c7a-46b6-b978-0c220e73474d.png)
2022-09-09 00:54:59 +09:30
Nik Everett
e89586c20d
Document synthetic source for text and keyword (#89893)
`text` and `keyword` fields support synthetic _source in a few more
configurations now. This documents those configurations.
2022-09-08 23:35:27 +09:30
Adam Locke
93ad8828a3
[DOCS] Add note about passwords in .env Docker file (#89892) 2022-09-08 08:40:09 -04:00
Abdon Pijpelink
346f7848e6
[DOCS] Add troubleshooting searches guide (#89583)
* [DOCS] Adds troubleshooting searches guide

* Additional troubleshooting steps

* Apply review suggestions

* Replace separate _cat aliases/indices requests with one get indices call

* Reorder steps to move field caps forward

* Add note about ignore_unavailable

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2022-09-08 14:30:21 +02:00
Pooya Salehi
72b2bd90e2
Update running snapshot state value to STARTED (#89863)
The get snapshot status API will currently return a value of `STARTED` for the state of a snapshot that is currently running. The documentation says that the `state` value for a running snapshot is `IN_PROGRESS`. This documentation change will align the docs with the actual result of the get snapshot status API.

Co-authored-by: Austin Smith <76973609+asmith-elastic@users.noreply.github.com>
2022-09-08 10:12:44 +02:00
Keith Massey
06cfa747be
Avoiding the use of dynamic map keys in the cluster_formation results of the stable master health indicator (#89842)
Part of the stable master history health indicator's results (the
`cluster_formation` section within `details`) used dynamic keys in a
map. This gets rid of that. So now instead of:

```
"details": {
                "current_master": {
                    "node_id": null,
                    "name": null
                },
                "recent_masters": [
                    {
                        "node_id": "31WBm9iTTRuMyWnBhWNUGA",
                        "name": "master-node-3"
                    }
                ],
                "cluster_formation": {
                    "31WBm9iTTRuMyWnBhWNUGA": "master not discovered or elected yet, an election requires at least 2 nodes with ids from [nADkAeGsT-q12gw89Ga1FA, 31WBm9iTTRuMyWnBhWNUGA, w8v48JvuRsuDCjwBn8KbRw], have only discovered non-quorum [{master-node-3}{31WBm9iTTRuMyWnBhWNUGA}{lJmGYiTPS_W7AJU7csG_gQ}{master-node-3}{127.0.0.1}{127.0.0.1:9301}{dm}]; discovery will continue using [127.0.0.1:9300, 127.0.0.1:9302, 127.0.0.1:9303, 127.0.0.1:9304, 127.0.0.1:9305, [::1]:9300, [::1]:9302, [::1]:9303, [::1]:9304, [::1]:9305] from hosts providers and [{master-node-2}{nADkAeGsT-q12gw89Ga1FA}{logzEHuuTpqwJp-RWssBPw}{master-node-2}{127.0.0.1}{127.0.0.1:9300}{dm}, {master-node-3}{31WBm9iTTRuMyWnBhWNUGA}{lJmGYiTPS_W7AJU7csG_gQ}{master-node-3}{127.0.0.1}{127.0.0.1:9301}{dm}] from last-known cluster state; node term 39, last-accepted version 461 in term 39"
                }
}
```

We will have:

```
"details": {
                "current_master": {
                    "node_id": null,
                    "name": null
                },
                "recent_masters": [
                    {
                        "node_id": "31WBm9iTTRuMyWnBhWNUGA",
                        "name": "master-node-3"
                    }
                ],
                "cluster_formation": [
                    {
                        "node_id": "31WBm9iTTRuMyWnBhWNUGA",
                        "cluster_formation_message": "master not discovered or elected yet, an election requires at least 2 nodes with ids from [nADkAeGsT-q12gw89Ga1FA, 31WBm9iTTRuMyWnBhWNUGA, w8v48JvuRsuDCjwBn8KbRw], have only discovered non-quorum [{master-node-3}{31WBm9iTTRuMyWnBhWNUGA}{lJmGYiTPS_W7AJU7csG_gQ}{master-node-3}{127.0.0.1}{127.0.0.1:9301}{dm}]; discovery will continue using [127.0.0.1:9300, 127.0.0.1:9302, 127.0.0.1:9303, 127.0.0.1:9304, 127.0.0.1:9305, [::1]:9300, [::1]:9302, [::1]:9303, [::1]:9304, [::1]:9305] from hosts providers and [{master-node-2}{nADkAeGsT-q12gw89Ga1FA}{logzEHuuTpqwJp-RWssBPw}{master-node-2}{127.0.0.1}{127.0.0.1:9300}{dm}, {master-node-3}{31WBm9iTTRuMyWnBhWNUGA}{lJmGYiTPS_W7AJU7csG_gQ}{master-node-3}{127.0.0.1}{127.0.0.1:9301}{dm}] from last-known cluster state; node term 39, last-accepted version 461 in term 39"
                    }
                ]
}
```
2022-09-08 06:22:36 +09:30
Nik Everett
b667aa33f0
Synthetic _source: support histogram field (#89833)
Adds support for the `histogram` field type to synthetic _source.

![image](https://user-images.githubusercontent.com/215970/188691249-9d23d1dc-64ab-49a4-8b24-f60fc966c0ac.png)
2022-09-08 01:55:38 +09:30
István Zoltán Szabó
7de1a6efc5
[DOCS] Simplifies composite aggregation recommendation (#89878) 2022-09-07 17:54:05 +02:00
Adam Locke
686a3fd45d
[DOCS] Update CCS compatibility matrix for 8.3 (#88906)
* [DOCS] Update CCS compatibility matrix for 8.3

Updates the CCS compatibility table to include 8.3.

* Fixing busted table 🔨

* Update table for 8.3 -> 8.1 support

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2022-09-07 11:53:22 -04:00
Abdon Pijpelink
56edb88fed
Update disk-usage.asciidoc (#89709) (#89874)
added missing word

(cherry picked from commit 3e35455511)

Co-authored-by: Brady Vidovic <bradvido@users.noreply.github.com>
2022-09-07 23:28:44 +09:30
Pooya Salehi
59c1c81262
Fix typo in get-snapshot-status-api doc (#89865) 2022-09-07 15:21:41 +02:00
Julie Tibshirani
3c1b070329
Avoid negative scores with cross_fields type (#89016)
The cross_fields scoring type can produce negative scores when some documents
are missing fields. When blending term document frequencies, we take the maximum
document frequency across all fields. If one field appears in fewer documents
than another, this means that its IDF can become negative. This is because IDF
is calculated as `Math.log(1 + (docCount - docFreq + 0.5) / (docFreq + 0.5))`

This change adjusts the docFreq for each field to `Math.min(docCount, docFreq)`
so that the IDF can never become negative. It makes sense that the term document
frequency should never exceed the number of documents containing the field.
2022-09-06 13:02:24 -07:00
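A worked example of the IDF issue described in #89016, using the formula and the `Math.min` adjustment quoted in the commit message. The class name and the sample counts are hypothetical.

```java
public class CrossFieldsIdf {
    // IDF as written in the commit message.
    static double idf(long docCount, long docFreq) {
        return Math.log(1 + (docCount - docFreq + 0.5) / (docFreq + 0.5));
    }

    // Same formula, with docFreq clamped to docCount first.
    static double clampedIdf(long docCount, long docFreq) {
        return idf(docCount, Math.min(docCount, docFreq));
    }

    public static void main(String[] args) {
        long docCount = 10;       // hypothetical: documents that contain this field
        long blendedDocFreq = 30; // hypothetical: maximum docFreq taken from another field

        // (10 - 30 + 0.5) / 30.5 is about -0.64, so the log argument drops below 1.
        System.out.println("unclamped: " + idf(docCount, blendedDocFreq));
        // After clamping, docFreq is 10 and the log argument stays above 1.
        System.out.println("clamped:   " + clampedIdf(docCount, blendedDocFreq));
    }
}
```

The logarithm is negative exactly when its argument is below 1, which happens only when `docFreq` exceeds `docCount`. Clamping removes that case.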
Tanguy Leroux
33ff7b2cb8
[Docs] Indicate that CCR does not auto-follow existing indices, only newly created ones (#89498) 2022-09-07 01:43:19 +09:30
Anthony McGlone
492f5b1751
[DOCS] Update search_after section with an example (#89631)
* [DOCS] Update search_after section with an example

* Update docs/reference/search/search-your-data/paginate-search-results.asciidoc

Co-authored-by: Abdon Pijpelink <abdon@abdon.nl>

* Update docs/reference/search/search-your-data/paginate-search-results.asciidoc

Co-authored-by: Abdon Pijpelink <abdon@abdon.nl>

* Update docs/reference/search/search-your-data/paginate-search-results.asciidoc

Co-authored-by: Abdon Pijpelink <abdon@abdon.nl>

* [DOCS] Update search_after section with an example

* [DOCS] Update search_after example with a response with sort values

Co-authored-by: Abdon Pijpelink <abdon@abdon.nl>
2022-09-05 15:33:03 +02:00
David Turner
6e3fe01b5a
Reject unknown bulk actions (#89450)
Requests to the bulk API comprise a sequence of items, each of which
starts with a JSON object describing the item. This object includes the
type of action to perform with the item which should be one of `create`,
`update`, `index`, or `delete`. In earlier versions Elasticsearch would
ignore items with an unrecognized type, skipping the next line in the
request, but this lenient behaviour means that there is no way for the
client to associate the items in the response with the items in the
request, and in some cases it would cause the remainder of the request
to be parsed incorrectly.

With this commit, requests to the bulk API must comprise only items with
recognized types. Elasticsearch will reject requests containing any
items with an unrecognized type with a `400 Bad Request` error response.
2022-09-05 20:57:02 +09:30
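The behaviour change in #89450 can be sketched by comparing a lenient and a strict check over the action types of a bulk request. This is a simplified illustration that works on already-extracted action names. The class, the method names, and the error text are hypothetical and do not reflect the actual Elasticsearch parser.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

public class BulkActionCheck {
    // The four action types listed in the commit message.
    static final Set<String> KNOWN_ACTIONS = Set.of("create", "update", "index", "delete");

    // Old, lenient behaviour: silently skip items with an unrecognized type.
    static List<String> lenient(List<String> actionTypes) {
        List<String> accepted = new ArrayList<>();
        for (String type : actionTypes) {
            if (KNOWN_ACTIONS.contains(type)) {
                accepted.add(type);
            }
        }
        return accepted;
    }

    // New, strict behaviour: reject the whole request on the first unrecognized type.
    static List<String> strict(List<String> actionTypes) {
        for (String type : actionTypes) {
            if (!KNOWN_ACTIONS.contains(type)) {
                // Hypothetical message; the real response is a 400 Bad Request.
                throw new IllegalArgumentException("unknown bulk action [" + type + "]");
            }
        }
        return actionTypes;
    }

    public static void main(String[] args) {
        List<String> request = List.of("index", "upsert", "delete");
        // Three items were sent but only two are processed, so response
        // positions no longer line up with request positions.
        System.out.println(lenient(request).size());
        try {
            strict(request);
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```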
debadair
ba14aaa004
[DOCS] Add simple dev setup instructions back to the README (#89714)
* [DOCS] Add Docker-based quickstart instructions to README & ES guide

* Removed attributes

* Added basic tutorial

* Fixed headings

* Edit

* [DOCS] Adding basic dev setup instructions to the ES guide

* Incorporated security config from Kibana docs

* Update README.asciidoc

Co-authored-by: Adam Locke <adam.locke@elastic.co>

* Update README.asciidoc

Co-authored-by: Adam Locke <adam.locke@elastic.co>

* Added callouts for version attributes

* Update README.asciidoc

Co-authored-by: Adam Locke <adam.locke@elastic.co>

* Update README.asciidoc

Co-authored-by: Adam Locke <adam.locke@elastic.co>

* Added README content to guide topic

* Snippet test fixes

* Incorporated review feedback

* Remove version

* Update docs/reference/setup/run-elasticsearch-locally.asciidoc

* Added info about security on by default

* Fix xrefs

* Update docs/reference/setup/run-elasticsearch-locally.asciidoc

* Fix headers

* Edits

Co-authored-by: Adam Locke <adam.locke@elastic.co>
2022-09-01 16:49:05 -07:00
István Zoltán Szabó
e244473962
[DOCS] Reworks aggregating data for faster performance page (#89575) 2022-09-01 13:59:05 +02:00
István Zoltán Szabó
cbda0a51c6
[DOCS] Adds text similarity task example to API docs (#89756) 2022-09-01 11:53:26 +02:00
Dimitris Athanasiou
b5504ea701
[ML] Lift limit of max number of classes for classification to 100 (#89755)
Limit was previously set to `30`. After the improvements in elastic/ml-cpp#2395
we now raise the limit to `100`.
2022-09-01 10:47:58 +03:00
István Zoltán Szabó
a23c77ceaa
[DOCs] Orders transform API parameters in alphabetical order. (#89746) 2022-08-31 11:42:07 +02:00
Jack Conradson
8c30b86fe2
Fix bug for kNN with filtered aliases (#89621)
This change adds the filter query for a filtered alias to the knn query during the dfs phase on the 
shard. This ensures the correct number of k results are returned instead of removing results as a post 
filter.

Fixes: #89561
2022-08-30 15:57:37 -07:00
debadair
30bb54a4e2
[DOCS] Remove coming tag from 8.4.1 RNs (#89727)
* [Doc] Release notes for v8.4.1 (#89636)

* [Doc] Release notes for v8.4.1

Gradle generated release notes for v8.4.1

* address feedback

* [DOCS] Remove coming tag for 8.4.1 RNs

Co-authored-by: Yang Wang <yang.wang@elastic.co>
2022-08-30 08:00:08 -07:00
debadair
e0fe6de63a
[Doc] Release notes for v8.4.1 (#89636) (#89726)
* [Doc] Release notes for v8.4.1

Gradle generated release notes for v8.4.1

* address feedback

Co-authored-by: Yang Wang <yang.wang@elastic.co>
2022-08-30 07:59:04 -07:00
Andrei Dan
7edf5aa750
[DOCS] health api details descriptions (#89553) 2022-08-30 15:55:18 +01:00
Nik Everett
104f4e9fb5
Synthetic _source: support version field type (#89706)
This adds support for synthetic _source to the `version` field type. It
works very similarly to `keyword` but with an extra decode step.

I modified the decoder to return a `BytesRef` instead of a `String`
because many of the callers seemed to be converting that string directly
into bytes again. Synthetic source would have wanted to do that, as the
query infrastructure was.
2022-08-30 09:39:50 -04:00
Abdon Pijpelink
e891909dfa
[DOCS] Explain dynamic behavior for unmapped copy_to fields (#89626)
* [DOCS] Explain dynamic behavior for unmapped copy_to fields

* Review suggestions
2022-08-30 15:15:35 +02:00
debadair
cda5ce42aa
remove 8.3.0 coming tag (#89701) 2022-08-29 12:32:14 -07:00
Leaf-Lin
78d9d6844f
[DOCS] Add warning on restarting nodes exceeding low disk watermark (#89018)
* Create restart-cluster.asciidoc

As per https://github.com/elastic/elasticsearch/issues/49972 and https://github.com/elastic/elasticsearch/issues/56578, if a node is above low disk threshold when being restarted (rolling restart, network disruption or crash), the disk threshold decider prevents reusing the shard content on the restarted node.

The consequence of the event is the node may take a long time to start.

* Update docs/reference/setup/restart-cluster.asciidoc

LGTM! Thanks!

Co-authored-by: Adam Locke <adam.locke@elastic.co>
2022-08-29 15:14:44 -04:00
David Kilfoyle
2a44a8982f
[DOCS] Remove feature flag from TSDS docs (#89673)
* Docs: Remove feature flag and add preview label to TSDS docs

* Fix technical preview tag
2022-08-29 10:33:55 -04:00
Abdon Pijpelink
772784f3c9
[DOCS] Add note that terms enum API may return terms from deleted docs (#89654) 2022-08-29 15:19:04 +02:00
István Zoltán Szabó
9a71d1fa78
[DOCS] Clarifies retention policy for transforms (#89685) 2022-08-29 13:17:15 +02:00