Commit graph

9899 commits

Author SHA1 Message Date
James Rodewig
7f9b62ee7a
[DOCS] Clarify document updates don't apply ingest pipelines (#82232) (#82328)
Adding text to clarify that the default pipeline only applies to indexing requests, not updates.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
(cherry picked from commit 4e6e4eab22)

Co-authored-by: Mike Barretta <mike.barretta@elastic.co>
2022-01-06 16:51:45 -05:00
Bogdan Pintea
13a0e420a3
SQL: Add CCS SQL documentation (#81545)
This adds the documentation for CCS SQL.

Co-authored-by: James Rodewig <james.rodewig@elastic.co>
2022-01-05 20:01:01 +01:00
Benjamin Trent
9dc8aea1cb
[ML] adds new mpnet tokenization for nlp models (#82234)
This commit adds support for MPNet based models.

MPNet models differ from BERT style models in that:

 - Special tokens are different
 - Input to the model doesn't require token positions.

To configure an MPNet tokenizer for your pytorch MPNet based model:

```
"tokenization": {
  "mpnet": {...}
}
```
The options provided to `mpnet` are the same as the previously supported `bert` configuration.
2022-01-05 12:56:47 -05:00
Andrei Dan
f18c9c503e
Migrate to data tiers API dry run on any ILM status (#82226)
The migrate to data tiers routing API required ILM to be stopped. This
is fine for "live" runs, but for dry runs this isn't a requirement.

This changes the dry_run to allow the API to run irrespective of the ILM
status.
2022-01-05 13:40:27 +00:00
Andrei Dan
332e6d48c4
Data tiers: migrate the coldest node attribute (#81940)
This fixes the migrate to data tiers routing API to take into account
the scenario where the node attribute configuration for an index is more
accurate than the existing `_tier_preference` configuration.

Previously we would simply remove the node attributes routing if there
was a `_tier_preference` configured for the index.

With this commit, we'll look if either the `require.data` or
`include.data` custom routings are colder than the existing `_tier_preference`
configuration (ie. `cold` vs `data_warm,data_hot`) and update the tier
routing accordingly.

eg.
{
  index.routing.allocation.require.data: "warm",
  index.routing.allocation.include.data: "cold",
  index.routing.allocation.include._tier_preference: "data_hot"
}
will be migrated to:
{
  index.routing.allocation.include._tier_preference: "data_cold,data_warm,data_hot"
}

This also removes the existing invariant that had the `require.data`
configuration take precedence over a possible `include.data`
configuration, and will now migrate the coldest configuration to the
corresponding `_tier_preference`.

eg.
{
  index.routing.allocation.require.data: "warm",
  index.routing.allocation.include.data: "cold"
}
will be migrated to:
{
  index.routing.allocation.include._tier_preference: "data_cold,data_warm,data_hot"
}
2022-01-05 09:56:37 +00:00
Stef Nestor
9fd280b67e
[DOC] Don't include searchable snapshot ILM action in both hot and cold phases (#82013)
As outlined in elastic/elasticsearch#81604, including the `searchable_snapshot` action in both the hot and cold phases can result in indices not automatically migrating to the cold tier during the cold phase.

This adds a related warning.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2022-01-04 12:37:05 -05:00
Jake Landis
fd6f04bb24
[docs] clarify purged http stats (#82123) 2022-01-04 09:51:41 -06:00
James Rodewig
f5f76ff1ca
[DOCS] Note that default_field support wildcards (#81127)
Changes:

* Notes that the query string query's `default_field` and `fields` parameters support wildcards.
* Adds an xref to the `index.query.default_field` docs to the `default_field` parameter.
2022-01-04 08:26:13 -05:00
Costin Leau
ec57fbec6f
SQL: Remove JDBC dependency on ES lib geo (#82166)
As part of the effort of making JDBC driver self sufficient, remove the
ES lib geo dependencies without any replacement.
Currently the JDBC driver takes the WKT text and instantiates a geo
object based on the ES lib geo.
Moving forward the driver will return the WKT string representation
without any conversion letting the user pick the geo library desired.
That can be ES lib geo, jts, spatial4j or others.

Note this is a breaking change.

Relates #80277
2022-01-04 07:55:41 +02:00
Albert Zaharovits
9e9a8cc7d0
Auto-generated TLS files under fixed config path (#81547)
We (mostly I) were initially advocating for the auto-generated files to
use unique names (the name containing a timestamp particle), in order to
avoid that subsequent invocations of the config step conflict with
itself. Moreover, I was wishing that these files will not have to be
handled directly by admins (that the enrollment process was to be used).
However, experience proved us otherwise, admins have to manipulate these
files, and unique configuration names are hard to deal with in scripts
and docs, so this PR is all about using a fixed name for all the
generated files. _Labeling as a bug fix because the feedback is that it
very negatively impacts usabilty._ Closes
https://github.com/elastic/elasticsearch/issues/81057
2021-12-29 12:56:29 -05:00
Lisa Cawley
0a1851353a
Update using.asciidoc (#81896) (#82097)
Co-authored-by: Majid Alaeinia <11965368+majidalaeinia@users.noreply.github.com>
2021-12-27 12:51:21 -08:00
Dimitris Athanasiou
14a63ac115
[ML] Improve reporting of trained model size stats (#82000)
This improves reporting of trained model size in the response of the stats API.

In particular, it removes the `model_size_bytes` from the `deployment_stats` section and
replaces it with a top-level `model_size_stats` object that contains:

- `model_size_bytes`: the actual model size
- `required_native_memory_bytes`: the amount of memory required to load a model

In addition, these are now reported for PyTorch models regardless of their deployment state.
2021-12-22 18:20:47 +02:00
Justin Cranford
e467424043
Add JwtRealmSettings (#81680)
Add JwtRealmSettings

Include unit tests and realm security settings documentation. Covers all settings except client authentication mTLS option, and HTTP proxy option.

Refactor Open ID Connect realm to reuse ClaimSetting.java and ClaimParser.java for JWT realm.
2021-12-21 16:28:33 -05:00
James Rodewig
f1004ee698
[DOCS] Fix xref for conditionally running ingest processor (#82001)
Closes #81966
2021-12-21 11:37:20 -05:00
Ievgen Degtiarenko
11b52619c5
do not scroll if max docs is less than scroll size (update/delete by query) (#81654)
This change allows to not open scroll while reindex/delete_by_query/update_by_query
if configured max_docs if less then or equal to the number of documents returned by the scroll batch.
2021-12-21 15:26:51 +01:00
James Rodewig
ff89ee0beb
[DOCS] Remove Windows MSI installer package (#81952) (#81957)
After 7.16.2, we'll no longer produce Windows MSI installer packages for Elasticsearch. These packages were previously released in beta and didn't receive widespread adoption.

### Changes:

* Adds a related 7.17 breaking change.
* Adds a related 7.16 deprecation.
* Removes the MSI installation instructions.
* Removes references to the MSI installer.

I plan to port the applicable changes to 8.1 (main), 8.0, 7.17, and 7.16. In the 7.16 ports, I'll leave in the MSI install docs and add related deprecation notes to them instead.
2021-12-20 13:49:32 -05:00
James Rodewig
78e2e14404
[DOCS] Remove configuration management tools (#81938)
Removes a section covering configuration management tools from the
installation instructions.

After 7.16.2, Elastic will no longer maintain these tools. Previously,
the tools were only supported on a "best effort" basis.
2021-12-20 11:31:03 -05:00
Stef Nestor
e2d66cd257
[DOCS] Thread pool settings are static (#81887)
Starting in 5.1 Thread Pools can no longer be dynamically updated, [doc](https://www.elastic.co/guide/en/elasticsearch/reference/5.0/breaking_50_settings_changes.html#_threadpool_settings).
2021-12-20 11:20:06 -05:00
Ed Savage
a646f55c57
[ML] Set default value of 30 days for model prune window (#81377)
For new jobs, when the analysis config field model_prune_window is not set, use a default value of 30 days or 20 times the bucket span, whichever is greater.

Co-authored-by: David Roberts <dave.roberts@elastic.co>
Co-authored-by: Lisa Cawley <lcawley@elastic.co>
2021-12-20 11:27:30 +00:00
Lisa Cawley
076343933f
[DOCS] Update link in inference processor (#81897) 2021-12-17 15:49:59 -08:00
James Rodewig
efeb1983ae
[DOCS] Combine 8.0 breaking changes for mapping type endpoints (#81730)
Combines several 8.0 breaking changes for the removal of API endpoints that contain mapping types. These items were separate because we previously organized breaking changes by area.

This is a follow-on to #79162.
2021-12-17 16:03:51 -05:00
Dan Hermann
b1f5373e02
Correct docs on output_format option for date processor (#81557) 2021-12-17 06:07:03 -06:00
Alan Woodward
f0bf6f5ee3
Configure IndexSearcher.maxClauseCount() based on Node characteristics (#81525)
This commit deprecates the indices.query.bool.max_clause_count node setting,
and instead configures the maximum clause count for lucene based on the available
heap and the size of the thread pool.

Closes #46433
2021-12-17 11:37:32 +00:00
Adam Locke
bf60ce68c6
[DOCS] Update Docker Compose installation with simplified configuration (#81835)
* [DOCS] Update Docker Compose installation with simplified configuration

* Change heading to fix link issue
2021-12-16 19:42:41 -05:00
James Baiera
159b1ce20b
Adding default templates for Metricbeat ECS data (#81744)
This PR adds four new templates that are automatically installed from the Monitoring plugin.

In 8.x, Metricbeat will be writing its data in ECS compliant format, even when used with xpack 
mode enabled (stack monitoring). In order to continue to support the legacy data format, new 
mappings have been created with the new ECS fields for indexing data, and alias fields for the 
legacy format which point to the corresponding ECS fields.

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Mat Schaffer <mat@schaffer.me>
2021-12-16 16:14:18 -05:00
Adam Locke
dd47e68ce1
[DOCS] Enroll additional nodes on Docker (#81787)
* [DOCS] Enroll additional nodes on Docker

* Remove -p option for second node

Co-authored-by: Fabio Busatto <52658645+bytebilly@users.noreply.github.com>

* Rename nodes to align with other Docker docs

* Add elastic network to first node docker run command

* Remove hyphen from node names

Co-authored-by: Fabio Busatto <52658645+bytebilly@users.noreply.github.com>
2021-12-16 12:42:16 -05:00
Leaf-Lin
82592c4268
[DOCS] Update remote cluster version compatibility table for 8.x (#81239)
Updates the remote clusters version compatibility table to include 7.17 and 8.x versions.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2021-12-16 11:16:24 -05:00
James Rodewig
dc009b746f
[DOCS] Update scripted upsert example (#81481)
Updates the scripted upsert example to use a script `source` rather than
a stored script `id`. This makes the example more copy/paste-able.
2021-12-16 11:13:43 -05:00
David Turner
b1db7b67c6
Permit metadata updates on flood-stage-blocked indices (#81781)
If a node reaches the flood stage watermark then we automatically apply
the `read_only_allow_delete` block to all its indices to prevent any
further growth in data. Users are expected to fix the disk space issue
by adding more space or deleting indices. However some users may prefer
to fix the disk space issues by modifying some of the index settings,
perhaps removing replicas or adjusting an allocation filter to move
shards onto nodes with more space. Today this isn't possible since the
`read_only_allow_delete` block also applies to metadata writes. Blocking
metadata writes isn't necessary to protect against further increases in
disk usage, and makes it harder for users to resolve the disk space
issue, so this commit removes the `METADATA_WRITE` level from the block
definition.
2021-12-16 12:19:41 +00:00
William Chaparro
c8e8104f66
[DOCS] Remove experimental language from HDR Histo percentiles/ranks (#81773)
per issue 60780, decision from team to remove experimental language from HDR Histogram percentiles and ranks. Feature has been in production for quite some time.
closes #60780
2021-12-15 14:35:08 -05:00
James Rodewig
04318961b9
[DOCS] Clarify supported parameters for terms value source (#81775)
The composite aggregation's `terms` value source doesn't support the same set of
parameters as the `terms` aggregation.

Closes #81431.
2021-12-15 14:32:16 -05:00
Adam Locke
1bd0561a91
[DOCS] Add docs for verifying CA fingerprint (#81279)
* [DOCS] Add docs for verifying CA fingerprint

* Update openssl command and explanatory text

* Explain copying CA cert if fingerprint validation isn't possible

* Incorporate new section into the main security config page

* Clarify how cert is used

Co-authored-by: Ioannis Kakavas <ikakavas@protonmail.com>

* Split into two, separate sections

* Rename file and update text based on feedback

* Update ref to use new filename

* Remove extra word

Co-authored-by: Ioannis Kakavas <ikakavas@protonmail.com>
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2021-12-15 14:08:20 -05:00
Adam Locke
7b47214e5e
[DOCS] Rework manual security docs for 8.0+ (#80285)
* [DOCS] Remove sentence about security being disabled by default

* Updating introduction

* Remove minimal security page

* Clarify configuring security before starting ES

* Clarifications

* Remove old file

* Add set passwords page

* Update change passwords page, clarify TLS adjustments, and other edits

* Update test

* Minor clarification to intro text

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2021-12-14 10:58:53 -05:00
Ignacio Vera
8c6ed1efc0
Remove experimental flag from geo field format mvt (#81721)
Small left over from 7.16 where mvt feature become GA
2021-12-14 15:21:05 +01:00
David Kyle
d1ee756da8
[ML][DOCS] Add note about max values of thread settings (#81367) 2021-12-14 13:07:34 +00:00
David Roberts
0559dd087b
[ML] Model snapshot upgrade needs a stats endpoint (#81641)
Previously the ML model snapshot upgrade endpoint did not
provide a way to reliably monitor progress. This could lead
to the upgrade assistant UI thinking that a model snapshot
upgrade had finished when it actually hadn't.

This change adds a new "stats" API that allows external
interested parties to find out the status of each model
snapshot upgrade and which node (if any) each is running on.

Fixes #81519
2021-12-14 08:31:49 +00:00
Lisa Cawley
1751ced80a
[DOCS] Fix formatting in get anomaly job API (#81682) 2021-12-13 12:56:27 -08:00
eltomello
38a74a4545
[DOCS] Fix field name to match description (#81621) 2021-12-13 15:51:42 -05:00
David Turner
30bda56f9a
Readonly repos don't cache (#81674)
We say to mark repos as readonly to prevent corruption, but there's
other ways to prevent corruption that people sometimes use instead (e.g.
denying writes at the filesystem/bucket level). It's reasonable to think
that the readonly flag is redundant in that situation but it's not: they
should still mark the repo as readonly tho to bypass the cache and
re-read its contents on each access. This commit adds docs to that
effect.

Co-authored-by: James Rodewig <james.rodewig@elastic.co>
2021-12-13 18:36:26 +00:00
James Rodewig
6c8f0b170b
[DOCS] Fix ordered list formatting in data stream docs (#81667) 2021-12-13 12:53:48 -05:00
James Rodewig
5354e50316
[DOCS] Fix anchor for 'Shrink an index' section (#81665)
Reverts an anchor change from #46711.

Previous versions of the docs use the `_shrinking_an_index` anchor for this
section. Preserving that anchor will prevent doc build breaks in future releases.
2021-12-13 12:24:31 -05:00
Mary Gouseti
175c4793f9
Expose the index age in ILM explain output. (#81273)
* Expose the index age in ILM explain output.

 ILM already exposes the `age` that ILM will use to transition to the next phase, based on that phase's `min_age`. The `index_age` is based only on the index creation date and it's used to trigger a rollover.

 Resolves #64429
2021-12-13 15:38:25 +01:00
mushaoqiong
d467aae67e
Force merge rest api support wait_for_completion (#80463)
Force merge action is a very costly action. It may take several hours to run for big indices. But current force merge rest api do not support wait_for_completion parameter.
This adds support for the wait_for_completion parameter.
2021-12-13 10:32:03 +00:00
James Rodewig
cec4a79a8e [DOCS] Fix formatting typo 2021-12-09 15:46:44 -05:00
Przemyslaw Gomulka
09dc47f352
[DOCS] Add description of X-Opaque-ID and trace.id (#81433)
Documenting headers x-opaque-id and traceparent (emitted as trace.id)
and their usage in logs
2021-12-09 15:54:05 +01:00
James Rodewig
03afcf1a88
[DOCS] Clarify rolling upgrade support for minor versions (#81444)
We only support rolling upgrades between minor versions of the same
major.

It also adds some tags to reuse duplicated content.
2021-12-09 09:04:10 -05:00
Olivier Cavadenti
90e4e8ce63
Add index pressure stats in cluster stats (#80303)
`GET _nodes/stats` returns statistics about indexing pressure for each node.
With this commit `GET _cluster/stats` now returns stats about indexing pressure
computed by aggregating the indexing pressure stats of each node in the
cluster.

Closes #79788
2021-12-09 12:41:08 +00:00
David Turner
7d69f1a974
Oversharding is also indices and fields (#81511)
Today the _Size your shards_ docs focus on shard size and count, but in
fact index count and field count are also important. This commit expands
these docs a bit to cover this observation too.
2021-12-09 08:51:36 +00:00
David Turner
5b9ce9e820
Remove dead code from same-shard decider (#81520)
Today the same-shard allocation decider falls back to checking the
hostname if the node has no host address. In practice nodes will always
have an address so the fallback is dead code. This commit removes that
dead code.

Relates #80702 which will add the ability to distinguish nodes by
hostname regardless of whether they have an address or not, and #80767
which optimizes this area of code - this refactoring should make the
optimization simpler.
2021-12-09 08:42:25 +00:00
David Kyle
3c974a1e5d
[ML][DOCS] Remove orphaned GET deployment stats doc (#81505) 2021-12-09 08:32:33 +00:00