Commit graph

9899 commits

Author SHA1 Message Date
James Rodewig
c672eca54d
[DOCS] Fix intro sentence for Docker install instructions (#83939) (#84822)
Fixes an intro sentence for the Docker install instructions.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
(cherry picked from commit 472a7d8e91)

Co-authored-by: Alexander Reelsen <alexander@reelsen.net>
2022-03-09 11:29:20 -05:00
Dimitris Athanasiou
4eaedb265d
[ML] Only one of inference_threads and model_threads may be great… (#84794)
Starting a trained model deployment the user may set values for `inference_threads`
of `model_threads`. The first improves latency whereas the latter improves throughput.
It is easier to reason on how a model allocation uses resources if we ensure only
one of those two may be greater than one. In addition, it allows us to distribute
the cores of the ML nodes in the cluster across the model allocations in the future.

This commit adds a validation that prevents both `inference_threads` and `model_threads`
to be greater than one.
2022-03-09 16:33:35 +02:00
James Rodewig
93680e0fdb
[DOCS] Remove 8.1.0 coming tag (#84169) (#84812)
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
(cherry picked from commit 813fca0716)

# Conflicts:
#	docs/reference/migration/migrate_8_1.asciidoc
2022-03-09 08:51:12 -05:00
David Turner
76b05bfd8e
Note that S3 compat includes performance (#84798)
Today the note in the docs about S3-compatible repositories notes that
the repo must behave correctly, but it's also important that it has the
same performance profile. This commit extends the docs to include this
info.
2022-03-09 12:28:17 +00:00
David Kyle
27ae82139a
[ML] Add throughput stats for Trained Model Deployments (#84628)
Throughput is measured as the number of inference requests 
processed per minute. The node level stats peak_throughput_per_minute, 
throughput_last_minute and average_inference_time_ms_last_minute are 
added with a deployment level stat peak_throughput_per_minute which
 is the summed throughput of all nodes.
2022-03-08 11:06:36 +00:00
Dan Roscigno
37beabcb2e
reorder and merge data management and ILM doc pages (#84679)
* reorder and merge data management and ILM doc pages

* update abbreviated titles
2022-03-07 18:33:28 -05:00
Dan Roscigno
52a1653e3f
add data tier configuration information for Cloud (#84518)
* add data tier configuration information for Cloud

* Move configuration docs and add headings

* update ILM tutorial image

* add save changes step

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
2022-03-07 18:13:58 -05:00
Nik Everett
de5ca3cfaa
fixed typo (#84694) (#84726)
Co-authored-by: Mustafa Balila (rootsofnull) <hitsugayatoshiro899@gmail.com>
2022-03-07 14:30:51 -05:00
Lisa Cawley
7ee369f263
[DOCS] Add ML PRs to 8.1 release notes (#84720) 2022-03-07 10:09:18 -08:00
Abele Mălan
9ecb96fcf3
Fix some typos in plugins & reference docs (#84667)
This pull request removes a few instances of duplicate words or
punctuation and erroneous spelling from the docs.
2022-03-07 12:29:58 -05:00
er0k
26307bbef3
[DOCS] Update install instructions for Debian/Ubuntu (#84645)
The use of `apt-key` is deprecated and will no longer be available after
Debian 11 and Ubuntu 22.04. This updates the installation instructions
for Debian-based distributions.

Closes #84644
2022-03-07 12:24:50 -05:00
Ievgen Degtiarenko
8d637f588f
shards allocation health indicator services (#83513)
Add a health indicator implementations that checks shards status 
and report their health status based on availability
2022-03-07 09:31:33 +01:00
James Rodewig
8322809607
Typo in example of the filter_path option (#84551) (#84661)
(cherry picked from commit b637d8bc70)

Co-authored-by: Stéphane Campinas <stephane.campinas@gmail.com>
2022-03-04 09:10:55 -05:00
Benjamin Trent
cf151b53fe
[ML] adds new change_point pipeline aggregation (#83428)
adds a new `change_point` sibling pipeline aggregation.

This aggregation detects a change_point in a multi-bucket aggregation. 

Example:
```
POST kibana_sample_data_flights/_search
{
  "size": 0,
  "aggs": {
    "histo": {
      "date_histogram": {
        "field": "timestamp",
        "fixed_interval": "3h"
      },
      "aggs": {
        "ticket_price": {
          "max": {
            "field": "AvgTicketPrice"
          }
        }
      }
    },
    "changes": {
      "change_point": {
        "buckets_path": "histo>ticket_price"
      }
    }
  }
}
```

Response
```
{
  /*<snip>*/ 
  "aggregations" : {
    "histo" : {
      "buckets" : [ /*<snip>*/ ]
    },
    "changes" : {
      "bucket" : {
        "key" : "2022-01-28T23:00:00.000Z",
        "doc_count" : 48,
        "ticket_price" : {
          "value" : 1187.61083984375
        }
      },
      "type" : {
        "distribution_change" : {
          "p_value" : 0.023753965139433175,
          "change_point" : 40
        }
      }
    }
  }
}
```
2022-03-04 07:00:58 -05:00
Ioannis Kakavas
121014d73f
Dix docker instructions for elasticsearch (#84604)
We suggest users should run the containers in a specific network
but we didn't include the command to create the network. This commit
addresses that.
2022-03-03 12:10:59 +02:00
Benjamin Trent
b592d2bf01
New random_sampler aggregation for sampling documents in aggregations (#84363)
This adds a new sampling aggregation that performs a background sampling over all documents in an index. 

The syntax is as follows:
```
{
  "aggregations": {
    "sampling": {
      "random_sampler": {
        "probability": 0.1
      },
      "aggs": {
        "price_percentiles": {
          "percentiles": {
            "field": "taxful_total_price"
          }
        }
      }
    }
  }
}
```

This aggregation provides fast random sampling over the entire document set in order to speed up costly aggregations.

Testing this over a variety of aggregations and data sets, the median speed up when sampling at `0.001` over millions of documents is around 70X speed improvement.

Relative error rate does rely on the size of the data and the aggregation kind. Here are some typically expected numbers when sampling over 10s of millions of documents. `p` is the configured probability and `n` is the number of documents matched by your provided filter query.
2022-03-02 14:32:30 -05:00
Lisa Cawley
cae3a662dc
[DOCS] Refresh automated screenshots (#84543) 2022-03-02 09:30:07 -08:00
Johannes Mahne
26778dfb25 Docs: Fixing formatting on slow log page (#84509)
Trying to fix some of the formatting in the last paragraph
2022-03-01 14:05:34 -05:00
James Rodewig
dd99723470
[DOCS] Remove 8.0.1 coming tag (#84297) (#84515)
(cherry picked from commit 10bca1c7e8)
2022-03-01 12:11:16 -05:00
James Rodewig
74e4add3a8
[DOCS] Update sum aggregation for histograms (#84493) (#84496)
Fixes an error and test snippets for the sum aggregation example for histograms.

Closes #84491

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
(cherry picked from commit fb45ac9dea)

Co-authored-by: Maja Grubic <maja.grubic@elastic.co>
2022-03-01 08:42:05 -05:00
Benjamin Trent
45deac4c96
[ML] add windowing support for text_classification (#83989)
This commit adds initial windowing support for text_classification tasks.

Specifically, a user can now indicate a span (non-negative) indicating the tokenization windowing span when creating
sub-sequences.

Default value is span: -1 indicates that no windowing should take place.
2022-03-01 08:29:12 -05:00
Andrei Dan
23a7b6bb4d
DOCS: more visibility over how min_age works when rollover is present (#84273) 2022-03-01 11:32:38 +00:00
Julie Tibshirani
713017f0e3
Improve readability of field retrieval docs (#84373)
* Collapse more specialized sections around nested fields, unmapped fields, and
  ignored values
* Move information on metadata fields to a 'note' and streamline it a bit

Closes #82983.
2022-02-28 09:52:39 -08:00
James Rodewig
a837e98b32
[DOCS] System indices no longer accessible 8.0 (#84377) (#84438)
We know that we plan to remove direct access to system indices, but we aren't sure what major version that change will fall in. This updates the docs to avoid any confusion in the meantime.

Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>
(cherry picked from commit ffd21e5259)

Co-authored-by: Stef Nestor <steffanie.nestor@gmail.com>
2022-02-28 12:02:54 -05:00
Leaf-Lin
456aca8b1a
[Docs] Distinguish warm vs cold vs frozen tiers (#84155)
Today, the description at the top of this data tier page can be confusing for users on how to choose between warm vs cold or cold vs frozen. 

I added some descriptions around cold and frozen tiers to emphasize the ability of use of searchable snapshot and their storage benefit based on https://www.elastic.co/elasticsearch/elasticsearch-searchable-snapshots and https://www.elastic.co/blog/introducing-elasticsearch-frozen-tier-searchbox-on-s3. Feel free to reword it.

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
Co-authored-by: James Rodewig <james.rodewig@elastic.co>
2022-02-28 11:19:43 -05:00
James Rodewig
6f5541a9d6
[DOCS] Update CCS forward compatibility docs (#84055)
Documents the following:

* FWC for CCS within the same major version.
* A local cluster running the last minor of a major can search a remote cluster running any minor in the following major.
* Only features that exist across all searched clusters are supported.
2022-02-28 08:18:04 -05:00
Mary Gouseti
ed0bb2a8af
Push back excessive requests for stats (#83832)
Resolves #51992
2022-02-28 08:46:18 +01:00
Ioannis Kakavas
8cebba5708
Instructions to retrieve keystore pwd (#84340)
In 8.0.0. we introduce TLS autoconfiguration. We store the key and
certificate materials in password protected PKCS#12 keystores and
we store these passwords in the elasticsearch keystore.
This commit adds instructions on how to get hold of the passwords
for users to inspect or alter the PKCS#12 keystores.
2022-02-25 15:27:06 +02:00
Julie Tibshirani
95be11f6fb
Clarify docs on field type families (#84368)
There has been some confusion over the definition of a field type family. This
PR clarifies the definition in the docs: the two types should have the exact
same search behavior (including supporting the same queries/ aggs, and producing
the same response). It's not sufficient for them to just support the samme
search operations.

This change also fixes an inaccurate statement that there is only one field type
family so far.
2022-02-24 13:27:36 -08:00
Julie Tibshirani
d9ef39f7c2
Remove 'under development' note in suggester docs (#84366)
In the intro, we mention that parts of the feature are still under development.
This is not very helpful information for users, and could give the wrong
impression about its maturity.
2022-02-24 13:27:03 -08:00
István Zoltán Szabó
614f089e82
[DOCS] Expands description of the reset transform API (#84270) 2022-02-24 10:01:28 +01:00
James Rodewig
5bcbccfac8
[DOCS] Add 8.0.1 release notes (#84213) (#84292)
Update docs for v8.0.1-BC2 release

(cherry picked from commit 7809c8e515)

# Conflicts:
#	docs/reference/release-notes.asciidoc

Co-authored-by: Salvatore Campagna <93581129+salvatore-campagna@users.noreply.github.com>
2022-02-23 12:08:29 -05:00
James Rodewig
2437e7d36f
[DOCS] Fix upgrade docs for 8.x (#84076)
The current upgrade docs contain some language that's only applicable to 8.0. This updates the docs so they're usable in other 8.x branches.
2022-02-23 10:53:18 -05:00
James Rodewig
d3d468e5f1
[DOCS] Update screenshots for ingest pipeline docs (#83845)
https://github.com/elastic/kibana/pull/101216 adds a new ECS mapper feature to the Ingest Pipelines UI. This updates the ES docs to cover the new feature.
2022-02-23 10:50:02 -05:00
James Rodewig
80be0c6abf
[DOCS] Update 8.1.0 release notes (#84185) (#84302)
(cherry picked from commit 297b69ed6c)

# Conflicts:
#	docs/reference/release-notes/8.1.asciidoc

Co-authored-by: Ievgen Degtiarenko <ievgen.degtiarenko@elastic.co>
2022-02-23 10:22:26 -05:00
Nhat Nguyen
86964c9752
Document partial search results with skip_unavailable (#84057)
This commit adds an explanation for the relation between `allow_partial_search_results` and `skip_unavailable` in CCS requests.

Relates to #33915

Closes #82407

Co-authored-by: James Rodewig <james.rodewig@elastic.co>
2022-02-23 10:04:52 -05:00
István Zoltán Szabó
030d87074b
[DOCS] Adds reference of transform reset option to the tutorial (#84194) 2022-02-23 10:00:44 +01:00
Nhat Nguyen
31d703f24c
Introduce lookup runtime fields (#82385)
This PR introduces the lookup runtime fields which are used to retrieve 
data from the related indices. The below search request enriches its
search hits with the location of each IP address from the `ip_location`
index.

```
POST logs/_search
{
  "runtime_mappings": {
    "location": {
      "type": "lookup",
      "lookup_index": "ip_location",
      "query_type": "term",
      "query_input_field": "ip",
      "query_target_field": "_id",
      "fetch_fields": [
        "country",
        "city"
      ]
    }
  },
  "fields": [
    "timestamp",
    "message",
    "location"
  ]
}
```

Response:

```
{
  "hits": {
    "hits": [
      {
        "_index": "logs",
        "_id": "1",
        "fields": {
          "location": [
            {
              "city": [ "Montreal" ],
              "country": [ "Canada" ]
            }
          ],
          "message": [ "the first message" ]
        }
      }
    ]
  }
}
```
2022-02-22 21:36:19 -05:00
DeDe Morton
08c0fcd91e
Clarify docs about CA fingerprints and Fleet (#84056)
Co-authored-by: James Rodewig <james.rodewig@elastic.co>
Co-authored-by: Ioannis Kakavas <ikakavas@protonmail.com>
2022-02-22 15:02:46 -08:00
Ioannis Kakavas
c923bd235b
Package installation docs minor change (#84177)
Make it clearer that security configuration happens on installation
and not on first start of Elasticsearch.
2022-02-22 18:54:26 +02:00
Lisa Cawley
5ae96e5f23
[DOCS] Add read_pipeline privilege to transform setup (#84151) 2022-02-22 07:47:44 -08:00
Yannick Welsch
083bb8a3fd
Add extra section on doc-value-only fields to documentation (#84209)
Adds a dedicated section for doc-value-only fields to the docs that can be linked to.
2022-02-22 11:46:10 +01:00
David Turner
02f38e3da9
Make allocation explanations more actionable (#83983)
The cluster allocation explain API includes a top-level status
indicating to the user whether the shard can be assigned/rebalanced/etc
or not. Today this status is fairly terse and experience shows that
users sometimes struggle to understand how to interpret it and to decide
on follow-up actions.

This commit makes the top-level explanation more detailed and
actionable. For instance, in the cases like `THROTTLED` where the status
is transient we instruct the user to wait; if a shard is lost we say to
restore it from a snapshot; if a shard cannot be assigned we say to
choose a specific node where its assignment is expected and to address
the obstacles.

Co-authored-by: James Rodewig <james.rodewig@elastic.co>
2022-02-22 09:23:01 +00:00
Joe Gallo
119fbcf64e
[DOCS] Fix shrink index API prereqs (#84197) 2022-02-21 16:17:49 -05:00
Justin Cranford
d103af2d06
Add JWT realm support for JWT validation (#83155) 2022-02-21 01:54:39 -05:00
Ioannis Kakavas
6e530c0383
Adjust auto-configuration related docs (#84080)
* Add a note that the http_ca.crt certificate that is generated and
stored in config/certs can be used to configure any client to trust
the certificate that elasticsearch uses for TLS on the HTTP layer
* Add a note that the elasticsearch-create-enrollment-token CLI
tool can only be used with auto-configured TLS settings.
2022-02-20 10:58:46 +02:00
James Rodewig
6ad3f8bfdd
[DOCS] Clarify orientation usage for WKT and GeoJSON polygons (#84025)
Clarifies that the `orientation` mapping parameter only applies to WKT polygons. GeoJSON polygons use a default orientation of `RIGHT`, regardless of the mapping parameter.

Also notes that the document-level `orientation` parameter overrides the default orientation for both WKT and GeoJSON polygons.

Closes https://github.com/elastic/elasticsearch/issues/84009.
2022-02-17 10:33:06 -05:00
James Rodewig
8ff1336127
[DOCS] Remove note about partial response from Bulk API docs (#84053)
The bulk API response with a `200 OK` HTTP status always returns an entry for each action in the request. Partial responses aren't applicable.
2022-02-17 09:20:17 -05:00
James Rodewig
f9a64b2e86
[DOCS] Fix ignore_unavailable parameter definition (#84071)
The current `ignore_unavailable` definition is a bit misleading. The parameter primarily determines if a request that targets a missing or closed index returns an error.
2022-02-17 08:24:06 -05:00
Elasticsearch addict
4e55efc09d
Update Lucene analysis base url (#84094)
Moving to Lucene 9 slightly changed the base url for the analysis components
which this PR fixes.
2022-02-17 12:44:12 +01:00