Commit graph

529 commits

Author SHA1 Message Date
Francisco Fernández Castaño
6ecd80cf05
Add support for index pattern exclusion in CCR AutoFollow (#73765)
This commit adds the ability to specify exclusion patterns in Auto-Follow patterns.
This allows excluding indices that match any of the inclusion patterns and also match
some of the exclusion patterns giving more fine grained control in scenarios where this is important.

Related #67686
Backport of #72935
2021-06-04 15:32:56 +02:00
AndyHunt66
a7b0e2c22c
[DOCS] Remove unneeded phrase (#73752) 2021-06-04 08:49:05 -04:00
David Roberts
8cf1fdcd05
[ML] Make ml_standard tokenizer the default for new categorization jobs (#73605)
Categorization jobs created once the entire cluster is upgraded to
version 7.14 or higher will default to using the new ml_standard
tokenizer rather than the previous default of the ml_classic
tokenizer, and will incorporate the new first_non_blank_line char
filter so that categorization is based purely on the first non-blank
line of each message.

The difference between the ml_classic and ml_standard tokenizers
is that ml_classic splits on slashes and colons, so creates multiple
tokens from URLs and filesystem paths, whereas ml_standard attempts
to keep URLs, email addresses and filesystem paths as single tokens.

It is still possible to config the ml_classic tokenizer if you
prefer: just provide a categorization_analyzer within your
analysis_config and whichever tokenizer you choose (which could be
ml_classic or any other Elasticsearch tokenizer) will be used.

To opt out of using first_non_blank_line as a default char filter,
you must explicitly specify a categorization_analyzer that does not
include it.

If no categorization_analyzer is specified but categorization_filters
are specified then the categorization filters are converted to char
filters applied that are applied after first_non_blank_line.

Backport of #72805
2021-06-02 07:04:16 +01:00
Nhat Nguyen
26d3edd3d6
Add point in time to HLRC (#72167)
Closes #70593
2021-05-12 19:57:11 -04:00
James Rodewig
73e140bdff
[DOCS] Rename mount types for searchable snapshots (#72699) (#72773)
Changes:

* Renames 'full copy searchable snapshot' to 'fully mounted index.'
* Renames 'shared cache searchable snapshot' to 'partially mounted index.'
* Removes some unneeded cache setup instructions for the frozen tier. We added a default cache size with #71844.
2021-05-05 16:46:23 -04:00
Lisa Cawley
58e9bb6ca6
[DOCS] Add runtime_mappings to update datafeed API in HLRC (#71772) (#72110)
Co-authored-by: David Kyle <david.kyle@elastic.co>
2021-04-22 09:52:31 -07:00
Tanguy Leroux
44842ee79e
[7.x] Add Searchable Snapshots Cache Stats API to HLRC (#71920)
In #71701 we added a new REST API that provides statistics 
about the searchable snapshots cache on Frozen Tier.

This commit adds the necessary plumbing to expose this API 
in the High Level REST Client. It also exposes the documentation 
of the Mount Snapshot API that was created in #68949 but not 
made accessible.

Backport of #71858
2021-04-20 14:55:34 +02:00
James Rodewig
71daaba676
[DOCS] Fix create enrich policy API title (#71494) (#71496) 2021-04-08 15:50:54 -04:00
James Rodewig
c757f9e4e7
[DOCS] Fix double spaces (#71082) (#71120) 2021-03-31 11:43:34 -04:00
James Rodewig
f19f44df97
[DOCS] Fix typos in HLRC delete stored script API (#70897) (#70924)
Co-authored-by: Ivan Brusic <ivan@brusic.com>
2021-03-26 12:28:56 -04:00
István Zoltán Szabó
591e93397a
[DOCS] Removes beta labels from DFA related docs. (#70808) (#70902) 2021-03-26 10:25:36 +01:00
Tanguy Leroux
03f736fed6
Add documentation for Clone Snapshot Java API (#70720) (#70819)
This commit adds some missing documentation about 
the Clone Snapshot Java API.

Relates #63863
2021-03-24 16:25:30 +01:00
James Rodewig
302341a526
[DOCS] Replace put with create or update in API names (#70330) (#70421)
Co-authored-by: debadair <debadair@elastic.co>
Co-authored-by: Lisa Cawley <lcawley@elastic.co>
2021-03-15 17:16:13 -04:00
James Rodewig
45d93d9458
[DOCS] Remove added admons (#69452) (#69454) 2021-02-23 10:48:36 -05:00
Lisa Cawley
ebea578071
[DOCS] Add grant API key to Java HLRC (#63783) (#69375)
Co-authored-by: Tim Vernum <tim@adjective.org>
2021-02-22 12:58:33 -08:00
Benjamin Trent
12e2cc8176
[7.x] [ML][HLRC] adds put and delete trained model alias APIs to rest high-level client (#69214) (#69297)
* [ML][HLRC] adds put and delete trained model alias APIs to rest high-level client (#69214)

adds put (and reassign) and delete trained model alias APIs to the rest high-level client.

This adds some serialization objects and request wrappers.
2021-02-22 07:36:34 -05:00
Dimitris Athanasiou
98c69cedce
[7.x][ML] Add runtime mappings to data frame analytics source config … (#69284)
Users can now specify runtime mappings as part of the source config
of a data frame analytics job. Those runtime mappings become part of
the mapping of the destination index. This ensures the fields are
accessible in the destination index even if the relevant data frame
analytics job gets deleted.

Closes #65056

Backport of #69183
2021-02-19 20:17:06 +02:00
Joe Gallo
53bcef0365
Rename max_single_primary_size to max_primary_shard_size (#69239) (#69248) 2021-02-18 22:21:47 -05:00
James Rodewig
7f7dc9dec3
[DOCS] Fix capitalization for Query DSL (#69236) (#69240) 2021-02-18 19:40:36 -05:00
Lisa Cawley
593de53698
Adds retention policy to HLRC put transform API (#68768) (#69146)
Co-authored-by: Hendrik Muhs <hendrik.muhs@elastic.co>
2021-02-17 13:23:26 -08:00
James Rodewig
b55249507e
[DOCS] Fix typos for duplicate words (#69125) (#69132) 2021-02-17 11:16:58 -05:00
Jason Tedor
1b0b5f7f02
Add Searchable Snapshots APIs to HLRC (#68949)
This commit adds the Searchable Snapshots APIs (the mount snapshot API)
to the high-level REST client.
2021-02-12 07:54:35 -05:00
Joe Gallo
5b899dafcd
Add max_single_primary_size as a condition for the rollover index API (#67842) (#68489) 2021-02-04 15:30:10 -05:00
Joe Gallo
4fed1a59da
Support max_single_primary_size in Resize Action and exposed in ILM (#67705) (#68321)
Co-authored-by: bellengao <gbl_long@163.com>
2021-02-01 15:06:50 -05:00
Valeriy Khakhutskyy
4bbd31a268
[7.x][ML] Add early stopping DFA configuration parameter (#68271)
The PR adds early_stopping_enabled optional data frame analysis configuration parameter. The enhancement was already described in elastic/ml-cpp#1676 and so I mark it here as non-issue.

Backport of #68099.
2021-02-01 14:11:06 +01:00
Lisa Cawley
a57db419f4
[DOCS] Remove experimental tag from find structure API (#68153) (#68156) 2021-01-28 13:26:46 -08:00
Dimitris Athanasiou
9e55623c29
[7.x][ML] Expand regression/classification hyperparameters (#67950) (#67983)
Expands data frame analytics regression and classification
analyses with the followin hyperparameters:

- alpha
- downsample_factor
- eta_growth_rate_per_tree
- max_optimization_rounds_per_hyperparameter
- soft_tree_depth_limit
- soft_tree_depth_tolerance

Backport of #67950
2021-01-26 15:48:13 +02:00
Benjamin Trent
a324055310
[7.x] [ML] move find file structure finder in Rest high Level client to its new endpoint and plugin (#67290) (#67510)
* [ML] move find file structure finder in Rest high Level client to its new endpoint and plugin (#67290)

Find file structure finder is now its own plugin, and separated from the ml plugin.

This commit updates the rest high level client to reflect this.

Additionally, this adjusts the internal and client object names from `FileStructure` to the more general `TextStructure`
2021-01-14 09:59:34 -05:00
Yang Wang
f0715f9a4b
Deprecate the id field for the InvalidateApiKey API (#66317) (#66670)
This PR deprecates the usage of the id field in the payload for the
InvalidateApiKey API. The ids field introduced in #63224 is now the recommended
way for performing (bulk) API key invalidation.

This PR also includes the test fix from #66696
2020-12-22 12:56:40 +11:00
David Kyle
5fec2538ca
[ML] Docs and HRLC for datafeed runtime mappings (#65810) (#66007)
For the changes in #65606
2020-12-08 11:04:21 +00:00
Benjamin Trent
39f5f39dc2
[7.x] [ML] add new snapshot upgrader API for upgrading older snapshots (#64665) (#65010)
* [ML] add new snapshot upgrader API for upgrading older snapshots (#64665)

This new API provides a way for users to upgrade their own anomaly job
model snapshots.

To upgrade a snapshot the following is done:
- Open a native process given the job id and the desired snapshot id
- load the snapshot to the process
- write the snapshot again from the native task (now updated via the
  native process)

relates #64154
2020-11-17 11:30:47 -05:00
James Rodewig
944568e3dd
[DOCS] Fix "the the" typos (#64344) (#64353)
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2020-10-29 11:50:02 -04:00
Jake Landis
86834e6b92
[7.x] Update getting-started.asciidoc for Java version (#63106) (#64084)
Update client documentation to state "at least" Java 1.8

Co-authored-by: junmuz <mjunaidmuzammil@gmail.com>
2020-10-27 11:52:02 -05:00
István Zoltán Szabó
b822e582c3
[DOCS] Changes experimental flag to beta in DFA related docs (#63992) (#64176) 2020-10-26 18:04:21 +01:00
Benjamin Trent
b9dc522cb4
[7.x] [ML] adding new flag exclude_generated that removes generated fields in GET config APIs (#63899)(#63092) (#63177)
* [ML] adding for_export flag for ml plugin GET resource APIs (#63092)

This adds the new `for_export` flag to the following APIs:

- GET _ml/anomaly_detection/<job_id>
- GET _ml/datafeeds/<datafeed_id>
- GET _ml/data_frame/analytics/<analytics_id>

The flag is designed for cloning or exporting configuration objects to later be put into the same cluster or a separate cluster.

The following fields are not returned in the objects:

- any field that is not user settable (e.g. version, create_time)
- any field that is a calculated default value (e.g. datafeed chunking_config)
- any field that would effectively require changing to be of use (e.g. datafeed job_id)
- any field that is automatically set via another Elastic stack process (e.g. anomaly job custom_settings.created_by)

closes https://github.com/elastic/elasticsearch/issues/63055

* [ML] adding new flag exclude_generated that removes generated fields in GET config APIs (#63899)

When exporting and cloning ml configurations in a cluster it can be
frustrating to remove all the fields that were generated by
the plugin. Especially as the number of these fields change
from version to version.

This flag, exclude_generated, allows the GET config APIs to return
configurations with these generated fields removed.

APIs supporting this flag:
- GET _ml/anomaly_detection/<job_id>
- GET _ml/datafeeds/<datafeed_id>
- GET _ml/data_frame/analytics/<analytics_id>

The following fields are not returned in the objects:

- any field that is not user settable (e.g. version, create_time)
- any field that is a calculated default value (e.g. datafeed chunking_config)
- any field that is automatically set via another Elastic stack process (e.g. anomaly job custom_settings.created_by)

relates to #63055
2020-10-20 12:42:52 -04:00
Benjamin Trent
b92cbcd41a
[Transform] add new exclude_generated flag to GET transform (#63093) (#63947)
This adds a new flag `exclude_generated` for GET transform API.

This flag is useful for when a transform needs to be cloned within a cluster or exported/imported between clusters.

It removes certain fields that are not able to be set via the PUT api (e.g. version, create_time).

relates https://github.com/elastic/elasticsearch/issues/63055
2020-10-20 12:38:41 -04:00
Lyudmila Fokina
e518bd76e7
Adding authentication information to access token create APIs (#62490) (#63841)
* Adding authentication information to access token create APIs (#62490)

* Adding authentication information to access token create APIs

Adding authentication object to following APIs:
/_security/oauth2/token
/_security/delegate_pki
/_security/saml/authenticate
/_security/oidc/authenticate

Resolves: #59685
(cherry picked from commit 51dbd9e584)

* Addressing PR commends, fixing tests

* Returning tokenGroups attribute as SID string instead of byte array (AD metadata)

Addressing PR comments

* Returning tokenGroups attribute as SID string instead of byte array (AD metadata)

Update version check

* Returning tokenGroups attribute as SID string instead of byte array (AD metadata)

Update version check

* Addressing more PR comments

* Adding more to integration tests + some small fixes

* Nit fixes and formatting following #62490 comments (#63797)

* Nit fixes and formatting following #62490 comments

Resolves: #63792

* Nit fixes and formatting following #62490 comments

Resolves: #63792

* Nit fixes and formatting following #62490 comments
Fixing username

* Nit fixes and formatting following #62490 comments
Fixing formatting

* Fixing merge conflicts

* Fixing merge conflicts
2020-10-16 20:50:03 +02:00
Przemysław Witek
bb7df2eb5f
[ML] Allow setting num_top_classes to a special value -1 (#63587) (#63601) 2020-10-13 14:00:12 +02:00
Przemysław Witek
a97bd5b787
[7.x] [ML] Validate that AucRoc has the data necessary to be calculated (#63302) (#63453) 2020-10-08 09:31:45 +02:00
Lisa Cawley
8f76c89cd3
[7.x][DOCS] Add feature_importance_baseline to get trained model API (#63279) (#63336)
Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com>
2020-10-06 10:08:34 -07:00
Yang Wang
7969fbb4ab
Cache API key doc to reduce traffic to the security index (#59376) (#63319)
Getting the API key document form the security index is the most time consuing part
of the API Key authentication flow (>60% if index is local and >90% if index is remote).
This traffic is now avoided by caching added with this PR.

Additionally, we add a cache invalidator registry so that clearing of different caches will
be managed in a single place (requires follow-up PRs).
2020-10-06 23:49:23 +11:00
Lisa Cawley
22aea11016 [DOCS] Add experimental tag to rollup APIs (#63206) 2020-10-05 13:22:11 -07:00
Lisa Cawley
ce23c38e96
[DOCS] Add find file structure API to HLRC docs (#63212) (#63261) 2020-10-05 11:37:44 -07:00
Lisa Cawley
4de6104dae
[DOCS] Fix titles for ML APIs (#63152) (#63207) 2020-10-02 14:01:01 -07:00
Lisa Cawley
57ea5d27ae [DOCS] Add experimental tag to data frame analytics APIs (#63153) 2020-10-02 09:44:40 -07:00
Benjamin Trent
cfcf973259
[7.x] [ML] renames */inference* apis to */trained_models* (#63097) (#63136)
* [ML] renames */inference* apis to */trained_models* (#63097)

This commit renames all `inference` CRUD APIs to `trained_models`.

This aligns with internal terminology, documentation, and use-cases.
2020-10-02 07:34:28 -04:00
Przemysław Witek
d677a2b8ee
[7.x] [ML] Implement AucRoc metric for classification - HLRC (#62304) (#63058) 2020-09-30 14:04:10 +02:00
Benjamin Trent
e163559e4c
[7.x] [ML] Add new include flag to GET inference/<model_id> API for model training metadata (#61922) (#62620)
* [ML] Add new include flag to GET inference/<model_id> API for model training metadata (#61922)

Adds new flag include to the get trained models API
The flag initially has two valid values: definition, total_feature_importance.
Consequently, the old include_model_definition flag is now deprecated.
When total_feature_importance is included, the total_feature_importance field is included in the model metadata object.
Including definition is the same as previously setting include_model_definition=true.

* fixing test

* Update x-pack/plugin/core/src/test/java/org/elasticsearch/xpack/core/ml/action/GetTrainedModelsRequestTests.java
2020-09-18 10:07:35 -04:00
Lisa Cawley
bc5eec8205
[DOCS] Fix capitalization in HLRC ML APIs (#62010) (#62012) 2020-09-04 16:57:15 -07:00
Benjamin Trent
1ae2923632
[7.x] [ML] adding docs + hlrc for data frame analysis feature_processors (#61149) (#61493)
* [ML] adding docs + hlrc for data frame analysis feature_processors (#61149)

Adds HLRC and some docs for the new feature_processors field in Data frame analytics.

Co-authored-by: Przemysław Witek <przemyslaw.witek@elastic.co>
Co-authored-by: Lisa Cawley <lcawley@elastic.co>
2020-08-24 12:56:21 -04:00