elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-06-29 01:44:36 -04:00

Author	SHA1	Message	Date
bumjin	c226293af3	[DOCS] Fix typo in Java HLRC docs (#60863 )	2020-08-18 08:39:34 -04:00
James Rodewig	a94e5cb7c4	[DOCS] Replace Wikipedia links with attribute (#61171 )	2020-08-17 09:44:24 -04:00
James Rodewig	9b9e0b7b16	[DOCS] Remove search request body page (#60972 )	2020-08-11 12:05:54 -04:00
Mark Tozzi	fbc8950af5	[DOCS] Update snapshot repo usage (#60791 ) Clarify how to use our snapshot repository. Several folks were confused about this just now, including myself.	2020-08-06 12:16:01 -04:00
Alexander Reelsen	c7ac9e7073	[DOCS] http -> https, remove outdated plugin docs (#60380 ) Plugin discovery documentation contained information about installing Elasticsearch 2.0 and installing an oracle JDK, both of which is no longer valid. While noticing that the instructions used cleartext HTTP to install packages, this commit replaces HTTPs links instead of HTTP where possible. In addition a few community links have been removed, as they do not seem to exist anymore.	2020-07-31 15:58:38 -04:00
lcawl	b926bf006f	[DOCS] Minor typo fix	2020-07-22 08:39:42 -07:00
Przemysław Witek	2a12dcf2e0	Rename binary_soft_classification evaluation to outlier_detection (#59951 )	2020-07-21 14:27:57 +02:00
David Kyle	2c879eb618	Fix broken links to aggregation javadoc (#59083 ) Fixes links from the Java High Level Rest Client to the aggregations java docs	2020-07-09 16:52:29 +01:00
Dimitris Athanasiou	da0249f6c2	[ML] Data frame analytics max_num_threads setting (#59254 ) This adds a setting to data frame analytics jobs called `max_number_threads`. The setting expects a positive integer. When used the user specifies the max number of threads that may be used by the analysis. Note that the actual number of threads used is limited by the number of processors on the node where the job is assigned. Also, the process may use a couple more threads for operational functionality that is not the analysis itself. This setting may also be updated for a stopped job. More threads may reduce the time it takes to complete the job at the cost of using more CPU.	2020-07-09 16:31:26 +03:00
David Kyle	49f9431fe7	Add Inference Pipeline aggregation to HLRC (#59086 ) Adds InferencePipelineAggregationBuilder to the HLRC duplicating the server side classes	2020-07-08 21:17:01 +01:00
Przemysław Witek	38aa474dec	Implement pseudo Huber loss (PseudoHuber) evaluation metric for regression analysis (#58734 )	2020-07-01 13:29:56 +02:00
Przemysław Witek	dfa06240fc	Implement MSLE (MeanSquaredLogarithmicError) evaluation metric for regression analysis (#58684 )	2020-06-30 13:06:15 +02:00
weizijun	974f6e66b6	LLRC RequestOptions add RequestConfig (#57972 ) Different kinds of requests may need different request options from the client default. Users can optionally set RequestConfig on a single request's RequestOptions to override the default. Without this, socketTimeout can only set at RestClient initialization.	2020-06-29 11:19:04 -04:00
Przemysław Witek	3953de4c98	Introduce DataFrameAnalyticsConfig update API (#58302 )	2020-06-29 09:26:31 +02:00
Yang Wang	38185e5da0	Add cache for application privileges (#55836 ) Add caching support for application privileges to reduce number of round-trips to security index when building application privilege descriptors. Privilege retrieving in NativePrivilegeStore is changed to always fetching all privilege documents for a given application. The caching is applied to all places including "get privilege", "has privileges" APIs and CompositeRolesStore (for authentication).	2020-06-29 13:59:00 +10:00
Jason Tedor	2e0274266f	Allow follower indices to override leader settings (#58103 ) Today when creating a follower index via the put follow API, or via an auto-follow pattern, it is not possible to specify settings overrides for the follower index. Instead, we copy all of the leader index settings to the follower. Yet, there are cases where a user would want some different settings on the follower index such as the number of replicas, or allocation settings. This commit addresses this by allowing the user to specify settings overrides when creating follower index via manual put follower calls, or via auto-follow patterns. Note that not all settings can be overrode (e.g., index.number_of_shards) so we also have detection that prevents attempting to override settings that must be equal between the leader and follow index. Note that we do not even allow specifying such settings in the overrides, even if they are specified to be equal between the leader and the follower index. Instead, the must be implicitly copied from the leader index, not explicitly set by the user.	2020-06-18 10:55:17 -04:00
David Kyle	bc1883b582	HLRC for delete expired data by job Id (#57722 ) High level rest client changes for #57337	2020-06-11 10:07:31 +01:00
Dimitris Athanasiou	e116ac850f	[ML] Fix race condition when force stopping DF analytics job (#57680 ) When we force delete a DF analytics job, we currently first force stop it and then we proceed with deleting the job config. This may result in logging errors if the job config is deleted before it is retrieved while the job is starting. Instead of force stopping the job, it would make more sense to try to stop the job gracefully first. So we now try that out first. If normal stop fails, then we resort to force stopping the job to ensure we can go through with the delete. In addition, this commit introduces `timeout` for the delete action and makes use of it in the child requests.	2020-06-05 12:13:02 +03:00
Lisa Cawley	8b9293b3bf	[DOCS] Replace docdir attribute with es-repo-dir (#57489 )	2020-06-01 15:55:05 -07:00
Benjamin Trent	251b17009a	[ML] adds new for_export flag to GET _ml/inference API (#57351 ) Adds a new boolean flag, `for_export` to the `GET _ml/inference/<model_id>` API. This flag is useful for moving models between clusters.	2020-05-29 12:29:28 -04:00
Benjamin Trent	ec67787a2e	[ML] add max_model_memory parameter to forecast request (#57254 ) This adds a max_model_memory setting to forecast requests. This setting can take a string value that is formatted according to byte sizes (i.e. "50mb", "150mb"). The default value is `20mb`. There is a HARD limit at `500mb` which will throw an error if used. If the limit is larger than 40% the anomaly job's configured model limit, the forecast limit is reduced to be strictly lower than that value. This reduction is logged and audited. related native change: https://github.com/elastic/ml-cpp/pull/1238 closes: https://github.com/elastic/elasticsearch/issues/56420	2020-05-29 08:59:50 -04:00
Lee Hinman	4dc32611fc	Rename template V2 classes to ComposableTemplate (#57183 ) This PR changes the name of the Index Template V2 classes to "Composable Templates", it also ensures there are no mentions of "V2" in the documentation or error/warning messages. V1 templates are referred to as "legacy" templates. Resolves #56609	2020-05-27 09:32:10 -06:00
Samidh	199ecbbf77	[Docs] Fix typo in start-watch-service.asciidoc (#57182 )	2020-05-27 15:51:07 +02:00
Benjamin Trent	8fed077b0a	[ML] relax throttling on expired data cleanup (#56711 ) Throttling nightly cleanup as much as we do has been over cautious. Night cleanup should be more lenient in its throttling. We still keep the same batch size, but now the requests per second scale with the number of data nodes. If we have more than 5 data nodes, we don't throttle at all. Additionally, the API now has `requests_per_second` and `timeout` set. So users calling the API directly can set the throttling. This commit also adds a new setting `xpack.ml.nightly_maintenance_requests_per_second`. This will allow users to adjust throttling of the nightly maintenance.	2020-05-18 07:21:06 -04:00
Kamyar Ghajar	e9f1896e3d	[Docs] Update multi-search.asciidoc (#56284 ) The documentation shows the wrong command for a multi-search async call.	2020-05-06 16:56:58 +02:00
Andrei Dan	9bcf89b1e2	HLRC: document index template v2 and component template APIs (#56136 ) This documents the index template v2 and component template APIs in the high level rest client.	2020-05-05 16:43:16 +01:00
Dimitris Athanasiou	6bf3834059	[ML] Add loss_function to regression (#56118 ) Adds parameters `loss_function` and `loss_function_parameter` to regression.	2020-05-05 12:36:05 +03:00
Hendrik Muhs	33f134f556	[Transform] add throttling (#56007 ) add throttling to transform, throttling will slow down search requests by delaying the execution based on a documents per second metric. fixes #54862	2020-05-05 11:34:54 +02:00
David Roberts	8906e76079	[ML] Return assigned node in start/open job/datafeed response (#55473 ) Adds a "node" field to the response from the following endpoints: 1. Open anomaly detection job 2. Start datafeed 3. Start data frame analytics job If the job or datafeed is assigned to a node immediately then this field will return the ID of that node. In the case where a job or datafeed is opened or started lazily the node field will contain an empty string. Clients that want to test whether a job or datafeed was opened or started lazily can therefore check for this. Fixes #54067	2020-04-22 08:44:57 +01:00
Yang Wang	92427d3758	Remove local parameter for get field mapping API (#55100 ) The local parameter of get field mapping API is marked as deprecated in 7.x. This PR removes it for v8.0	2020-04-15 12:02:10 +10:00
Yang Wang	6ce88038f2	Deprecate local parameter for get field mapping request (#55014 ) The usage of local parameter for GetFieldMappingRequest has been removed from the underlying transport action since v2.0. This PR deprecates the parameter from rest layer. It will be removed in next major version.	2020-04-12 12:34:44 +10:00
Benjamin Trent	4e1ff31c3c	[ML] add new inference_config field to trained model config (#54421 ) A new field called `inference_config` is now added to the trained model config object. This new field allows for default inference settings from analytics or some external model builder. The inference processor can still override whatever is set as the default in the trained model config.	2020-04-02 10:34:17 -04:00
Nhat Nguyen	ee3d40320a	Broadcast cancellation to only nodes have outstanding child tasks (#54312 ) Today when canceling a task we broadcast ban/unban requests to all nodes in the cluster. This strategy does not scale well for hierarchical cancellation. With this change, we will track outstanding child requests and broadcast the cancellation to only nodes that have outstanding child tasks. This change also prevents a parent task from sending child requests once it got canceled. Relates #50990 Supersedes #51157 Co-authored-by: Igor Motov <igor@motovs.org> Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2020-04-01 11:22:13 -04:00
Christoph Büscher	610b7bdcbf	[Docs] Add HLRC Async Search API documentation (#54353 ) Adds documentation and a corresponding test case containing typical API usage for the Async Search API to the High Level Rest Client.	2020-03-30 15:32:45 +02:00
David Roberts	8ee770560a	[ML] Add a model memory estimation endpoint for anomaly detection (#53507 ) A new endpoint for estimating anomaly detection job model memory requirements: POST _ml/anomaly_detectors/estimate_model_memory Closes #53219	2020-03-24 21:38:19 +00:00
Tom Veasey	58340c2dbe	[ML] Adds the class_assignment_objective parameter to classification (#52763 ) Adds a new parameter for classification that enables choosing whether to assign labels to maximise accuracy or to maximise the minimum class recall. Fixes #52427.	2020-03-12 18:39:29 +00:00
Maxim	b706239145	Add docs for errors in GetAlias API (#51850 ) Closes #31499	2020-02-24 17:16:05 +01:00
Benjamin Trent	1c1d45130c	[ML][Inference] don't return inflated definition when storing trained models (#52573 ) When `PUT` is called to store a trained model, it is useful to return the newly create model config. But, it is NOT useful to return the inflated definition. These definitions can be large and returning the inflated definition causes undo work on the server and client side.	2020-02-20 11:25:34 -05:00
OriGlassman	d16f92bbae	[DOCS] Fixed "SeachRequest" -> "SearchRequest" typo in HLRC docs (#52144 )	2020-02-14 13:41:51 -05:00
Nik Everett	5b2266601b	Implement top_metrics agg (#51155 ) The `top_metrics` agg is kind of like `top_hits` but it only works on doc values so it should be faster. At this point it is fairly limited in that it only supports a single, numeric sort and a single, numeric metric. And it only fetches the "very topest" document worth of metric. We plan to support returning a configurable number of top metrics, requesting more than one metric and more than one sort. And, eventually, non-numeric sorts and metrics. The trick is doing those things fairly efficiently. Co-Authored by: Zachary Tong <zach@elastic.co>	2020-02-14 07:13:52 -05:00
Nik Everett	75d83db8c7	HLRC support for string_stats (#52163 ) This adds a builder and parsed results for the `string_stats` aggregation directly to the high level rest client. Without this the HLRC can't access the `string_stats` API without the elastic licensed `analytics` module. While I'm in there this adds a few of our usual unit tests and modernizes the parsing.	2020-02-12 18:01:14 -05:00
Raidok	74a5683875	[DOCS] Fix "Asynchronous usage" title in HLRC docs (#52017 )	2020-02-07 09:41:58 -05:00
Ioannis Kakavas	308f95d9f3	Add HLRC docs for AuthN and TLS (#51355 ) This commit adds examples in our documentation for - An HLRC instance authenticating to an elasticsearch cluster using an elasticsearch token service access token or an API key - An HLRC instance connecting to an elasticsearch cluster that is setup for TLS on the HTTP layer when the CA certificate of the cluster is available either as a PEM file or a keystore - An HLRC instance connecting to an elasticsearch cluster that requires client authentication where the client key and certificate are available in a keystore Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2020-01-28 19:09:38 +02:00
Benjamin Trent	c9e285c1e6	[ML][Inference] add tags url param to GET (#51330 ) Adds a new URL parameter, `tags` to the GET _ml/inference/<model_id> endpoint. This parameter allows the list of models to be further reduced to those who contain all the provided tags.	2020-01-24 07:30:56 -05:00
Maxim	c01923043d	Add Get Source API to the HLRC (#50885 ) Relates to #47678	2020-01-23 09:32:30 +01:00
Nhat Nguyen	09b46c8646	Goodbye and thank you synced flush! (#50882 ) Synced flush was a brilliant idea. It supports instant recoveries with a quite small implementation. However, with the presence of sequence numbers and retention leases, it is no longer needed. This change removes it from 8.0. Relates #5077	2020-01-16 09:43:07 -05:00
Dimitris Athanasiou	4d2be9bd32	[ML] Add num_top_feature_importance_values param to regression and classi… (#50914 ) Adds a new parameter to regression and classification that enables computation of importance for the top most important features. The computation of the importance is based on SHAP (SHapley Additive exPlanations) method.	2020-01-14 15:01:47 +02:00
Benjamin Trent	4cecb7a5be	[ML][Inference] PUT API (#50852 ) This adds the `PUT` API for creating trained models that support our format. This includes * HLRC change for the API * API creation * Validations of model format and call	2020-01-11 16:02:56 -05:00
Dimitris Athanasiou	af0ce426cc	[ML] Implement force deleting a data frame analytics job (#50553 ) Adds a `force` parameter to the delete data frame analytics request. When `force` is `true`, the action force-stops the jobs and then proceeds to the deletion. This can be used in order to delete a non-stopped job with a single request. Closes #48124	2020-01-03 12:01:41 +02:00
Jake	427bf374cd	[DOCS] Bump copyright to 2019 for Java HLRC license (#50206 )	2019-12-30 15:38:59 -05:00

1 2 3 4 5 ...

533 commits