* Remove `es-test-dir` book-scoped variable
* Remove `plugins-examples-dir` book-scoped variable
* Remove `:dependencies-dir:` and `:xes-repo-dir:` book-scoped variables
- In `index.asciidoc`, two variables (`:dependencies-dir:` and `:xes-repo-dir:`) were removed.
- In `sql/index.asciidoc`, the `:sql-tests:` path was updated to use the full path.
- In `esql/index.asciidoc`, the `:esql-tests:` path was updated in the same way.
* Replace `es-repo-dir` with `es-ref-dir`
* Move `:include-xpack: true` to the few files that use it and remove it from `index.asciidoc`
**Problem:**
For historical reasons, source files for the Elasticsearch Guide's security, watcher, and Logstash API docs are housed in the `x-pack/docs` directory. This can confuse new contributors who expect Elasticsearch Guide docs to be located in `docs/reference`.
**Solution:**
- Move the security, watcher, and Logstash API doc source files to the `docs/reference` directory
- Update doc snippet tests to use security
Rel: https://github.com/elastic/platform-docs-team/issues/208
This PR adds a new field, `_meta`, to the data frame
analytics configuration.
The `_meta` field stores an arbitrary key-value map.
Keys are strings. Values are arbitrary objects
(possibly also maps).
The `_meta` field can be updated using the data frame
analytics `_update` endpoint.
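A minimal sketch of setting `_meta` on creation and updating it later (the job ID and map keys are illustrative):

```js
PUT _ml/data_frame/analytics/my-analytics-job
{
  // <snip> source, dest, analysis configuration </snip>
  "_meta": {
    "created_by": "data-science-team",
    "environment": { "tier": "staging" } // values may themselves be maps
  }
}

POST _ml/data_frame/analytics/my-analytics-job/_update
{
  "_meta": {
    "created_by": "data-science-team",
    "environment": { "tier": "production" }
  }
}
```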
Adds a `force` parameter to the delete trained models API which, when set
to `true`, allows deletion of a model that is referenced by ingest
pipelines or has a started deployment.
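A sketch of the call (the model ID is illustrative):

```js
DELETE _ml/trained_models/my-model?force=true
```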
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
This deprecates `estimated_heap_memory_usage_bytes` on model put and replaces it with `model_size_bytes`.
On GET, only `model_size_bytes` is returned unless v7 REST API compatibility is requested.
For the ml/info API, only `model_size_bytes` is returned.
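A sketch of the new field on put (the model ID and size are illustrative):

```js
PUT _ml/trained_models/my-model
{
  // <snip> rest of the model configuration </snip>
  "model_size_bytes": 265632637 // replaces the deprecated estimated_heap_memory_usage_bytes
}
```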
A forward-port of: #80545
This commit adds a new field, `deployment_stats`, that is optionally set for models that are deployed.
If a model does not have a deployment, it will be `null`.
Also removes the get deployment stats API and makes the deployment stats action internal only.
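An abridged sketch of how the field surfaces in the stats response (the ID and contents are illustrative):

```js
GET _ml/trained_models/my-model/_stats

{
  "trained_model_stats": [
    {
      "model_id": "my-model",
      "deployment_stats": { // null when the model is not deployed
        "state": "started"
        // <snip> per-allocation stats </snip>
      }
    }
  ]
}
```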
When a deployment is started, we do not validate that the definition
documents are all present and not truncated. This commit adds a
validation on `_start` that prevents a bad state from occurring where the
deployment starts but the model is incorrectly defined, or some unknown
error occurs too late in the deployment process.
Implements a `force` parameter to the stop deployment API.
This allows a user to forcefully stop a deployment. Currently,
this specifically allows stopping a deployment that is in use
by ingest processors.
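A sketch of the forceful stop (the model ID is illustrative):

```js
POST _ml/trained_models/my-model/deployment/_stop?force=true
```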
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Removes `testenv` annotations and related code. These annotations originally let you skip x-pack snippet tests in the docs. However, that's no longer possible.
Relates to #79309, #31619
Adds `start_time` to the get deployment stats API for the deployment
and each allocation.
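An abridged sketch of where the field appears (the ID and timestamps are illustrative):

```js
{
  "deployment_stats": {
    "model_id": "my-model",
    "start_time": 1632847139000, // epoch milliseconds for the deployment
    "nodes": [
      {
        // <snip> node identity </snip>
        "start_time": 1632847141000 // each allocation reports its own start time
      }
    ]
  }
}
```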
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
The `_infer` endpoint has changed its format.
Also, the result formats for the various tasks have changed. This updates the docs to match what is currently in 8.0.0.
* [ML] add documentation for get deployment stats API
* Apply suggestions from code review
Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
Zero-Shot classification allows for text classification tasks without a pre-trained collection of target labels.
This is achieved through models trained on the Multi-Genre Natural Language Inference (MNLI) dataset. This dataset pairs text sequences with "entailment" clauses. An example could be:
"Throughout all of history, man kind has shown itself resourceful, yet astoundingly short-sighted" could have been paired with the entailment clauses: ["This example is history", "This example is sociology"...].
This training set combined with the attention and semantic knowledge in modern day NLP models (BERT, BART, etc.) affords a powerful tool for ad-hoc text classification.
See https://arxiv.org/abs/1909.00161 for a deeper explanation of the MNLI training and how zero-shot works.
The zero-shot classification task is configured as follows:
```js
{
// <snip> model configuration </snip>
"inference_config" : {
"zero_shot_classification": {
"classification_labels": ["entailment", "neutral", "contradiction"], // <1>
"labels": ["sad", "glad", "mad", "rad"], // <2>
"multi_label": false, // <3>
"hypothesis_template": "This example is {}.", // <4>
"tokenization": { /*<snip> tokenization configuration </snip>*/}
}
}
}
```
* <1> For all zero-shot models, these three labels are returned when classifying the target sequence. "entailment" is the positive case, "neutral" the case where the sequence is neither positive nor negative, and "contradiction" is the negative case
* <2> An optional parameter providing the default labels that zero-shot classification attempts to assign
* <3> Whether the returned probabilities should assume a single true label or allow multiple true labels
* <4> The hypothesis template used when tokenizing the labels. When combined with `sad`, the sequence looks like `This example is sad.`
For inference in a pipeline, one may provide label updates:
```js
{
//<snip> pipeline definition </snip>
"processors": [
//<snip> other processors </snip>
{
"inference": {
// <snip> general configuration </snip>
"inference_config": {
"zero_shot_classification": {
"labels": ["humanities", "science", "mathematics", "technology"], // <1>
"multi_label": true // <2>
}
}
}
}
//<snip> other processors </snip>
]
}
```
* <1> The `labels` we care about; these replace the default ones if set
* <2> Whether the results may contain multiple true labels
Similarly, one may provide label changes against the `_infer` endpoint:
```js
{
"docs":[{ "text_field": "This is a very happy person"}],
"inference_config":{"zero_shot_classification":{"labels": ["glad", "sad", "bad", "rad"], "multi_label": false}}
}
```
This commit removes the ability to set the vocabulary location in the model config.
Instead, sane defaults are set and used, and the vocabulary upload is wrapped up in an API.
The index is now always the internally managed `.ml-inference-native` index
and the document ID is always `<model_id>_vocabulary`.
This API only works for pytorch/NLP-type models.
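A sketch of the vocabulary API (the model ID and tokens are illustrative):

```js
PUT _ml/trained_models/my-nlp-model/vocabulary
{
  "vocabulary": ["[PAD]", "[UNK]", "[CLS]", "[SEP]", "the", "quick", "brown"]
}
```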
Previously, if a model failed to be allocated on any node, the deployment failed.
This commit allows an allocation to be `partially_started` and indicates its
current state via a new state value in the deployment stats API.
Additionally, when starting a deployment, the user may specify `wait_for` as
`starting`, `partially_started`, or `started`, and the API will block (as long as the timeout doesn't expire) until that state is reached.
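A sketch of the start call with the new parameter (the model ID and timeout are illustrative):

```js
POST _ml/trained_models/my-model/deployment/_start?wait_for=partially_started&timeout=30s
```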
This new boolean parameter allows users to put in a compressed model
without it having to be inflated on the master node during the put
request.
This is useful for system/module setup: the model is validated and
fully parsed later, when it is loaded on a node for use.
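A minimal sketch; the parameter name `defer_definition_decompression` is an assumption here, not stated in this message:

```js
PUT _ml/trained_models/my-model?defer_definition_decompression=true
{
  "compressed_definition": "...", // base64-encoded, compressed model definition
  // <snip> rest of the model configuration </snip>
}
```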