elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-06-29 01:44:36 -04:00

Author	SHA1	Message	Date
Liam Thompson	33a71e3289	[DOCS] Refactor book-scoped variables in `docs/reference/index.asciidoc` (#107413 ) * Remove `es-test-dir` book-scoped variable * Remove `plugins-examples-dir` book-scoped variable * Remove `:dependencies-dir:` and `:xes-repo-dir:` book-scoped variables - In `index.asciidoc`, two variables (`:dependencies-dir:` and `:xes-repo-dir:`) were removed. - In `sql/index.asciidoc`, the `:sql-tests:` path was updated to fuller path - In `esql/index.asciidoc`, the `:esql-tests:` path was updated idem * Replace `es-repo-dir` with `es-ref-dir` * Move `:include-xpack: true` to few files that use it, remove from index.asciidoc	2024-04-17 14:37:07 +02:00
István Zoltán Szabó	c34e0c0746	[DOCS] Clarifies that inference input must be single string (#101301 )	2023-10-25 17:18:05 +02:00
István Zoltán Szabó	446ac9f378	[DOCS] Updates ELSER tutorial with inference processor changes (#100420 ) Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>	2023-10-11 17:33:20 +02:00
David Kyle	6cde0df463	[ML] More checks and tests for parsing Inference processor config (#100335 ) Following on from #100205 this PR adds more tests and checks for corner cases when parsing the configuration.	2023-10-06 15:10:45 +01:00
David Kyle	b055204b43	[ML] Simplify the Inference Ingest Processor configuration (#100205 ) Adds a `input_ouput` option the removes the need for a `field_map` and/or target fields. Multiple inputs can be specified in `input_output`	2023-10-03 18:42:31 +01:00
István Zoltán Szabó	e0cc375b14	[DOCS] Adds text_expansion config to inference processor reference docs. (#99900 )	2023-09-26 12:58:19 +02:00
István Zoltán Szabó	b164555072	[DOCS] Adds deployment ID param documentation to trained model APIs (#96174 )	2023-05-17 15:56:58 +02:00
Benjamin Trent	258d2b71e2	[ML] add roberta/bart docs (#85001 ) adds roberta section to NLP tokenization documentation.	2022-03-17 12:14:57 -04:00
Benjamin Trent	45deac4c96	[ML] add windowing support for text_classification (#83989 ) This commit adds initial windowing support for text_classification tasks. Specifically, a user can now indicate a span (non-negative) indicating the tokenization windowing span when creating sub-sequences. Default value is span: -1 indicates that no windowing should take place.	2022-03-01 08:29:12 -05:00
David Kyle	1473b09415	[ML] Add NLP inference configs to the inference processor docs (#82320 )	2022-01-11 08:50:45 +00:00
Lisa Cawley	076343933f	[DOCS] Update link in inference processor (#81897 )	2021-12-17 15:49:59 -08:00
Lisa Cawley	b18f5fd2c6	[DOCS] Fixes link to language identification example (#81347 )	2021-12-03 17:21:04 -08:00
James Rodewig	f56a0f4b66	[DOCS] Remove `testenv` annotations from doc snippet tests (#80023 ) Removes `testenv` annotations and related code. These annotations originally let you skip x-pack snippet tests in the docs. However, that's no longer possible. Relates to #79309, #31619	2021-11-05 18:38:50 -04:00
István Zoltán Szabó	6a4de77e11	[DOCS] Adds classification and regression links back to DFA docs. (#74930 )	2021-07-08 16:37:16 +02:00
István Zoltán Szabó	6c6e6874ff	[DOCS] Removes link to classification and regression. (#74926 )	2021-07-05 16:28:14 +02:00
István Zoltán Szabó	9a8c6fb66f	[DOCS] Removes beta labels from DFA related docs. (#70808 )	2021-03-26 09:46:41 +01:00
Lisa Cawley	efa9b095aa	[DOCS] Adds model alias to inference processor and agg (#69576 )	2021-02-24 13:12:39 -08:00
István Zoltán Szabó	d3ad9fe632	[DOCS] Improves inference processor linking and docs (#66119 )	2021-01-05 09:42:06 +01:00
István Zoltán Szabó	6093518f4a	[DOCS] Changes experimental flag to beta in DFA related docs (#63992 )	2020-10-26 17:02:46 +01:00
Lisa Cawley	ecf9e929ba	[DOCS] Add experimental tag to inference processor and bucket aggregation (#63023 )	2020-09-30 07:20:38 -07:00
James Rodewig	4eb09cb31e	[DOCS] Fix case of ingest processor titles (#61024 ) Converts page headings to sentence case. Adds a title abbreviation.	2020-08-12 11:28:00 -04:00
David Kyle	bf245e4c07	Make Inference processor field_map and inference_config optional (#58868 ) Relaxes the requirement that the inference ingest processor must has a field_map and inference_config defined even if they are empty.	2020-07-03 08:36:57 +01:00
István Zoltán Szabó	d0042fb791	[DOCS] Updates results_field description in the inference processor docs (#58554 )	2020-06-29 11:28:17 +02:00
Lisa Cawley	8b9293b3bf	[DOCS] Replace docdir attribute with es-repo-dir (#57489 )	2020-06-01 15:55:05 -07:00
István Zoltán Szabó	ca2f98382f	[DOCS] Changes feature importance links to point to the new page (#55531 ) * [DOCS] Changes feature importance links to point to the new page. * [DOCS] Fixes line breaks.	2020-04-28 09:02:14 +02:00
Benjamin Trent	c1afda4a23	[ML] adding prediction_field_type to inference config (#55128 ) Data frame analytics dynamically determines the classification field type. This field type then dictates the encoded JSON that is written to Elasticsearch. Inference needs to know about this field type so that it may provide the EXACT SAME predicted values as analytics. Here is added a new field `prediction_field_type` which indicates the desired type. Options are: `string` (DEFAULT), `number`, `boolean` (where close_to(1.0) == true, false otherwise). Analytics provides the default `prediction_field_type` when the model is created from the process.	2020-04-15 08:32:48 -04:00
István Zoltán Szabó	a0662399c7	[DOCS] Makes PUT inference API docs collapsible (#54653 ) Co-authored-by: lcawl <lcawley@elastic.co>	2020-04-03 09:45:42 +02:00
Benjamin Trent	4e1ff31c3c	[ML] add new inference_config field to trained model config (#54421 ) A new field called `inference_config` is now added to the trained model config object. This new field allows for default inference settings from analytics or some external model builder. The inference processor can still override whatever is set as the default in the trained model config.	2020-04-02 10:34:17 -04:00
lcawl	2641a39fd5	[DOCS] Fixes shared attribute for feature importance	2020-04-01 14:46:38 -07:00
István Zoltán Szabó	a65e95e093	[DOCS] Adds feature importance mapping subsection to inference processor docs (#54190 )	2020-03-26 09:22:12 +01:00
Benjamin Trent	970f726c1f	[ML] renaming inference processor field field_mappings to new name field_map (#53433 ) This renames the `inference` processor configuration field `field_mappings` to `field_map`. `field_mappings` is now deprecated.	2020-03-12 12:49:25 -04:00
Benjamin Trent	4e1f029b04	[ML][Inference] adds new default_field_map field to trained models (#53294 ) Adds a new `default_field_map` field to trained model config objects. This allows the model creator to supply field map if it knows that there should be some map for inference to work directly against the training data. The use case internally is having analytics jobs supply a field mapping for multi-field fields. This allows us to use the model "out of the box" on data where we trained on `foo.keyword` but the `_source` only references `foo`.	2020-03-11 12:23:56 -04:00
Benjamin Trent	20f54272f0	[ML] Adds feature importance to option to inference processor (#52218 ) This adds machine learning model feature importance calculations to the inference processor. The new flag in the configuration matches the analytics parameter name: `num_top_feature_importance_values` Example: ``` "inference": { "field_mappings": {}, "model_id": "my_model", "inference_config": { "regression": { "num_top_feature_importance_values": 3 } } } ``` This will write to the document as follows: ``` "inference" : { "feature_importance" : { "FlightTimeMin" : -76.90955548511226, "FlightDelayType" : 114.13514762158526, "DistanceMiles" : 13.731580450792187 }, "predicted_value" : 108.33165831875137, "model_id" : "my_model" } ``` This is done through calculating the [SHAP values](https://arxiv.org/abs/1802.03888). It requires that models have populated `number_samples` for each tree node. This is not available to models that were created before 7.7. Additionally, if the inference config is requesting feature_importance, and not all nodes have been upgraded yet, it will not allow the pipeline to be created. This is to safe-guard in a mixed-version environment where only some ingest nodes have been upgraded. NOTE: the algorithm is a Java port of the one laid out in ml-cpp: https://github.com/elastic/ml-cpp/blob/master/lib/maths/CTreeShapFeatureImportance.cc usability blocked by: https://github.com/elastic/ml-cpp/pull/991	2020-02-21 16:36:21 -05:00
David Kyle	34743bcd6f	[ML] Remove stray field from inference docs (#51870 ) model_info_field is not a valid option	2020-02-05 10:49:36 +00:00
István Zoltán Szabó	4e0e6e83e0	[DOCS] Fixes indentation in inference processor code snippet (#51252 )	2020-01-21 16:21:17 +01:00
István Zoltán Szabó	b8cae37374	[DOCS] Adds inference processor documentation (#50204 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2019-12-19 12:19:44 +01:00

36 commits