Commit graph

263 commits

Author SHA1 Message Date
Jason Tedor
0d4494f121
Clarify ingest-geoip database_file docs (#64340)
The docs for the geoip processor database_file option appear to indicate
that all geoip databases are in the config directory. This is leftover
legacy from when this was the case when ingest-geoip was a plugin, but
it is no longer true as the built-in databases now ship inside the
ingest-geoip module that is bundled by default. This commit clarifies
those docs.

Co-authored-by: Jakob Reiter <jakommo@users.noreply.github.com>
2020-10-29 13:27:17 -04:00
James Rodewig
1ea83359bb
[DOCS] Fix case for 'Boolean' (#64299) 2020-10-29 09:04:43 -04:00
István Zoltán Szabó
6093518f4a
[DOCS] Changes experimental flag to beta in DFA related docs (#63992) 2020-10-26 17:02:46 +01:00
bellengao
0c88c19c1d
Add country_name to the default properties of geoip ingest processor (#62915) 2020-09-30 14:06:51 -05:00
Lisa Cawley
ecf9e929ba
[DOCS] Add experimental tag to inference processor and bucket aggregation (#63023) 2020-09-30 07:20:38 -07:00
Jakob Reiter
534b179c33
[DOCS] Updated target_field description of the json ingest processor (#61968)
Co-authored-by: Dan Hermann <danhermann@users.noreply.github.com>
2020-09-30 08:43:29 -04:00
Peter Ansell
b40bdd3093
Add network from MaxMind Geo ASN database (#61676)
This adds the network property from the MaxMind Geo ASN database. 
This enables analysis of IP data based on the subnets that MaxMind have 
previously identified for ASN networks.

closes #60942
2020-09-24 11:51:50 -05:00
Dan Hermann
80ea415e0f
[DOCS] allow_duplicates option for append processor (#62336) 2020-09-15 09:01:25 -05:00
Dan Hermann
9b8e8aa7ed
[DOCS] Sort option for the grok patterns endpoint (#62092) 2020-09-14 12:36:21 -05:00
James Rodewig
b0336111af
[DOCS] Fix Gsub processor snippet (#61720) 2020-08-31 10:14:54 -04:00
Dan Hermann
0ba8d82c1b
[DOCS] Configurable output format for date processor (#61440) 2020-08-24 11:07:13 -05:00
James Rodewig
a4b8079c0a
[DOCS] Fix ingest processor TOC sort (#61412) 2020-08-21 08:44:59 -04:00
James Rodewig
bccd58b2f1
[DOCS] Fix field def for join processor (#61395) 2020-08-21 08:35:56 -04:00
James Rodewig
a94e5cb7c4
[DOCS] Replace Wikipedia links with attribute (#61171) 2020-08-17 09:44:24 -04:00
Nicole Albee
8d60bfb52d
[DOCS] Update ingest processor snippet for ECS (#61128) 2020-08-14 14:06:24 -04:00
James Rodewig
a0f4edff66
[DOCS] Fix chunking in query docs (#61053)
Changes:
* Moves "Notes" sections for the joining queries and percolate query
  pages to the parent page
* Adds related redirects for the moved "Notes" pages
* Assigns explicit anchor IDs to other "Notes" headings. This was required for
  the redirects to work.
2020-08-12 13:45:49 -04:00
James Rodewig
4eb09cb31e
[DOCS] Fix case of ingest processor titles (#61024)
Converts page headings to sentence case.
Adds a title abbreviation.
2020-08-12 11:28:00 -04:00
James Rodewig
56c778235c
[DOCS] Fix metadata field refs (#60764) 2020-08-05 13:21:00 -04:00
Jake Landis
35fc997943
Enhance the ingest node simulate verbose output (#60433)
This commit enhances the verbose output for the
`_ingest/pipeline/_simulate?verbose` api. Specifically
this adds the following:

* the pipeline processor is now included in the output
* the conditional (if) and its result are now included in the output iff the conditional was defined
* a status field is always displayed. The possible values of status are:
  * `success` - if the processor ran without errors
  * `error` - if the processor ran but threw an error that was not ignored
  * `error_ignored` - if the processor ran but threw an error that was ignored
  * `skipped` - if the processor did not run (currently only possible if the if condition evaluates to false)
  * `dropped` - if the `drop` processor ran and dropped the document
* a `processor_type` field for the type of processor (e.g. set, rename, etc.)
* throw a better error if trying to simulate with a pipeline that does not exist

closes #56004
2020-08-04 13:09:08 -05:00
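The status values introduced in 35fc997943 can be tallied with a few lines of client-side code. The following is a minimal sketch, not taken from the commit: it assumes the verbose simulate response nests results as `docs` -> `processor_results` -> `status`, and the `summarize_statuses` helper and the sample data are hypothetical.

```python
from collections import Counter

# Status values listed in the commit message for 35fc997943.
KNOWN_STATUSES = {"success", "error", "error_ignored", "skipped", "dropped"}


def summarize_statuses(response: dict) -> Counter:
    """Count processor results by status in a verbose simulate response.

    Assumption: the response nests results as
    docs -> processor_results -> status.
    """
    counts = Counter()
    for doc in response.get("docs", []):
        for result in doc.get("processor_results", []):
            status = result.get("status", "unknown")
            # Anything outside the documented set is grouped as "unknown".
            if status not in KNOWN_STATUSES:
                status = "unknown"
            counts[status] += 1
    return counts


# Hand-written sample data for illustration, not real Elasticsearch output.
sample = {
    "docs": [
        {
            "processor_results": [
                {"processor_type": "set", "status": "success"},
                {"processor_type": "rename", "status": "error_ignored"},
                {"processor_type": "drop", "status": "dropped"},
            ]
        }
    ]
}

print(summarize_statuses(sample))
```

Grouping unrecognized values under `unknown` keeps the helper from failing if later versions add further statuses.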
Alexander Reelsen
c7ac9e7073
[DOCS] http -> https, remove outdated plugin docs (#60380)
Plugin discovery documentation contained information about installing
Elasticsearch 2.0 and installing an Oracle JDK, both of which are no
longer valid.

Because the instructions used cleartext HTTP to install packages, this
commit also replaces HTTP links with HTTPS links where possible.

In addition a few community links have been removed, as they do not seem
to exist anymore.
2020-07-31 15:58:38 -04:00
James Rodewig
441c3a21b1
[DOCS] Update my-index examples (#60132)
Changes the following example index names to `my-index-000001` for consistency:

* `my-index`
* `my_index`
* `myindex`
2020-07-27 14:46:39 -04:00
James Rodewig
9f34e8e771
[DOCS] Fix ingest processor docs for autogen doc IDs (#60147)
If you autogen doc IDs, you cannot use the `{{_id}}` value in an ingest
processor.

This adds a related admonition to the ingest processor docs.
2020-07-27 13:37:22 -04:00
James Rodewig
2774cd6938
[DOCS] Swap [float] for [discrete] (#60124)
Changes instances of `[float]` in our docs for `[discrete]`.

Asciidoctor prefers the `[discrete]` tag for floating headings:
https://asciidoctor.org/docs/asciidoc-asciidoctor-diffs/#blocks
2020-07-23 11:48:22 -04:00
James Rodewig
80b674fb25
[DOCS] Reformat snippets to use two-space indents (#59973) 2020-07-21 12:24:26 -04:00
Shahzad
24e5da7851
Update regex file for es user agent node processor (#59697) 2020-07-17 16:54:34 +02:00
James Rodewig
2be9db01c8
[DOCS] Replace datatype with data type (#58972) 2020-07-07 13:52:10 -04:00
David Kyle
bf245e4c07
Make Inference processor field_map and inference_config optional (#58868)
Relaxes the requirement that the inference ingest processor must have a
field_map and inference_config defined even if they are empty.
2020-07-03 08:36:57 +01:00
DeDe Morton
b5e374d958
[DOCS] Change Beats links to refactored getting started docs (#58790) 2020-07-02 17:10:09 -07:00
Nik Everett
a4d30352c7
Document using stored scripts for ingest (#58783)
This documents using stored scripts for complex conditionals in ingest.
2020-07-01 13:35:13 -04:00
István Zoltán Szabó
d0042fb791
[DOCS] Updates results_field description in the inference processor docs (#58554) 2020-06-29 11:28:17 +02:00
Jake Landis
5088ab151a
Update hh to HH in date processor example (#58089) (#58142)
Co-authored-by: Leaf-Lin <39002973+Leaf-Lin@users.noreply.github.com>
2020-06-15 17:03:42 -05:00
bellengao
efc4c9a210
Add ignore_empty_value parameter in set ingest processor (#57030) 2020-06-15 07:26:57 -05:00
Jake Landis
f5910664b7
Ensure Joni warnings are logged at debug (#57302)
When Joni, the regex engine that powers grok, emits a warning, it
does so by default to System.err. System.err logs are all bucketed
together in the server log at WARN level. When Joni emits a warning,
it can be extremely verbose, logging a message for each execution
against that pattern. For ingest node, that means a message for every
document that is run through Grok. Fortunately, Joni provides a
callback hook to push these warnings to a custom location.

This commit implements Joni's callback hook to push the Joni warnings
to the Elasticsearch server logger (logger.org.elasticsearch.ingest.common.GrokProcessor)
at debug level. Generally these warnings indicate a possible issue with
the regular expression. Upon creation, the Grok processor will
do a "test run" of the expression and log the result (if any) at WARN
level. This WARN level log should only occur on pipeline creation, which
happens much less frequently than once per document.

Additionally, the documentation is updated with instructions for how
to set the logger to debug level.
2020-06-09 13:33:27 -05:00
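The message for f5910664b7 mentions instructions for setting the logger to debug level but does not reproduce them. As a rough sketch only, and assuming the level is changed through the cluster update settings API (`PUT /_cluster/settings`), the request body could be built as follows. The logger name comes from the commit message; the `grok_logger_settings` helper is hypothetical.

```python
import json

# Logger name as given in the commit message for f5910664b7.
GROK_LOGGER = "logger.org.elasticsearch.ingest.common.GrokProcessor"


def grok_logger_settings(level: str = "debug", persistent: bool = False) -> dict:
    """Build a cluster settings body that sets the Grok processor log level.

    Assumption: the level is applied with the cluster update settings API.
    """
    scope = "persistent" if persistent else "transient"
    return {scope: {GROK_LOGGER: level}}


body = grok_logger_settings()
print("PUT /_cluster/settings")
print(json.dumps(body, indent=2))
```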
Lisa Cawley
8b9293b3bf
[DOCS] Replace docdir attribute with es-repo-dir (#57489) 2020-06-01 15:55:05 -07:00
Adam Locke
d77388f919
[DOCS] Add links to flattened datatype (#56794)
* Changes for #52239.

* Incorporating review feedback from Julie T. Also single-sourcing nested options in the Mapping page and referencing them in the Nested page.

* Moving tip after the introduction and clarifying limits.

* Update docs/reference/mapping.asciidoc

Co-authored-by: James Rodewig <james.rodewig@elastic.co>

* Update docs/reference/mapping/types/nested.asciidoc

2020-05-19 13:40:26 -04:00
James Rodewig
2f930f1ec0
[DOCS] Correct query datatype in enrich policy definition (#56224)
Corrects the datatype for the `query` property of an enrich policy
object. The `query` property is a query object, not a string.
2020-05-13 08:34:22 -04:00
Thiago Souza
863a883286
[DOCS] Correct get enrich policy API request example (#56207) 2020-05-05 12:34:50 -04:00
István Zoltán Szabó
ca2f98382f
[DOCS] Changes feature importance links to point to the new page (#55531)
* [DOCS] Changes feature importance links to point to the new page.

* [DOCS] Fixes line breaks.
2020-04-28 09:02:14 +02:00
Benjamin Trent
c1afda4a23
[ML] adding prediction_field_type to inference config (#55128)
Data frame analytics dynamically determines the classification field type. This field type then dictates the encoded JSON that is written to Elasticsearch. 

Inference needs to know about this field type so that it may provide the EXACT SAME predicted values as analytics. 

This commit adds a new field, `prediction_field_type`, which indicates the desired type. Options are: `string` (DEFAULT), `number`, `boolean` (where close_to(1.0) == true, false otherwise).

Analytics provides the default `prediction_field_type` when the model is created from the process.
2020-04-15 08:32:48 -04:00
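The three `prediction_field_type` options described in c1afda4a23 can be illustrated with a small coercion function. This is a sketch under stated assumptions rather than the Elasticsearch implementation: the `coerce_prediction` helper is hypothetical, the tolerance used for "close to 1.0" is an arbitrary choice, and the string formatting is simply Python's `str`.

```python
import math


def coerce_prediction(value, prediction_field_type: str = "string",
                      tolerance: float = 1e-9):
    """Coerce a predicted value to the requested field type.

    Assumptions: `tolerance` is an illustrative threshold for "close to 1.0";
    the real comparison used by Elasticsearch is not shown in the commit.
    """
    if prediction_field_type == "string":
        return str(value)
    if prediction_field_type == "number":
        return float(value)
    if prediction_field_type == "boolean":
        # True only when the value is close to 1.0, false otherwise.
        return math.isclose(float(value), 1.0, abs_tol=tolerance)
    raise ValueError(f"unsupported prediction_field_type: {prediction_field_type}")


print(coerce_prediction(1.0, "boolean"))
print(coerce_prediction(0.0, "boolean"))
print(coerce_prediction("2", "number"))
```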
István Zoltán Szabó
a0662399c7
[DOCS] Makes PUT inference API docs collapsible (#54653)
Co-authored-by: lcawl <lcawley@elastic.co>
2020-04-03 09:45:42 +02:00
Benjamin Trent
4e1ff31c3c
[ML] add new inference_config field to trained model config (#54421)
A new field called `inference_config` is now added to the trained model config object. This new field allows for default inference settings from analytics or some external model builder. 

The inference processor can still override whatever is set as the default in the trained model config.
2020-04-02 10:34:17 -04:00
lcawl
2641a39fd5
[DOCS] Fixes shared attribute for feature importance 2020-04-01 14:46:38 -07:00
AndyHunt66
ba8253f5ee
[DOCS] Remove redundant sentence in ingest processor docs (#54329) 2020-03-27 08:23:46 -04:00
István Zoltán Szabó
a65e95e093
[DOCS] Adds feature importance mapping subsection to inference processor docs (#54190) 2020-03-26 09:22:12 +01:00
bellengao
8ffe5d1f94
Support array for all string ingest processors 2020-03-17 15:22:30 -05:00
Benjamin Trent
970f726c1f
[ML] renaming inference processor field field_mappings to new name field_map (#53433)
This renames the `inference` processor configuration field `field_mappings` to `field_map`. 

`field_mappings` is now deprecated.
2020-03-12 12:49:25 -04:00
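For pipelines that still use the deprecated name from 970f726c1f, the rename can be applied mechanically. The sketch below is illustrative only: the `migrate_inference_processor` helper and the sample processor definition are hypothetical, and it assumes a processor is represented as a dictionary keyed by processor type.

```python
import copy


def migrate_inference_processor(processor: dict) -> dict:
    """Return a copy of a processor definition using `field_map`.

    If both names are present, the existing `field_map` wins and the
    deprecated `field_mappings` entry is dropped.
    """
    migrated = copy.deepcopy(processor)
    inference = migrated.get("inference")
    if isinstance(inference, dict) and "field_mappings" in inference:
        old_value = inference.pop("field_mappings")
        inference.setdefault("field_map", old_value)
    return migrated


# Hypothetical processor definition, for illustration only.
old_style = {
    "inference": {
        "model_id": "my-model",
        "field_mappings": {"original_field": "model_field"},
    }
}

print(migrate_inference_processor(old_style))
```

Working on a deep copy leaves the original definition untouched, so the old and new forms can be compared before a pipeline is updated.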
James Rodewig
bc7643c65b
[DOCS] Reduce content reuse in enrich docs (#53460)
Restructures the 'Update an enrich policy' section to:

* Migrate the content to the section. It was previously stored in the
  Put Enrich Policy API docs.
* Remove the warning tag admonition from the section content.
* Replace a reused section earlier in the "Set up an enrich processor"
  page with a link.

No substantive changes were made to the content.
2020-03-12 05:40:57 -04:00
Benjamin Trent
4e1f029b04
[ML][Inference] adds new default_field_map field to trained models (#53294)
Adds a new `default_field_map` field to trained model config objects. 

This allows the model creator to supply a field map if it knows that a map is needed for inference to work directly against the training data.

The use case internally is having analytics jobs supply a field mapping for multi-field fields. This allows us to use the model "out of the box" on data where we trained on `foo.keyword` but the `_source` only references `foo`.
2020-03-11 12:23:56 -04:00
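The `foo` versus `foo.keyword` case described in 4e1f029b04 can be pictured as a key rename applied before inference. This is a minimal sketch, not the Elasticsearch implementation: it assumes the map goes from the document field name to the field name the model was trained on, and the `apply_field_map` helper and sample document are hypothetical.

```python
def apply_field_map(source: dict, field_map: dict) -> dict:
    """Return a copy of `source` with keys renamed according to `field_map`.

    Assumption: `field_map` maps document field names to the field names
    the model was trained on. Unmapped fields pass through unchanged.
    """
    return {field_map.get(key, key): value for key, value in source.items()}


# The model was trained on `foo.keyword`, but `_source` only contains `foo`.
default_field_map = {"foo": "foo.keyword"}
doc_source = {"foo": "bar", "count": 3}

print(apply_field_map(doc_source, default_field_map))
# -> {'foo.keyword': 'bar', 'count': 3}
```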
Orhan Toy
bce4a3bd4b
[DOCS] Fix formatting of simulate ingest pipeline API docs (#52754)
Wraps request routes for the simulate ingest pipelines in the API docs. This ensures the routes display in monospace.
2020-03-02 11:41:22 -05:00
David Pilato
e51b8a51aa
[DOCS] Fix typo in CSV processor docs (#52649)
Corrects an example array in a snippet of the CSV processor docs.
2020-02-25 08:47:58 -05:00