227 commits

Author SHA1 Message Date
Joe Gallo
1a252841a1
Update geolocation database documentation (#121472) (#121670) 2025-02-05 02:22:30 +11:00
István Zoltán Szabó
db4f33043e
[DOCS] Adds examples to inference processor docs (#116018) (#118134)
(cherry picked from commit f27cb5efd3)
2024-12-06 10:26:17 +01:00
Joe Gallo
7427eb97b6
Document new ip_location processor (#116623) (#116630) 2024-11-12 12:20:52 +11:00
Joe Gallo
c8134bf787
Document new ip geolocation fields (#116603) (#116606) 2024-11-12 03:41:21 +11:00
István Zoltán Szabó
45af6f97f0
[DOCS] Updates inference processor docs. (#115566) (#115627) 2024-10-25 19:44:51 +11:00
Pete Gillin
6ec7a3439d
Add a terminate ingest processor (#114157) (#114343)
This processor simply causes any remaining processors in the pipeline
to be skipped. It will normally be executed conditionally using the
`if` option. (If this pipeline is being called from another pipeline,
the calling pipeline is *not* terminated.)

For example, this:

```
POST /_ingest/pipeline/_simulate
{
  "pipeline":
  {
    "description": "Appends just 'before' to the steps field if the error field is present, or both 'before' and 'after' if not",
    "processors": [
      {
        "append": {
          "field": "steps",
          "value": "before"
        }
      },
      {
        "terminate": {
          "if": "ctx.error != null"
        }
      },
      {
        "append": {
          "field": "steps",
          "value": "after"
        }
      }
    ]
  },
  "docs": [
    {
      "_index": "index",
      "_id": "doc1",
      "_source": {
        "name": "okay",
        "steps": []
      }
    },
    {
      "_index": "index",
      "_id": "doc2",
      "_source": {
        "name": "bad",
        "error": "oh no",
        "steps": []
      }
    }
  ]
}
```

returns something like this:

```
{
  "docs": [
    {
      "doc": {
        "_index": "index",
        "_version": "-3",
        "_id": "doc1",
        "_source": {
          "name": "okay",
          "steps": [
            "before",
            "after"
          ]
        },
        "_ingest": {
          "timestamp": "2024-10-04T16:25:20.448881Z"
        }
      }
    },
    {
      "doc": {
        "_index": "index",
        "_version": "-3",
        "_id": "doc2",
        "_source": {
          "name": "bad",
          "error": "oh no",
          "steps": [
            "before"
          ]
        },
        "_ingest": {
          "timestamp": "2024-10-04T16:25:20.448932Z"
        }
      }
    }
  ]
}
```
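
The skip-the-rest behaviour described above can be modelled with a rough Python sketch. This is a toy model, not the Elasticsearch implementation: `run_pipeline`, `append`, and the `TERMINATE` sentinel are hypothetical names invented for illustration, and the `if` option is represented by a plain Python predicate.

```python
# Toy model of a pipeline in which a conditional "terminate" step
# causes all remaining steps to be skipped. All names are hypothetical.

TERMINATE = object()  # sentinel standing in for the terminate processor


def append(field, value):
    """Return a step that appends `value` to the list stored in `field`."""
    def run(doc):
        doc.setdefault(field, []).append(value)
    return run


def run_pipeline(steps, doc):
    """Run (action, condition) steps in order; stop at a matching TERMINATE."""
    for action, condition in steps:
        # A step with a condition only runs when the condition holds.
        if condition is not None and not condition(doc):
            continue
        if action is TERMINATE:
            break  # skip every remaining step in this pipeline
        action(doc)
    return doc


steps = [
    (append("steps", "before"), None),
    (TERMINATE, lambda d: d.get("error") is not None),
    (append("steps", "after"), None),
]

ok = run_pipeline(steps, {"name": "okay", "steps": []})
bad = run_pipeline(steps, {"name": "bad", "error": "oh no", "steps": []})
print(ok["steps"], bad["steps"])
```

The document without an `error` field passes through both append steps, while the one with an `error` field stops after the first.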
2024-10-09 16:44:57 +01:00
István Zoltán Szabó
bca80f7797
[DOCS] Adds DeBERTA v2 to the tokenizers list in API docs (#112752) (#114203)
Co-authored-by: Max Hniebergall <137079448+maxhniebergall@users.noreply.github.com>
2024-10-07 19:48:43 +11:00
Sam Xiao
d405df9679
Tag redacted document in ingest pipeline (#113552) (#113750)
Adds a new `trace_redact` option to the redact processor to indicate that a document has been redacted in the ingest pipeline. If a document is processed by a redact processor and any field is redacted, the ingest metadata `_ingest._redact._is_redacted = true` is set.
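
As an illustration of the flagging rule, here is a minimal Python sketch. It is a toy model only: the real processor works with Grok patterns, whereas this sketch assumes a single IPv4 regular expression, a hypothetical `redact` helper, and a hypothetical `<IP>` replacement token. Only the `trace_redact` option name and the `_ingest._redact._is_redacted` metadata path come from the commit message.

```python
import re

# Assumption: one simple IPv4 pattern stands in for the processor's patterns.
IPV4 = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")


def redact(doc, field, trace_redact=False):
    """Mask IPv4 addresses in doc['_source'][field] and optionally flag the doc."""
    redacted, count = IPV4.subn("<IP>", doc["_source"][field])
    doc["_source"][field] = redacted
    # The flag is only set when tracing is enabled AND something was redacted.
    if trace_redact and count > 0:
        ingest = doc.setdefault("_ingest", {})
        ingest.setdefault("_redact", {})["_is_redacted"] = True
    return doc


hit = redact({"_source": {"message": "login from 10.0.0.1"}}, "message", trace_redact=True)
miss = redact({"_source": {"message": "nothing sensitive"}}, "message", trace_redact=True)
print(hit)
print(miss)
```

A document with nothing to redact keeps its metadata untouched, so the absence of the flag does not by itself distinguish "not redacted" from "not traced".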

Closes #94633

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2024-10-04 06:26:23 +10:00
Simon Cooper
53d9c3cc6a
Add some information on locale database to the ES docs (#113587) 2024-09-30 09:28:13 +01:00
kosabogi
ff926182f1
Adds text_similarity task type to inference processor documentation (#113517) (#113612) 2024-09-27 00:38:48 +10:00
Stef Nestor
b9662b505b
(Doc+) Inference Pipeline ignores Mapping Analyzers (#112522) (#112776)
* (Doc+) Inference Pipeline ignores Mapping Analyzers

From internal Dev feedback (will cross-link after), this updates the docs to state that inference processors within ingest pipelines run before mapping analyzers, effectively ignoring them. If users want an analyzer to take effect, they need to select the analyzer's ingest processor equivalent and run it earlier in the pipeline than the inference processor.

---------

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
2024-09-12 08:30:07 +10:00
Panos Koutsovasilis
29453cb2ce
fix: support all allowed protocol numbers (#111528)
* fix(CommunityIdProcessor): support all allowed protocol numbers

* fix(CommunityIdProcessor): update documentation
2024-08-26 08:37:40 +03:00
István Zoltán Szabó
1ba72e4602
[DOCS] Documents output_field behavior after multiple inference runs (#111875)
Co-authored-by: David Kyle <david.kyle@elastic.co>
2024-08-15 12:36:59 +02:00
Keith Massey
c6a7537df7
Ingest download databases docs (#111688)
Co-authored-by: Joe Gallo <joegallo@gmail.com>
2024-08-08 09:23:56 -05:00
Joe Gallo
1aa5b2face
Fix geoip processor isp_organization_name property and docs (#111372) 2024-07-26 18:28:44 -04:00
Matt Culbreth
81b8495388
Mark the Redact processor as Generally Available 2024-07-02 16:58:57 -04:00
Joe Gallo
d9941f6285
Ingest geoip new databases release highlight (#109355) 2024-06-04 12:48:19 -04:00
Joe Gallo
e1b2b599de
Add continent_code support to the geoip processor (#108780) 2024-05-17 11:48:23 -04:00
Joe Gallo
babab0a8c0
Add support for the 'Connection Type' database to the geoip processor (#108683) 2024-05-15 17:58:08 -04:00
Keith Massey
639eee577e
Adding user_type support for the enterprise database for the geoip processor (#108687) 2024-05-15 12:23:52 -05:00
Keith Massey
69ec54d541
Add support for the 'ISP' database to the geoip processor (#108651) 2024-05-15 09:27:06 -05:00
Joe Gallo
cc6597df23
Add support for the 'Domain' database to the geoip processor (#108639) 2024-05-14 17:49:05 -04:00
Liam Thompson
33a71e3289
[DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413)
* Remove `es-test-dir` book-scoped variable

* Remove `plugins-examples-dir` book-scoped variable

* Remove `:dependencies-dir:` and `:xes-repo-dir:` book-scoped variables

- In `index.asciidoc`, two variables (`:dependencies-dir:` and `:xes-repo-dir:`) were removed.
- In `sql/index.asciidoc`, the `:sql-tests:` path was updated to the fuller path
- In `esql/index.asciidoc`, the `:esql-tests:` path was updated likewise

* Replace `es-repo-dir` with `es-ref-dir`

* Move `:include-xpack: true` to few files that use it, remove from index.asciidoc
2024-04-17 14:37:07 +02:00
Joe Gallo
6ff3a2628a
Add support for the 'Enterprise' database to the geoip processor (#107377) 2024-04-11 16:45:10 -04:00
Joe Gallo
5266f79b16
Add support for the 'Anonymous IP' database to the geoip processor (#107287) 2024-04-11 14:05:52 -04:00
Jennie Soria
30828a5680
Update geoip.asciidoc (#105908)
The GeoIP endpoint does not use the xpack http client. The GeoIP downloader uses the JDK's built-in cacerts.

If a customer is using a custom HTTPS endpoint, they need to add the CA certificate to the JDK's cacerts, whether that is our bundled JDK or their own JDK. Otherwise they will see something like:
```
...PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target...
```
2024-03-05 11:26:49 +01:00
ShourieG
147484b059
[elasticsearch][processors] - Added support for override flag in rename processor (#103565)
* added override flag for rename processor along with factory tests

* added yaml tests for rename processor using the override flag

* updated renameProcessor tests to include override flag as a parameter

* updated rename processor tests to incorporate override flag = true scenario

* updated rename processor asciidoc with override option

* removed unnecessary suppresswarnings tag

* corrected formatting errors

* updated processor tests

* fixed yaml tests

* Prefer early throw style here

* Whitespace

* Move and rewrite this test

It's just a simple test of the primary behavior of the rename
processor, so put it first and simplify it.

* Rename this test

It doesn't actually exercise template snippets

* Tidy up this test

---------

Co-authored-by: Joe Gallo <joegallo@gmail.com>
2024-01-11 16:00:02 +05:30
Abdon Pijpelink
bc59315baa
[DOCS] Examples for ES|QL DISSECT and WHERE (#102591)
* DISSECT examples

* WHERE examples

* Remove references to empty keys

* Fix non-deterministic test
2023-11-27 10:56:48 +01:00
István Zoltán Szabó
c34e0c0746
[DOCS] Clarifies that inference input must be single string (#101301) 2023-10-25 17:18:05 +02:00
Abdon Pijpelink
284f81873f
[DOCS] Expand ES|QL DISSECT and GROK documentation (#101225)
* Add 'Process data with DISSECT and GROK' page

* Expand DISSECT docs

* More DISSECT and GROK enhancements

* Improve examples

* Fix CSV tests

* Review feedback

* Reword
2023-10-25 13:19:17 +02:00
István Zoltán Szabó
446ac9f378
[DOCS] Updates ELSER tutorial with inference processor changes (#100420)
Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>
2023-10-11 17:33:20 +02:00
David Kyle
6cde0df463
[ML] More checks and tests for parsing Inference processor config (#100335)
Following on from #100205 this PR adds more tests and checks 
for corner cases when parsing the configuration.
2023-10-06 15:10:45 +01:00
David Kyle
b055204b43
[ML] Simplify the Inference Ingest Processor configuration (#100205)
Adds an `input_output` option that removes the need for a `field_map` and/or
target fields. Multiple inputs can be specified in `input_output`.
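
A processor definition using this option might be assembled as in the sketch below. The commit message only names `input_output`; the `model_id`, `input_field`, and `output_field` keys are assumptions based on the inference processor reference docs and should be checked against them, and the model id and field names are hypothetical.

```python
import json


def inference_processor(model_id, pairs):
    """Build an inference processor definition with one input_output entry per pair."""
    return {
        "inference": {
            "model_id": model_id,
            # One entry per input; no field_map or target field is used.
            "input_output": [
                {"input_field": src, "output_field": dst} for src, dst in pairs
            ],
        }
    }


processor = inference_processor(
    "my-model",  # hypothetical model id
    [("title", "title_embedding"), ("body", "body_embedding")],
)
print(json.dumps(processor, indent=2))
```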
2023-10-03 18:42:31 +01:00
István Zoltán Szabó
e0cc375b14
[DOCS] Adds text_expansion config to inference processor reference docs. (#99900) 2023-09-26 12:58:19 +02:00
Felix Barnsteiner
3a7bdb5838
Make reroute processor GA (#99531) 2023-09-20 13:22:36 +02:00
Marius Iversen
4b41b17772
Update documentation for Set Processor (#99191) 2023-09-07 14:47:07 -04:00
Joe Gallo
3284903205
Document the redact processor's skip_if_unlicensed option (#99063) 2023-08-31 14:00:12 -04:00
James Baiera
7d990d5a09
Allow custom geo ip database files to be downloaded (#97850)
This PR extends the assumptions we make about database file availability to all database file 
names instead of the default ones we host at Elastic. When creating a geo ip processor with 
a database name that is not recognized we unilaterally convert the processor to one that 
tags documents with a missing database message until the database file requested is 
downloaded or provided via the manual configuration route. This allows a pipeline to be 
created and for the download service to be started, potentially sourcing the needed files.
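
The fallback described above can be sketched in Python as a toy model. Everything here is hypothetical: `make_geoip_processor`, the `databases` registry, and the `missing_database:` tag text are placeholders, not the names or tag format Elasticsearch uses.

```python
def make_geoip_processor(database_file, databases):
    """Create a processor that tags documents until `database_file` is available.

    `databases` maps a database file name to a lookup function; it stands in
    for whatever the download service or manual configuration provides.
    """
    def run(doc):
        lookup = databases.get(database_file)
        if lookup is None:
            # Database not (yet) present: tag the document instead of failing.
            doc.setdefault("tags", []).append(f"missing_database:{database_file}")
        else:
            doc["geoip"] = lookup(doc["ip"])
        return doc
    return run


databases = {}
processor = make_geoip_processor("custom.mmdb", databases)  # pipeline creation succeeds

before = processor({"ip": "192.0.2.1"})

# Later, the requested file is downloaded or provided manually.
databases["custom.mmdb"] = lambda ip: {"country_iso_code": "XX"}
after = processor({"ip": "192.0.2.1"})
print(before, after)
```

The point of the design is that pipeline creation no longer depends on the file already being present, so the download can start after the pipeline exists.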

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2023-08-16 00:31:51 -04:00
James Rodewig
fe6a42b35f
[DOCS] Update Elastic GeoIP service link (#97455)
Adds TOS-related query parameters to the Elastic GeoIP link in the [GeoIP ingest processor docs](https://www.elastic.co/guide/en/elasticsearch/reference/current/geoip-processor.html). The current link returns a 400 HTTP status.
2023-07-07 10:53:02 -04:00
Aurélien FOUCRET
dd1d157b47
Enable analytics geoip in behavioral analytics. (#96624)
* When using a managed pipeline GeoIpDownloader is triggered only when an index exists for the pipeline.

* Adding the geoip processor back

* Adding tags to the events mapping.

* Fix a forbidden API call in tests.

* lint

* Adding integration tests for managed pipelines.

* lint

* Add a geoip_database_lazy_download param to pipelines and use it instead of managed.

* Fix an edge case: pipeline can be set after index is created.

* lint.

* Update docs/changelog/96624.yaml

* Update 96624.yaml

* Uses a processor setting (download_database_on_pipeline_creation) to decide database download strategy.

* Removing debug instruction.

* Improved documentation.

* Improved the way to check for referenced pipelines.

* Fixing an error in test.

* Improved integration tests.

* Lint.

* Fix failing tests.

* Fix failing tests (2).

* Adding javadoc.

* lint javadoc.

* Using a set instead of a list to store checked pipelines.
2023-06-15 23:42:10 +02:00
debadair
777598d602
[DOCS] Remove redirect pages (#88738)
* [DOCS] Remove manual redirects

* [DOCS] Removed refs to modules-discovery-hosts-providers

* [DOCS] Fixed broken internal refs

* Fixing bad cross links in ES book, and adding redirects.asciidoc[] back into docs/reference/index.asciidoc.

* Update docs/reference/search/point-in-time-api.asciidoc

Co-authored-by: James Rodewig <james.rodewig@elastic.co>

* Update docs/reference/setup/restart-cluster.asciidoc

Co-authored-by: James Rodewig <james.rodewig@elastic.co>

* Update docs/reference/sql/endpoints/translate.asciidoc

Co-authored-by: James Rodewig <james.rodewig@elastic.co>

* Update docs/reference/snapshot-restore/restore-snapshot.asciidoc

Co-authored-by: James Rodewig <james.rodewig@elastic.co>

* Update repository-azure.asciidoc

* Update node-tool.asciidoc

* Update repository-azure.asciidoc

---------

Co-authored-by: amyjtechwriter <61687663+amyjtechwriter@users.noreply.github.com>
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Amy Jonsson <amy.jonsson@elastic.co>
Co-authored-by: James Rodewig <james.rodewig@elastic.co>
2023-05-24 12:32:46 +01:00
István Zoltán Szabó
b164555072
[DOCS] Adds deployment ID param documentation to trained model APIs (#96174) 2023-05-17 15:56:58 +02:00
amyjtechwriter
c3e186ea01
Example of dot notation to access an array field for set processor. (#95893) 2023-05-09 10:21:27 +01:00
amyjtechwriter
3d6143b829
Nodes need access to storage.googleapis.com for geoip. (#95554) 2023-04-28 10:40:18 +01:00
Felix Barnsteiner
11b598a519
Add reroute processor (#76511) 2023-04-18 19:09:25 +02:00
Joe Gallo
9bc09d576a
Fix ignore_missing docs for a couple of Ingest processors (#95244) 2023-04-13 16:34:40 -04:00
Aurélien FOUCRET
9071d114f5
[Ingest Processor] Add ignore_missing param to the uri_parts ingest processor. (#95068) 2023-04-13 15:11:19 +02:00
Jean-Fabrice Bobo
a7e901263b
Update geoip.asciidoc (#95101)
Fix `ingest.geoip.downloader.eager.download` setting not appearing in the rendered documentation
2023-04-12 09:59:27 +02:00
Alessandro Stoltenberg
c787e3808f
docs: set-processor minor update (#94899) 2023-03-30 14:27:05 +02:00
Dimitris Kotsakos
38a09bea60
[ML] Make redact processor experimental for first release (#94683) 2023-03-23 18:28:03 +02:00