mirror of
https://github.com/elastic/elasticsearch.git
synced 2025-04-26 08:07:27 -04:00
* Enforce an invariant in our dependency checker so that logical plans never have duplicate output attribute names or ids. * Fix ROW to not produce columns with duplicate names. * Fix ResolveUnionTypes to not create multiple synthetic field attributes for the same union type. * Add tests for commands using the same column name more than once. * Update docs w.r.t. how commands behave if they are used with duplicate column names.
85 lines
2.3 KiB
Text
85 lines
2.3 KiB
Text
[discrete]
|
|
[[esql-grok]]
|
|
=== `GROK`
|
|
|
|
`GROK` enables you to <<esql-process-data-with-dissect-and-grok,extract
|
|
structured data out of a string>>.
|
|
|
|
**Syntax**
|
|
|
|
[source,esql]
|
|
----
|
|
GROK input "pattern"
|
|
----
|
|
|
|
*Parameters*
|
|
|
|
`input`::
|
|
The column that contains the string you want to structure. If the column has
|
|
multiple values, `GROK` will process each value.
|
|
|
|
`pattern`::
|
|
A grok pattern.
|
|
If a field name conflicts with an existing column, the existing column is discarded.
|
|
If a field name is used more than once, a multi-valued column will be created with one value
|
|
per each occurrence of the field name.
|
|
|
|
*Description*
|
|
|
|
`GROK` enables you to <<esql-process-data-with-dissect-and-grok,extract
|
|
structured data out of a string>>. `GROK` matches the string against patterns,
|
|
based on regular expressions, and extracts the specified patterns as columns.
|
|
|
|
Refer to <<esql-process-data-with-grok>> for the syntax of grok patterns.
|
|
|
|
*Examples*
|
|
|
|
// tag::examples[]
|
|
The following example parses a string that contains a timestamp, an IP address,
|
|
an email address, and a number:
|
|
|
|
[source.merge.styled,esql]
|
|
----
|
|
include::{esql-specs}/docs.csv-spec[tag=basicGrok]
|
|
----
|
|
[%header.monospaced.styled,format=dsv,separator=|]
|
|
|===
|
|
include::{esql-specs}/docs.csv-spec[tag=basicGrok-result]
|
|
|===
|
|
|
|
By default, `GROK` outputs keyword string columns. `int` and `float` types can
|
|
be converted by appending `:type` to the semantics in the pattern. For example
|
|
`{NUMBER:num:int}`:
|
|
|
|
[source.merge.styled,esql]
|
|
----
|
|
include::{esql-specs}/docs.csv-spec[tag=grokWithConversionSuffix]
|
|
----
|
|
[%header.monospaced.styled,format=dsv,separator=|]
|
|
|===
|
|
include::{esql-specs}/docs.csv-spec[tag=grokWithConversionSuffix-result]
|
|
|===
|
|
|
|
For other type conversions, use <<esql-type-conversion-functions>>:
|
|
|
|
[source.merge.styled,esql]
|
|
----
|
|
include::{esql-specs}/docs.csv-spec[tag=grokWithToDatetime]
|
|
----
|
|
[%header.monospaced.styled,format=dsv,separator=|]
|
|
|===
|
|
include::{esql-specs}/docs.csv-spec[tag=grokWithToDatetime-result]
|
|
|===
|
|
|
|
If a field name is used more than once, `GROK` creates a multi-valued
|
|
column:
|
|
|
|
[source.merge.styled,esql]
|
|
----
|
|
include::{esql-specs}/docs.csv-spec[tag=grokWithDuplicateFieldNames]
|
|
----
|
|
[%header.monospaced.styled,format=dsv,separator=|]
|
|
|===
|
|
include::{esql-specs}/docs.csv-spec[tag=grokWithDuplicateFieldNames-result]
|
|
|===
|
|
// end::examples[]
|