elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-06-29 09:54:06 -04:00

Author	SHA1	Message	Date
Liam Thompson	8c06acccf8	[DOCS][ESQL] GA search functions for 9.1 (#129786 ) * [DOCS][ESQL] Flip preview booleans, to GA search functions * render docs, tweak some applies_to metadata in docs gen code - rendered docs (md): - kql: removed serverless preview, added ga 9.1.0 - match: removed serverless preview, added ga 9.1.0 - match_phrase: changed from preview 9.1.0 to unavailable 9.0 + ga 9.1.0 - qstr: removed serverless preview, added ga 9.1.0 - search functions list: removed bullet point before term function - docs generation code (java): - match_phrase: updated function info annotations to unavailable 9.0 + ga 9.1.0 - query_string: uncommented ga 9.1.0 annotation	2025-06-23 20:51:54 +01:00
Craig Taverner	e6347b8ab0	[DOCS] Update ES\|QL generated docs to consistently use the `applies_to` metadata (#128576 ) - Add PREVIEW annotations to all preview functions - Update docs generation logic to use annotations instead of preview boolean * Changed stack: ga 9.1 to stack: coming in multiple places - Match.java: Updated the options parameter description - MultiMatch.java: Updated the options parameter description - ToLower.java: Reformatted parameter description and updated version annotation - ToUpper.java: Removed the appliesTo annotation entirely and reformatted parameter description - updated applies_to annotations to specify both preview 9.0.0 and ga 9.1.0 lifecycle stages - added version-specific documentation examples with applies_to markers for ga 9.1.0 features - enhanced to_lower and to_upper functions to indicate support for multi-valued expressions in ga 9.1.0 - added version guards around function parameters and descriptions using applies_to syntax - updated function parameter descriptions to indicate ga 9.1.0 availability for named parameters - use detailedDescription + and fix match_phrase applies_to syntax - strip inline applies_to from kibana docs - update roundto, matchphrase lifecycles - fix match named params info - various spatial functions are preview - unsigned long is preview - update qstr - Update TO_LOWER/TO_UPPER parameter descriptions for clarity - hide spatial functions per https://github.com/elastic/elasticsearch/pull/129839 - added `stack: ga 9.1.0` blocks to match_phrase.md and qstr.md examples - simplified term.md version from `preview 9.0.0` to just `preview` - added `applies_to = "stack: ga 9.1.0"` to matchphrase and querystring java annotations - removed version `9.0.0` from term function annotation - deleted unused `makeCallout()` and `appendLifeCycleAndVersion()` methods Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co> Co-authored-by: Liam Thompson <leemthompo@gmail.com> Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> Co-authored-by: Nik Everett <nik9000@gmail.com>	2025-06-23 20:07:41 +02:00
Alexander Spies	efb1397fe9	ESQL: Hide spatial grid functions behind SNAPSHOT (#129839 ) #125143 added 9 spatial grid functions and released them into Serverless. We think this is not the best long-term approach and the functions in #129581 are likely better. As a first step, rmove the spatial grid functions added in #125143 from release builds so they don't get released into 8.19/9.1. --------- Co-authored-by: Craig Taverner <craig@amanzi.com>	2025-06-23 17:16:30 +02:00
Nik Everett	0b35acf861	ESQL: Fix misspelling in generated docs (#129789 ) Pulled from #128576 so it's easier to review.	2025-06-20 20:42:37 +01:00
Jan Kuipers	5449d95dcd	ES\|QL categorize tech preview -> GA (#129398 )	2025-06-19 12:06:28 +02:00
Kathleen DeRusso	17d463f6bf	Register match_phrase as a function not a snapshot function (#129255 ) * Register match_phrase as a function not a snapshot function * Update usage	2025-06-11 13:01:24 -04:00
Larisa Motova	1333ea8883	[ES\|QL] Specify population in StdDev docs (#129225 ) There are 2 types of Standard Deviation: population and sample, this commit clarifies that the existing is population.	2025-06-10 12:47:11 -10:00
Carlos Delgado	366e00f5c5	ES\|QL - kNN function initial support (#127322 )	2025-06-10 11:11:42 +02:00
Ioana Tagirta	15dd896a61	Remove null example for match_phrase (#129173 )	2025-06-10 10:27:15 +02:00
Kathleen DeRusso	b214fbfcdc	Take match_phrase out of snapshot and make tech preview (#128925 ) * Take match_phrase out of snapshot and make tech preview * Update docs/changelog/128925.yaml * PR feedback * Adding regenerated test data * Update docs/changelog/128925.yaml Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com> * [CI] Auto commit changes from spotless * Checkstyle * Correct docs * Hopefully fix docs build * Found one more bad docs link - here's hoping this now fixes the doc build * OMG bitten by - vs _ --------- Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com> Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co> Co-authored-by: Aurélien FOUCRET <aurelien.foucret@gmail.com>	2025-06-09 18:55:49 +02:00
Kathleen DeRusso	5d22ad6874	Add clarification to semantic_text documentation on default quantization and lexical search support (#128927 ) * Add clarifications to semantic text documentation * Regnerate match ESQL docs * Fix whitespace * PR feedback * Update docs/reference/elasticsearch/mapping-reference/semantic-text.md Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> --------- Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2025-06-06 15:01:50 +02:00
Kathleen DeRusso	eee423aaa0	[ES\|QL] Add MATCH_PHRASE (#127661 ) * Initial commit of match_phrase * Add MatchPhraseQueryTests * First pass at CSV specs * Update docs/changelog/127661.yaml * Refactor so MatchPhrase doesn't use all fulltext test cases, just text only * Fix tests * Add some CSV test cases * Fix test * Update changelog * Update tests * Comment out MATCH_PHRASE in search-functions Markdown * Minor PR feedback * PR feedback - refactor/consolidate code * Add some more tests * Fix some tests * [CI] Auto commit changes from spotless * Fix tests * PR feedback - add tests, support boost and numeric data * Revert "PR feedback - add tests, support boost and numeric data" This reverts commit `4e7a699e3e`. * Apply testing/PR feedback outside numeric support only * Regenerate docs * Add negative test * Update x-pack/plugin/esql/qa/testFixtures/src/main/resources/match-phrase-function.csv-spec Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com> * Update x-pack/plugin/esql/qa/testFixtures/src/main/resources/match-phrase-function.csv-spec Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com> * Update x-pack/plugin/esql/qa/testFixtures/src/main/resources/match-phrase-function.csv-spec Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com> * PR feedback * Fix auto-commit error * Regenerate docs * Update x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/expression/function/fulltext/MatchPhrase.java Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> * Remove non text field types * Fake test data * Remove tests that no longer should pass without ip/date/version support * Put real data in score tests now that I was able to engineer a failure * Realized the scoring test might be flakey because how it was written, updated * PR feedback * PR feedback * [CI] Auto commit changes from spotless * Add check to MatchPhrase tests * Fix merge errors * [CI] Auto commit changes from spotless * Test generated docs * Add additional verifier tests --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co> Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com> Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2025-06-04 12:32:24 -04:00
Craig Taverner	11f0c5526a	ES\|QL Support for ST_GEOHASH, ST_GEOTILE and ST_GEOHEX (#125143 ) Added support for the three primary scalar grid functions: * `ST_GEOHASH(geom, precision)` * `ST_GEOTILE(geom, precision)` * `ST_GEOHEX(geom, precision)` As well as versions of these three that take an optional `geo_shape` boundary (must be a `BBOX` ie. `Rectangle`). And also supporting conversion functions that convert the grid-id from long to string and back to long. This work represents the core of the feature to support geo-grid aggregations in ES\|QL.	2025-06-03 11:49:34 +02:00
Nik Everett	1b151eda4b	ESQL: Compute engine support for tagged queries (#128521 ) Begins adding support for running "tagged queries" to the compute engine. Here, it's just the `LuceneSourceOperator` because that's useful and contained. Example time! Say you are running: ``` FROM foo \| STATS MAX(v) BY ROUND_TO(g, 0, 100, 1000, 100000) ``` It's often faster to run this as four queries: * The docs that round to `0` * The docs that round to `100` * The docs that round to `1000` * The docs that round to `100000` This creates an ESQL operator that can run these queries, one after the other and attach those tags. Aggs uses this trick and it's way faster when it can push down count queries, but it's still faster when it pushes doc loading things. This implementation in `LuceneSourceOperator` is quite similar to the doc loading version in _search. I don't have performance measurements yet because I haven't plugged this into the language. In _search we call this `filter-by-filter` and enable it when each group averages to more than 5000 documents and when there isn't an `_doc_count` field. It's faster in those cases not to push. I expect we'll be pretty similar.	2025-05-29 12:41:58 -04:00
Nik Everett	dd180be55d	ESQL: Fix docs for ROUND_TO (#128382 ) The examples included a filter we use for testing by mistake.	2025-05-24 01:28:29 +10:00
Nik Everett	45bfaab448	ESQL: ROUND_TO function (#128278 ) Creates a `ROUND_TO` function that rounds it's input to one of the provided values. Like so: ``` ROUND_TO(v, 0, 5000, 10000, 20000, 40000, 100000) v \| ROUND_TO 0 \| 0 100 \| 0 6000 \| 5000 45001 \| 40000 999999 \| 100000 ``` For some sequences of numbers you could do this with the `/` operator - but for arbitrary sequences of numbers you needed `CASE` which is quite slow. And hard to read! Rewriting the example above would look like: ``` CASE ( v < 5000, 0, v < 10000, 5000, v < 20000, 10000, v < 40000, 20000, v < 100000, 40000, 100000 ) ``` Even better, this is fast: ``` (operation) Mode Cnt Score Error Units round_to_4_via_case avgt 7 138.124 ± 0.738 ns/op round_to_4 avgt 7 0.805 ± 0.011 ns/op round_to_3 avgt 7 0.739 ± 0.011 ns/op round_to_2 avgt 7 0.651 ± 0.009 ns/op date_trunc avgt 7 2.425 ± 0.018 ns/op ``` I've included a comparison to `DATE_TRUNC` above because we should be able to rewrite `DATE_TRUNC` into `ROUND_TO` when we know the date range of the index. This doesn't do it now, but it should be possible.	2025-05-23 10:14:30 -04:00
Nik Everett	b8e2fce60a	ESQL: Document VALUES uniques (#128157 ) Documents that the VALUES aggregate function returns unique documents and points folks to the TOP aggregate function if they want to keep dupes. Closes #128091 --------- Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2025-05-22 15:50:29 +02:00
shmuelhanoch	db644e20c8	Added esql scalb function. (#127696 ) Co-authored-by: Shmuel Hanoch <shmuel.hanoch@elastic.co>	2025-05-22 10:47:44 +03:00
Jan Kuipers	9cf2a64067	ES\|QL SAMPLE aggregation function (#127629 ) * ES\|QL SAMPLE aggregation function * [CI] Auto commit changes from spotless * ThreadLocalRandom -> SplittableRandom * Update docs/changelog/127629.yaml * fix yaml test * Add SampleTests * docs + example * polish code * mark generated imports * comment with algorith description * use Randomness.get() * close properly * type checks * reuse hash * regen some files * [CI] Auto commit changes from spotless --------- Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>	2025-05-08 08:01:53 +02:00
Craig Taverner	543aeb8c19	Output function signature license requirements to Kibana definitions (#127717 ) Output function signature license requirements to Kibana definition files, and also test that this matches the actual licensing behaviour of the functions. ES\|QL functions that enforce license checks do so with the `LicenseAware` interface. This does not expose what that functions license level is, but only whether the current active license will be sufficient for that function and its current signature (data types passed in as fields). Rather than add to this interface, we've made the license level information test-only information. This means if a function implements LicenseAware, it also needs to add a method to its test class to specify the license level for the signature being called. All functions will be tested for compliance, so failing to add this will result in test failure. Also if the test license level does not match the enforced license, that will also cause a failure.	2025-05-07 10:02:17 +02:00
Nik Everett	85027384f1	ESQL: Claim transport version to backport #124913 (#127616 ) Claims a transport version in main that we will use to backport #124913 to 8.19.	2025-05-01 23:27:42 +02:00
Liam Thompson	b6c9b9b54d	[DOCS] Update URLs for ESQL Kibana generated docs (#127011 )	2025-04-17 18:25:24 +02:00
Svilen Mihaylov	02f9af732e	Add multi_match function #121525 (#125062 ) Implement multi_match function for ESQL. Its currently available on snapshot builds pending refinement of the syntax.	2025-04-15 09:38:08 -04:00
Nik Everett	55a6624746	ESQL: TO_IP can handle leading zeros (#126532 ) Modifies TO_IP so it can handle leading `0`s in ipv4s. Here's how it works now: ``` ROW ip = TO_IP("192.168.0.1") // OK! ROW ip = TO_IP("192.168.010.1") // Fails ``` This adds ``` ROW ip = TO_IP("192.168.010.1", {"leading_zeros": "octal"}) ROW ip = TO_IP("192.168.010.1", {"leading_zeros": "decimal"}) ``` We do this because there isn't a consensus on how to parse leading zeros in ipv4s. The standard unix tools like `ping` and `ftp` interpret leading zeros as octal. Java's built in ip parsing interprets them as decimal. Because folks are using this for security rules we need to support all the choices. Closes #125460	2025-04-11 19:45:14 +02:00
Craig Taverner	67b15ad5d8	Split ES\|QL functions/operators/commands into separate pages for similar functions and make commands examples generated (#126279 ) While the internal structure of the docs is already split into many (over 1000) sub-pages, the final display for the `Functions and Operators` page is a single giant page, making navigation harder. This PR splits it into separate pages, one for each group of similar functions and one for the operators. Twelve new pages. This PR also bundles a few other related changes. In total what is done is: * Split functions/operators into 12 pages, one for each group, maintaining the existing split of each function/operator into a snippet with dynamically generated examples * Split esql-commands.md into source-commands.md and processing-commands.md, each of which is split into individual snippets, one for each command * Each command snippet has it's examples split out into separate files, if they were examples that were dynamically generated in the older asciidoc system * The examples files are overwritten by the ES\|QL unit tests, using a similar mechanism to the examples written for functions and operators) * Some additional refinements to the Kibana definition and markdown files (nicer operator headings, and display text)	2025-04-10 15:56:05 +02:00
kanoshiou	30b2a1f729	ESQL: Enhanced `DATE_TRUNC` with arbitrary intervals (#120302 ) Originally, `DATE_TRUNC` only supported 1-month and 3-month intervals for months, and 1-year interval for years, while arbitrary intervals were supported for weeks and days. This PR adds support for `DATE_TRUNC` with arbitrary month and year intervals. Closes #120094	2025-04-03 16:55:56 +02:00
Nik Everett	d30296229b	ESQL: Hide some "extras" from docs (#124763 ) Hides some of the "extra" lines from ESQL's documentation. These lines are required to make the documentation into nice tests which is important to make sure the docs don't get out of date. But readers don't need to see them.	2025-04-01 21:24:15 +01:00
Craig Taverner	7b263b4b83	Kibana updates, remove links from JSON and split is-null/is-not-null (#125986 ) In particular: * Remove all links (both asciidoc and markdown) from the JSON definition files. * This required a two phase edit, from asciidoc links to markdown, and then removal of markdown (replace with markdown text). This is because the asciidoc does not have the display text, and because some links were already markdown. * Split predicates into is_null and is_not_null * We kept the old combined version because the main docs still use that, so now we have both combined and separate versions, and Kibana can select the version they want.	2025-04-01 15:46:24 +02:00
Larisa Motova	10719831b5	[ES\|QL] Add ToAggregateMetricDouble example (#125518 ) Adds AggregateMetricDouble to the ES\|QL CSV tests and examples of how to use the ToAggregateMetricDouble function	2025-03-26 07:56:48 -10:00
Craig Taverner	8ffecb408d	Additional support for docs for ES\|QL operators and version-specific differentiation (#125251 ) This PR was originally focused on improving support for Kibana docs, in particular the missing operator docs, but it has expanded to cover a bunch of related things: * Primarily the main work was to improve operators support. ESQL generated docs cover all functions and most operators for which their is a clear operator class and test class. However, some are built-in behaviour and need additional support. This PR adds more generated content for those operators. * Various specific operators requested by Kibana: Cast & null-predicates, and in particular the addition of examples * Two functions without examples: mv_append and to_date_nanos * Many small visual document cleanups (spelling, grammar, capitalization, etc.) * Initial support for `applies_to` for multi-version differentiation. This last point requires more work, as it is not yet agreed on just how we want this to look. We'll probably need to do refinements in followup PR. Consider the version in this PR as a first step into how this could look.	2025-03-24 09:56:45 +01:00
Carlos Delgado	160ac698d7	ES\|QL: Add default values for match function options (#125282 )	2025-03-21 10:44:41 +01:00
Larisa Motova	08ae54e423	[ES\|QL] ToAggregateMetricDouble function (#124595 ) This commit adds a conversion function from numerics (and aggregate metric doubles) to aggregate metric doubles. It is most useful when you have multiple indices, where one index uses aggregate metric double (e.g. a downsampled index) and another uses a normal numeric type like long or double (e.g. an index prior to downsampling).	2025-03-18 11:39:27 -10:00
Craig Taverner	94cad286bc	Restructure query-languages docs files for clarity (#124797 ) In a few previous PR's we restructured the ES\|QL docs to make it possible to generate them dynamically. This PR just moves a few files around to make the query languages docs easier to work with, and a little more organized like the ES\|QL docs. A bit part of this was setting up redirects to the new locations, so other repo's could correctly link to the elasticsearch docs.	2025-03-17 17:58:58 +01:00
Craig Taverner	d5ddb909a4	ESQL autogenerate docs v3 (#124312 ) Building on the work started in https://github.com/elastic/elasticsearch/pull/123904, we now want to auto-generate most of the small subfiles from the ES\|QL functions unit tests. This work also investigates any remaining discrepancies between the original asciidoc version and the new markdown, and tries to minimize differences so the docs do not look too different. The kibana json and markdown files are moved to a new location, and the operator docs are a little more generated than before (although still largely manual).	2025-03-13 14:16:46 +01:00

34 commits