elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-06-29 09:54:06 -04:00

Author	SHA1	Message	Date
Craig Taverner	9ac356bbcf	Update docs with support for geo_shape and cartesian_shape (#104480 ) * Document to_geoshape and to_cartesianshape * Note that geo_point and point are now supported Elasticsearch types * Fix doc links	2024-01-18 13:00:38 +01:00
Ignacio Vera	585face0cf	ESQL: Support loading shapes from source into WKB blocks (#104269 ) This commit adds support for reading geo_shape and shape fields into wkb blocks so they can be used to project the data into the result of an ES|QL query.	2024-01-17 14:55:37 +01:00
Luigi Dell'Aquila	ad28dc9a6c	ESQL: Add TO_UPPER and TO_LOWER functions (#104309 )	2024-01-15 11:58:57 +01:00
Ignacio Vera	f64147f6c1	ESQL: Remove the possibility of CartesianPoint or GeoPoint literal to a long (#104197 )	2024-01-11 14:25:48 +01:00
Marco Liberati	3145e9f9fc	[ES|QL] Annotate all ESQL functions (for SHOW FUNCTIONS) (#103686 ) * 🔧 Initial annotations * ✏️ Add substring doc * 🏷️ Fix imports * ✏️ Add annotations for string fns * ♻️ Fix issues * 🚨 Fix linting issues * ✏️ Add more annotations * 🚨 Fix linting issues * ♻️ Add missing import * ✏️ Add more annotations * 🚨 Fix linting * ✅ Fix many tests for new annotations * ✅ Fix more tests * 🐛 Fix missing flag * ✨ Annotate new functions * ✅ fix more tests * ✅ Fix signature issue * ✅ Make all csv tests pass * ✅ Fix remaining tests * ✨ New assets from annotations * ✅ Refactor test * ✅ Fix updated signature	2024-01-11 10:15:30 +01:00
Bogdan Pintea	9f6e5a4ccf	ESQL: Add Warning tests for all remaining functions (#103739 ) This adds unit test cases for all the functions that were missing tests checking the correct generation of the Warning headers in case the execution raised an Exception that led to a `null` result.	2024-01-10 22:27:13 +01:00
Mark Tozzi	282f0f0a00	[ESQL] Remove is_nan, is_finite, and is_infinite (#104091 ) A while ago we decided we weren't going to support NaN or infinite values in the language. Removing these functions is part of that effort.	2024-01-09 14:51:32 -05:00
Nik Everett	5ef5dca334	ESQL: `MV_FIRST` and `MV_LAST` (#103928 ) This creates the `MV_FIRST` and `MV_LAST` functions that return the first and last values from a multivalue field. They are noops from a single valued field. They are quite similar to `MV_MIN` and `MV_MAX` except they work on positional data rather than relative size. That sounds like a large distinction, but in practice our multivalued fields are often sorted. And when they operate on sorted arrays `MV_MIN` does the same thing as `MV_FIRST`. But there are some cases where it really does matter - say you are `SPLIT`ing something - so `MV_FIRST(SPLIT("foo;bar;baz", ";"))` gets you `foo` like you'd expect. No sorting needed. Relates to #103879	2024-01-09 08:46:34 -05:00
Luigi Dell'Aquila	770fc19b14	ESQL: add date_diff function (#104118 ) Same as https://github.com/elastic/elasticsearch/pull/103208 Fixes #101942 We had to revert it after a Checkstyle failure (strange it didn't pop up in the CI before merging)	2024-01-09 07:03:58 -05:00
David Turner	943b2eae70	Revert "Esql/create DATE_DIFF function (#103208 )" This reverts commit `ec2e18536d`.	2024-01-09 09:50:48 +00:00
Nicolas Gras	ec2e18536d	Esql/create DATE_DIFF function (#103208 )	2024-01-09 10:37:42 +01:00
Bogdan Pintea	a5aa900358	ESQL: Update the use of some user-caused exceptions (#104046 ) This updates the use of the exceptions subclassed from `QlServerException` when the failure reason is user-caused. This ensures that a 400-class response is returned, instead of a 500-class one.	2024-01-08 21:13:20 +01:00
Nik Everett	fac60e5803	ESQL: Delay finding field load infrastructure (#103821 ) This optimizes loading fields across many, many indices by resolving the field loading infrastructure when it's first needed rather than up front. This speeds things up because, if you are loading from many many shards, you often don't need to set up the field loading infrastructure for all shards at all - often you'll just need to set it up for a couple of the shards.	2024-01-03 14:00:32 -05:00
Bogdan Pintea	6e9215a1fa	ESQL: allow `null` in date math (#103610 ) This fixes `null`'s handling in date math. So far the `null` (of type `NULL`) has been rejected by the type resolution. This is now allowed through, leading to a `null` result, in line with the other types. Fixes #103085.	2024-01-03 07:48:31 -05:00
Bogdan Pintea	4e2389fb2c	ESQL: Improve docs for the floating point is_xxx() functions (#103691 ) Improve the docs for is_nan, is_finite, is_infinite functions. This also adjusts the CamelCase to snake_case conversion, to not consider the last capital letter (like in `IsNaN`).	2024-01-03 12:18:15 +01:00
Nik Everett	16e9bbe2d3	ESQL: Update generated docs	2023-12-27 15:58:02 -05:00
Abdon Pijpelink	8e0d4e1846	[DOCS] Improve ES|QL functions reference for functions A-D (#103447 ) * Functions starting with A * Functions starting with 'C' * More 'C' functions * Fix tests * Fix missing snippet * DATE_* functions * Apply suggestions from code review Co-authored-by: Bogdan Pintea <pintea@mailbox.org> --------- Co-authored-by: Bogdan Pintea <pintea@mailbox.org>	2023-12-19 15:59:02 +01:00
Craig Taverner	e1835c97e8	ESQL: Documentation for support for geo_point and point (#103207 ) * Start working on geo_point and point docs for ESQL * Added to_cartesianpoint and includes * Sub-headings for easier reading * Improve sub-headings * Hide to_long and support for longs in to_geopoint and to_cartesianpoint	2023-12-12 12:19:23 +01:00
Nik Everett	3b738905e1	ESQL: Add type tables for operators to docs (#103206 ) This adds a tiny blurb for each operator to the docs with a railroad diagram of the operator's syntax and a table of the input and output types. This also fixes the tests to correctly generate the tables for operators.	2023-12-11 10:51:38 -05:00
Mark Tozzi	690a8a2ef8	[ES|QL] Trig & hyperbolic function finishing pass (#103205 ) Relates to #100558 These functions were mostly done already. I added descriptions and argument descriptions for most of them, otherwise not much work.	2023-12-08 14:27:22 -05:00
Nik Everett	6e0c031342	ESQL: Generate railroad diagrams for operators (#103143 ) This enables the generation of railroad diagrams for unary minus and a bunch of binary operators like `+`, `-`, `%`, and `>=`. Relates to #100558	2023-12-08 09:24:34 -05:00
Bogdan Pintea	d3fefde0a3	ESQL: add unit tests for conversion functions. Extend `TEXT` type support for them (#102746 ) This adds the missing unit tests for the conversion functions. It also extends the type support by adding the `TEXT` type to those functions that support `KEYWORD` already (which also simplifies the testing, actually). Some functions did have it, some didn't; they now all do. The change also fixes two defects resulting from better testing coverage: `ToInteger` and `ToUnsignedLong` had some missing necessary exceptions declarations in the decorators for the evaluators. It also updates `ToInteger`'s `fromDouble()` conversion to use a newly added utility, so that the failed conversions contain the right message (`out of [integer] range`, instead of the confusing `out of [long] range`). Related: #102488, #102552.	2023-12-04 13:58:23 +01:00
Nik Everett	e39fdba02e	ESQL: Add newly generated signature We're generating it now, we should commit it so the docs can have it.	2023-11-27 13:29:03 -05:00
Abdon Pijpelink	bc59315baa	[DOCS] Examples for ES|QL DISSECT and WHERE (#102591 ) * DISSECT examples * WHERE examples * Remove references to empty keys * Fix non-deterministic test	2023-11-27 10:56:48 +01:00
Nhat Nguyen	774a05cc0a	Update doc fields for pow and date_extract function (#102554 ) These two generated files have not been updated and committed.	2023-11-23 17:33:38 -08:00
Mark Tozzi	7345e643ba	[ES|QL] pow function always returns double (#102183 ) This corrects an earlier mistake in the ES|QL language design. Initially we had thought to have pow return the same type as its inputs, but in practice even for integer inputs this quickly grows out of the representable range, and we returned null much of the time. This also created a lot of edge cases around casting to/from doubles (which the underlying java function uses). The version in this PR follows the java spec, by always casting its inputs to doubles, and returning a double. Doing it this way also allows for a rather significant reduction in lines of code. I removed many of the tests covering pow specific edge cases. This seems reasonable to me as I expect java.lang.Math.pow to be well behaved and most of those edge cases were around type testing which no longer applies. At the same time, this simplification allows us to leverage the new scalar function testing framework, which means better null coverage, better type coverage, and much easier extensibility. We do consider this a breaking change, but as the feature is still in tech preview and this is a relatively small surface area, we are not too concerned with disruptions. Resolves #99055 Relates to #100558 --------- Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2023-11-21 09:58:07 -05:00
Nik Everett	d46c386c86	ESQL: Extra tests for trim/ltrim/rtrim (#102308 ) This adds some extra tests, docs, and descriptions for the `trim`, `ltrim`, and `rtrim` functions.	2023-11-16 14:44:56 -05:00
Fabio Busatto	11cb81a09a	Update SVG reference for starts_with ESQL docs (#101909 )	2023-11-08 13:39:49 +01:00
Abdon Pijpelink	4a2ed90f92	[DOCS] Explain ES|QL CASE behavior with even no. arguments (#101829 )	2023-11-07 18:08:54 +01:00
Abdon Pijpelink	2b4ba7a744	[DOCS] Small ES|QL improvements (#101877 ) * [DOCS] Small ES|QL improvements * Fix test failure	2023-11-07 17:24:59 +01:00
Luigi Dell'Aquila	d53c0cbf82	ESQL: annotate trigonometric functions and auto_bucket() (SHOW FUNCTIONS) (#101460 )	2023-10-27 16:35:44 +02:00
Nik Everett	46f95a67b4	ESQL: More MV_* tests (#100564 ) This adds more tests for some of the `MV_` functions and updates their docs now that the railroad diagram and table generated by the tests covers all of the types.	2023-10-24 16:55:17 -04:00
AlexB	931dcae41d	Add improvements to the ES|QL docs (#101195 ) Content and structural improvements to the ES|QL docs --------- Co-authored-by: Alexandros Batsakis <abatsakis@splunk.com> Co-authored-by: Abdon Pijpelink <abdon.pijpelink@elastic.co>	2023-10-23 07:45:42 -07:00
Abdon Pijpelink	8ac4ba751e	Restructure ES|QL docs (#100806 ) * Break out 'Limitations' into separate page * Add REST API docs * Restructure commands, functions, and operators refs * Add placeholder for getting started guide * Group 'Syntax', 'Metafields', and 'MV fields' under 'Language' * Add placeholder for Kibana page * Add link from landing page * Apply uniform formatting to ACOS, CASE, and DATE_PARSE function refs * Reword default LIMIT * Add support for COUNT() Move 'Commands' and 'Functions and operators' to individual pages --------- Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2023-10-17 17:36:14 +02:00
gheorghepucea	cb30096c65	Referenced the svgs of starts_with and trim in asciidoc for consistency. (#100834 )	2023-10-13 16:01:47 +02:00
Nik Everett	38eac268b4	ESQL: Build tracked block in EVAL (#100268 ) This changes `EVAL` to build tracked blocks so we can trip the breaker when there are too many tracked blocks hanging about.	2023-10-04 10:29:54 -04:00
Nik Everett	9620512a89	ESQL: Tests for large concat and many evals (#100159 )	2023-10-03 14:41:40 -04:00
Luigi Dell'Aquila	6e79013088	ESQL: enhance SHOW FUNCTIONS command (#99736 ) Fixes https://github.com/elastic/elasticsearch/issues/99507 Enhance SHOW FUNCTIONS command to return as much _structured_ information as possible about the function signature, ie. - function name - return type - param names - param types - param descriptions For now, as an example, the annotations are used only on `sin()` and `date_parse()` functions; if we agree on this approach, I'll proceed to - enhance all the currently implemented functions with the needed information - improve the function tests to verify that every new implemented function provides meaningful information --- This feature can be useful for the end user, but the main goal is to give Kibana an easy way to produce in-line documentation (contextual messages, autocomplete) for functions Similar to current implementation, that has a `@Named("paramName")` annotation for function parameters, this PR introduces two more annotations `@Param(name, type, description, optional)` and `@FunctionInfo()` to provide information about single parameters and functions. The result of `SHOW FUNCTIONS` query will have the following columns: - name (keyword): the function name - synopsis (keyword): the full signature of the function, eg. `double sin(n:integer|long|double)` - argNames (keyword MV): the function argument names - argTypes (keyword MV): the function argument types - argDescriptions (keyword MV): a textual description of each function argument - returnType (keyword): the return type of the function - description (keyword): a textual description of the function --- Open questions: - ~~how structured should types be? Eg. should we have a strict `@Typed("keyword")`/`@Typed({"keyword", "text"})` or should we have a more generic type description, eg. `@Typed("numeric")`, `@Typed("any")`? 
The first one is more useful for API consumption but it's hard with our complex type system (type classes, custom types, unsupported and so on); the second one is less structured, but probably more useful for documentation, that is the most immediate use case of this feature.~~ All the types are listed explicitly - ~~we have alternatives for the synopsis, eg.~~ - ~~`functionName(<paramName>:<paramType>, ...): <returnType>`~~ - ~~`<returnType> functionName(<paramName>:<paramType>, ...)`~~ - ~~`<returnType> functionName(<paramType> <paramName>, ...)`~~ Using `<returnType> functionName(<paramName>:<paramType>, ...)` for now. If multiple types are supported, then they will be separated by pipes, eg. `double sin(n:integer|long|double)`.	2023-10-02 13:56:41 -04:00
AlexB	2ccdae6745	Eval REPLACE function (#98909 ) Co-authored-by: Alexandros Batsakis <abatsakis@splunk.com> Co-authored-by: Andrei Stefan <andrei@elastic.co>	2023-09-29 17:41:20 +03:00
Nik Everett	e1b1f6f1db	ESQL: Create `Block.Ref` (#100042 ) This creates `Block.Ref`, a reference to a `Block` which may or may not be part of a `Page`. `Block.Ref` is `Releasable` and closing it is a noop if the `Block` is part of a `Page`, but if it is "free floating" then closing the `Block.Ref` will close the block. It also modified `ExpressionEvaluator` to return a `Block.Ref` instead of a `Block` - so you tend to work with `ExpressionEvaluator`s like this: ``` try (Block.Ref ref = eval.eval(page)) { return ref.block().doStuff(); } ``` This should make it much easier to release the memory from `Block`s built by `ExpressionEvaluator`s. This change is mostly mechanical, introducing the new signature for `ExpressionEvaluator`. In a follow up change I'll modify the tests to make sure we're correctly using it to close pages. I did think about changing `ExpressionEvaluator` to add a method telling you if the block that it returns must be closed or not. This would have been more difficult to work with, and, ultimately, limiting. Specifically, it is possible for an `ExpressionEvaluator` to sometimes return a free floating block and other times return one that is contained in a `Page`. Imagine `mv_concat` - it returns the block it receives if the block doesn't have multivalued fields. Otherwise it concats things. If that block happens to come directly out of the `Page`, then `mv_concat` will sometimes produce free floating blocks and sometimes not.	2023-09-29 09:26:44 -04:00
Nik Everett	5e3ab06151	ESQL: Prevent `CONCAT` from using a ton of memory (#99716 ) This prevents `CONCAT` from using an unbounded amount of memory by hooking its temporary value into the circuit breaker. To do so, it makes all `ExpressionEvaluator`s `Releasable`. Most of the changes in this PR just plumb that through to every evaluator. The rest of the changes correctly release evaluators after their use. I considered another tactic but didn't like it as much, even though the number of changes would be smaller - I could have created a fresh, `Releasable` temporary value for every `Page`. It would be pretty contained to keep the releasable there. But I wanted to share the temporary state across runs to avoid a bunch of allocations. Here's a script that used to crash before this PR but is fine after: ``` curl -uelastic:password -XDELETE localhost:9200/test curl -HContent-Type:application/json -uelastic:password -XPUT localhost:9200/test -d'{ "mappings": { "properties": { "short": { "type": "keyword" } } } }' curl -HContent-Type:application/json -uelastic:password -XPUT localhost:9200/test/_doc/1?refresh -d'{"short": "short"}' echo -n '{"query": "FROM test ' > /tmp/evil for i in {0..9}; do echo -n '| EVAL short = CONCAT(short' >> /tmp/evil for j in {1..9}; do echo -n ', short' >> /tmp/evil done echo -n ')' >> /tmp/evil done echo '| EVAL len = LENGTH(short) | KEEP len"}'>> /tmp/evil curl -HContent-Type:application/json -uelastic:password -XPOST localhost:9200/_query?pretty --data-binary @/tmp/evil ```	2023-09-22 11:27:13 -04:00
Bogdan Pintea	34eea49ef5	ESQL: Swap arguments of remaining date_xxx() functions (#99561 ) This swaps the argument of `date_extract()`, `date_format()` and `date_parse()` functions, to align with `date_trunc()`. The field argument is now always last, even for _format() and _parse(), whose optional argument will now be provided as the first one.	2023-09-19 20:22:34 +02:00
gheorghepucea	d58b9ea87d	Added esql ends_with implementation (#99613 ) Added an implementation for `ends_with` function in esql. `ends_with` - Returns a boolean that indicates whether a keyword string ends with another string. Also made sure that the docs look alright.	2023-09-18 11:29:20 -04:00
Nik Everett	0d8a1975a9	ESQL: Fix test for unsigned long (#99441 ) We were generating negative values which made the tests confused.	2023-09-12 11:46:09 -04:00
Nik Everett	44c3cde48c	ESQL: Fix compile Two PRs cross in the night. Then nothing compiles.	2023-09-11 14:35:05 -04:00
Nik Everett	936e69ddd5	ESQL: Yet more function tests and docs (#99009 ) This adds tests, supported types, and a signature image for `to_string` and `to_version`. It also fixes the resolution of functions whose names contain an `_`. Finally, it updates the docs for `ceil` to render the image more nicely.	2023-09-11 14:10:17 -04:00
Abdon Pijpelink	91759ce592	[DOCS] Some minor ES|QL docs fixes (#99423 )	2023-09-11 16:20:10 +02:00
dreamquster	04381664c1	ESQL: Implement 'right' function (#98974 ) Add the 'right' function, which extracts a substring beginning from its right end (opposite function of 'left'). --------- Co-authored-by: Alexander Spies <alexander.spies@elastic.co>	2023-09-08 17:27:59 +02:00
Nik Everett	b73cc0c529	ESQL: Only generate syntax diagrams locally (#99059 ) CI will skip building them. Lots of CI machines don't have font support so they can't generate these. But all local machines have a GUI so they can. Also, super-lazy initialize the font so CI doesn't bump into it by accident. Closes #99018	2023-08-30 14:44:14 -04:00
dreamquster	2644ccbb8a	Implement the 'left' function in issue #98545 (#98942 ) @nik9000 I checked out the main branch again and refactored the 'left' function to cut the prefix string in place. But I hit a problem: 'left' fails the test case 'testEvaluateInManyThreads'. I find that in the multi-threaded case, ` EvalOperator.ExpressionEvaluator eval = evalSupplier.get(); for (int c = 0; c < count; c++) { assertThat(toJavaObject(eval.eval(page), 0), testCase.getMatcher()); } ` the toJavaObject function returns a BytesRef with length=2 and content [81,89]. However, the assertThat function in junit4 receives a BytesRef parameter whose length is 10. Can you give me some clues? I can't find which variable is shared. Command to rerun the failed test case: `gradlew ':x-pack:plugin:esql:test' --tests "org.elasticsearch.xpack.esql.expression.function.scalar.string.LeftTests.testEvaluateInManyThreads {TestCase=Left basic test}" -Dtests.seed=44459C172243712 -Dtests.locale=lv-LV -Dtests.timezone=Asia/Irkutsk -Druntime.java=20`	2023-08-28 13:17:16 -04:00

94 commits
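
The `MV_FIRST` and `MV_LAST` entry (#103928) in the log above distinguishes selection by position from selection by relative size. A minimal Java sketch of that distinction follows. `MvSketch` and its methods are hypothetical stand-ins written for illustration, not the Elasticsearch implementation, and a multivalued field is modelled here as a plain `List<String>`.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical stand-ins for the ES|QL functions named in the log.
// This is an illustration only, not code from Elasticsearch.
final class MvSketch {

    // Models SPLIT: breaks a string on a literal delimiter and keeps positional order.
    static List<String> split(String value, String delimiter) {
        return Arrays.asList(value.split(Pattern.quote(delimiter)));
    }

    // Models MV_FIRST: picks by position, no comparison of values.
    static String mvFirst(List<String> values) {
        return values.get(0);
    }

    // Models MV_LAST: picks by position, no comparison of values.
    static String mvLast(List<String> values) {
        return values.get(values.size() - 1);
    }

    // Models MV_MIN: picks the smallest value (lexicographic for strings).
    static String mvMin(List<String> values) {
        return Collections.min(values);
    }

    // Models MV_MAX: picks the largest value (lexicographic for strings).
    static String mvMax(List<String> values) {
        return Collections.max(values);
    }

    public static void main(String[] args) {
        List<String> parts = split("foo;bar;baz", ";");
        System.out.println("first=" + mvFirst(parts) + " last=" + mvLast(parts));
        System.out.println("min=" + mvMin(parts) + " max=" + mvMax(parts));
    }
}
```

For the split of `foo;bar;baz`, the positional helpers return `foo` and `baz`, while the value-based ones return `bar` and `foo`. On an already sorted list the two pairs would agree, which matches the log's remark that the difference only shows up on unsorted positional data.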