Commit graph

12605 commits

Author SHA1 Message Date
Nik Everett
55532c8d6f
ESQL: All descriptions are a full sentence (#110791)
This asserts that all functions have descriptions that are complete
sentences.
2024-07-11 16:44:15 -04:00
Ioana Tagirta
e99aaad800
Document how to query for a specific feature within rank_features (#110749) 2024-07-11 16:19:14 +02:00
Stef Nestor
3f3ec0c5f9
(Doc+) Include cluster.blocks.* default settings (#110560)
👋 howdy, team! Mini doc update to include default settings for `cluster.blocks.*` settings.
2024-07-11 08:04:28 -06:00
Stef Nestor
0570d68358
(+Doc) Link setting reference for action.auto_create_index (#110558)
👋 howdy, team! Mini request to cross-link doc setting to its reference.
2024-07-11 07:57:18 -06:00
Carlos Delgado
f29b92cb07
Group vector queries into new section (#110722) 2024-07-11 14:45:35 +02:00
Jedr Blaszyk
19cc601bd1
[Connector API] Update mistake in docs (#110517) 2024-07-11 14:34:48 +02:00
Nik Everett
1256a49c3a
ESQL: Move description of commands in docs (#110714)
This copies the first line of the description of each command to just
under the syntax so that it's "in order", before the `Parameters`
section. That way if you are reading from top to bottom you see:
```
syntax
short description
parameter names and descriptions
long description
examples
```

I've also removed the `Description` section entirely if the description
was just one sentence. So in some cases that just isn't `long
description`.
2024-07-11 08:31:35 -04:00
Alexander Spies
e28654f8d2
Aggs: Scripted metric allow list docs (#109635)
* Document new settings
* Mention agg allow list in scripting security doc
2024-07-11 14:27:51 +02:00
Nik Everett
8f93bd00f9
ESQL: Document the profile option (#110727)
This adds some basic documentation for the `profile` option in ESQL but
doesn't really explain the results beyond "this is for human debugging."
We're not ready for any kind of specification for this thing, but it is
useful to look at.
2024-07-11 22:20:31 +10:00
Jonathan Buttner
3e7f7f4709
[ML] Adding pytorch oom to known issues (#110668)
* Adding pytorch oom to known issues

* Fixing section

* Updating text to exclude the pytorch version
2024-07-11 07:50:17 -04:00
Nik Everett
a1695ffbea
ESQL: Documents STATS on multivalue groups (#110712)
This documents running `STATS` on a multivalued column. It also removes
a long out of date warning about a limitation of grouping.
2024-07-10 15:49:46 -04:00
Iván Cea Fontenla
2901711c46
ESQL: Add boolean support to Max and Min aggs (#110527)
- Added support for Booleans on Max and Min
- Added some helper methods to BitArray (`set(index, value)` and `fill(from, to, value)`). This way, the container is more similar to other BigArrays, and it's easier to work with

Part of https://github.com/elastic/elasticsearch/issues/110346, as Max
and Min are dependencies of Top.
2024-07-10 23:10:32 +10:00
Benjamin Trent
ea9baab6c9
Fix search template examples by removing params on put (#110660) 2024-07-10 07:32:58 -04:00
David Kyle
1b6d44b55d
[DOCS] Fix typo: though -> through (#110636) 2024-07-09 07:30:42 -07:00
Iván Cea Fontenla
5d3512fb33
ESQL: Fix Max doubles bug with negatives and add tests for Max and Min (#110586)
`MAX()` currently doesn't work with doubles smaller than
`Double.MIN_VALUE` (Note that `Double.MIN_VALUE` returns the smallest
non-zero positive, not the smallest double).

This PR adds tests for Max and Min, and fixes the bug (Detected by the
tests).

Also, as the tests now generate the docs, replaced the old docs with the
generated ones, and updated the Max&Min examples.
2024-07-09 21:05:00 +10:00
Iván Cea Fontenla
38cd0b333e
ESQL: AVG aggregation tests and ignore complex surrogates (#110579)
Some work around aggregation tests, with AVG as an example:
- Added tests and autogenerated docs for AVG
- As AVG uses "complex" surrogates (A combination of functions), we can't trivially execute them without a complete plan. As I'm not sure it's worth it for most aggregations, I'm skipping those cases for now, as to avoid blocking other aggs tests.

The bad side effect of skipping those tests is that most tests in AvgTests are actually ignored (74 of 100)
2024-07-09 12:01:46 +02:00
elasticsearchmachine
333e1bbb81
Forward port release notes for v8.14.2 (#110538) 2024-07-08 10:36:34 -04:00
David Turner
58bb05df94
Clarify logs/errors re. publish addresses (#110570)
These warning logs and error messages assume some level of understanding
of Elasticsearch's networking config and are not particularly
actionable. This commit adds links to the relevant section of the
manual, rewords them a little to match the terminology used in the
manual, and also documents that each node must have its own publish
address, distinct from those of all other nodes.
2024-07-08 18:20:28 +10:00
Simon Cooper
ae4aa2ee77
Add known-issues for all affected releases for the feature upgrade issue (#110523) 2024-07-08 09:17:10 +01:00
Parker Timmins
eeedb356fd
Deprecate using slm privileges to access ilm (#110540)
Currently, read_slm privilege grants access to get the ILM status,
and manage_slm grants access to start/stop ILM. This access will
be removed in the future, but needs to be deprecated before
removal. Add deprecation warning to the read_slm and manage_slm docs.
2024-07-05 15:36:32 -05:00
Martijn van Groningen
5d791d4e27
Slightly adjust wording around potential savings mentioned in the description of the index.codec setting (#110468) 2024-07-05 12:06:46 +02:00
Albert Zaharovits
747fa59a2c
DOCS Query Roles (#110473)
These are the docs changes in relation to
https://github.com/elastic/elasticsearch/pull/108733
2024-07-05 19:46:48 +10:00
Oleksandr Kolomiiets
276ae121c2
Reflect latest changes in synthetic source documentation (#109501) 2024-07-04 09:48:04 -07:00
István Zoltán Szabó
ffea002a99
[DOCS] Adds 8.14.2 release notes to main. (#110471) 2024-07-04 13:10:00 +02:00
elasticsearchmachine
c5eb558371 Bump to version 8.16.0 2024-07-04 09:10:43 +00:00
Martijn van Groningen
6eaf171411
Add some information about the impact of index.codec setting. (#110413) 2024-07-04 09:20:19 +02:00
George Wallace
b6e9860919
Update role-mapping-resources.asciidoc (#110441)
made it clear that some characters need to be escaped properly

Co-authored-by: Jan Doberstein <jan.doberstein@elastic.co>
2024-07-03 13:00:52 -06:00
Lisa Cawley
748dbd51e4
[DOCS] Add serverless details in Elasticsearch security privileges (#109718) 2024-07-03 09:52:21 -07:00
Tim Grein
406b969c62
[Inference API] Add Google Vertex AI reranking docs (#110390) 2024-07-03 14:03:12 +02:00
Johannes Fredén
89cd966b24
Add bulk delete roles API (#110383)
* Add bulk delete roles API
2024-07-03 11:04:53 +02:00
Sylvain Wallez
e78bdc953a
ESQL: add Arrow dataframes output format (#109873)
Initial support for Apache Arrow's streaming format as a response for ES|QL. It triggers based on the Accept header or the format request parameter.

Arrow has implementations in every mainstream language and is a backend of the Python Pandas library, which is extremely popular among data scientists and data analysts. Arrow's streaming format has also become the de facto standard for dataframe interchange. It is an efficient binary format that allows zero-cost deserialization by adding data access wrappers on top of memory buffers received from the network.

This PR builds on the experiment made by @nik9000 in PR #104877

Features/limitations:
- all ES|QL data types are supported
- multi-valued fields are not supported
- fields of type _source are output as JSON text in a varchar array. In a future iteration we may want to offer the choice of the more efficient CBOR and SMILE formats.

Technical details:

Arrow comes with its own memory management to handle vectors with direct memory, reference counting, etc. We don't want to use this as it conflicts with Elasticsearch's own memory management.

We therefore use the Arrow library only for the metadata objects describing the dataframe schema and the structure of the streaming format. The Arrow vector data is produced directly from ES|QL blocks.

---------

Co-authored-by: Nik Everett <nik9000@gmail.com>
2024-07-03 10:29:57 +02:00
Carlos Delgado
30b32b6a46
semantic_text: Updated copy-to docs (#110350) 2024-07-03 10:18:40 +02:00
Fang Xing
8abc8857f2
[ES|QL] weighted_avg (#109993)
* weighted_avg
2024-07-02 18:29:02 -04:00
Matt Culbreth
81b8495388
Mark the Redact processor as Generally Available 2024-07-02 16:58:57 -04:00
Nik Everett
6fbc52d170
ESQL docs: Push down needs index and doc_values (#110353)
This adds a `NOTE` to each comparison saying that pushing the comparison
to the search index requires that the field have an `index` and
`doc_values`. This is unique compared to the rest of Elasticsearch which
only requires an `index` and it's caused by our insistence that
comparisons only return true for single-valued fields. We can in future
accelerate comparisons without `doc_values`, but we just haven't written
that code yet.
2024-07-02 14:22:50 -04:00
Kathleen DeRusso
7a1d532ffb
Pass over Sparse Vector docs for correctness (#110282)
* Remove legacy mentions of text expansion queries

* Add missing query_vector param to sparse_vector query docs

* Fix formatting errors in sparse vector query dsl doc

* Remove unnecessary test setup block
2024-07-02 13:37:25 -04:00
Felix Barnsteiner
cdbe092d90
Update docs now that keyword dimensions support ignore_above (#110385)
This is a follow-up from https://github.com/elastic/elasticsearch/pull/110337
2024-07-02 17:04:57 +02:00
Johannes Fredén
55476041d9
Add BulkPutRoles API (#109339)
* Add BulkPutRoles API
2024-07-02 15:45:39 +02:00
Tim Grein
390439ad9f
[Inference API] Add Google Vertex AI text embeddings docs (#110317) 2024-07-02 14:47:14 +02:00
Mike Pellegrini
d288dbf94e
Fix Semantic Query Parameter Formatting (#110355) 2024-07-02 08:07:35 -04:00
Iván Cea Fontenla
c89ee3b648
ESQL: Renamed TopList to Top (#110347)
Rename TopList aggregation to Top, after internal discussions
2024-07-02 03:52:24 +10:00
Jedr Blaszyk
3b827f6a8c
Create manage_connector privilege (#110128)
* Create manage_seaech_connector privilege

* `manage_search_connector` -> `manage_connector` and exclude connector secrets patterns from this privilege

* Add `monitor_connector` privilege

* Update Kibana system privilege to monitor_connector for telemetry

* Rename privilege to 'manage_connector_state'

Since privilege names are often namespaced and used with globs, we want to ensure that if there's a future privilege like `manage_connector_secrets`, that it is not implicitly included in this new privileg's <name>*. By extending the privilege name to include "_state", we better namespace this distinct from any "_secrets" namespace.

* Revert "Rename privilege to 'manage_connector_state'"

This reverts commit 70b89eee76.
After further discussion with the security team, this name change is not needed after all
since the secret management privileges aren't currently prefixed with "manage_"

---------

Co-authored-by: Sean Story <sean.j.story@gmail.com>
2024-07-01 12:41:28 -05:00
Tim Grein
99749aa277
[Inference API] Fix wording in Azure AI Studio docs (#110322) 2024-07-01 14:37:56 +02:00
Tim Grein
6accd6e247
[Inference API] Fix wording in delete-inference docs (#110321) 2024-07-01 13:37:30 +02:00
Tim Grein
35eae4029a
Fix typo in get-inference docs (retrives -> retrieves) (#110320) 2024-07-01 10:13:48 +02:00
István Zoltán Szabó
43f5696406
[DOCS] Refactors PUT inference API docs (#109812) 2024-07-01 10:12:16 +02:00
Nikolaj Volgushev
78c812f845
Fix security index settings docs (#110126)
Docs tweak with a typo fix and a clarification on how the two available
settings interact (essentially
https://github.com/elastic/elasticsearch/issues/27871). I'm also open to
including this info in the more generic settings API but feels like a
simple enough callout to add to the security API.
2024-07-01 18:07:15 +10:00
Kostas Krikellas
6ae652f90e
Support index sorting with nested fields (#110251)
This PR piggy-backs on recent changes in Lucene 9.11.1
(https://github.com/apache/lucene/pull/12829,
https://github.com/apache/lucene/pull/13341/), setting the parent doc
when nested fields are present. This allows moving nested documents
along with parent ones during sorting.

With this change, sorting is now allowed on fields outside nested
objects. Sorting on fields within nested objects is still not supported
(throws an exception).

Fixes #107349
2024-07-01 17:24:17 +10:00
Costin Leau
b906ce3d66
ESQL: change from quoting from backtick to quote (#108395)
* ESQL: change from quoting from backtick to quote

For historical reasons, the source declaration inside FROM command is
 treated as an identifier, using backticks (`) for escaping the value.
This is inconsistent since the source is not an identifier (field name)
 but an index name which has different semantics.
 `index` means a field name index while "index" means a literal with
 said value.

In case of FROM, the index name/location is more like a literal (also in
 unquoted form) than an identifier (that is a reference to a value).

This PR tweaks the grammar and plugs in the quoted string logic so that
 both the single quote (") and triple quote (""") are allowed.

* Update grammar

* Add more tests

* Add a few more tests

* Add extra test

* Update docs/changelog/108395.yaml

* Adress review comments

* Add doc note

* Revert test rename

* Fix quoting with remote cluster

* Update docs/reference/esql/source-commands/from.asciidoc

Co-authored-by: marciw <333176+marciw@users.noreply.github.com>

---------

Co-authored-by: Bogdan Pintea <bogdan.pintea@elastic.co>
Co-authored-by: Bogdan Pintea <pintea@mailbox.org>
Co-authored-by: marciw <333176+marciw@users.noreply.github.com>
Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2024-06-30 20:01:31 +03:00
George Wallace
dea593db3f
Update behavioral-analytics-start.asciidoc (#110271) 2024-06-28 09:01:48 -06:00