Commit graph

12605 commits

Author SHA1 Message Date
Kathleen DeRusso
c7ec808f45
[Docs] Add docs for new semantic text query functionality (#119520)
* Update docs with new semantic text functionality

* PR feedback

* PR feedback

* PR Feedback
2025-01-09 11:11:20 -05:00
David Kyle
547a567c13
[DOCS][ML] Document the text_expansion task type (#119581) 2025-01-09 15:44:01 +00:00
Arianna Laudazzi
70e5a67904
[AutoOps] Reference AutoOps solution on troubleshooting pages (#119630)
* Reference AutoOps on troubleshooting pages

* Integrate reviewer's feedback
2025-01-09 16:24:20 +01:00
shainaraskas
d37e1bd14d
Fix broken anchors (#119802) 2025-01-09 09:15:00 -05:00
Michael Peterson
230acb8ac5
Resolve/cluster should mark remotes as not connected when a security exception is thrown (#119793)
Fixes two bugs in _resolve/cluster.

First, the code that detects older clusters versions and does a fallback to the _resolve/index
endpoint was using an outdated string match for error detection. That has been adjusted.

Second, upon security exceptions, the _resolve/cluster endpoint was marking the clusters as connected: true,
under the assumption that all security exceptions related to cross cluster calls and remote index access were
coming from the remote cluster, but that is not always the case. Some cross-cluster security violations can
be detected on the local querying cluster after issuing the remoteClient.execute call but before the transport
layer actually sends the request remotely. So we now mark the connected status as false for all ElasticsearchSecurityException cases. End user docs have been updated with this information.
2025-01-09 08:56:57 -05:00
Jedr Blaszyk
750a0ab846
[Connector API] Support soft-deletes of connectors (#118669)
* [Connector API] Add interface for soft-deletes

* Define connector deleted system index

* Got soft-delete logic working

* Add unit tests

* Add yaml e2e test and attempt to update permissions

* Fix permissions

* Update docs

* Fix docs

* Update docs/changelog/118282.yaml

* Change logic

* Fix tests

* Remove unnecessary privilege from yaml rest test

* Update changelog

* Update docs/changelog/118669.yaml

* Adapt yaml tests

* Undo changes to muted-tests.yml

* Fix compilation issue after other PR got merged

* Exclude soft-deleted connector from checks about index_name already in use

* Update docs/reference/connector/apis/get-connector-api.asciidoc

Co-authored-by: Tim Grein <tim@4greins.de>

* Update rest-api-spec/src/main/resources/rest-api-spec/api/connector.list.json

Co-authored-by: Tim Grein <tim@4greins.de>

* Adapt comments, add connector wire serializing test

* Introduce new transport versions for passing the delete flag

* Get rid of wire serialisation, use include_deleted instead of deleted flag

* Remove unused import

* Final tweaks

* Adapt variable name in rest layer

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Tim Grein <tim@4greins.de>
2025-01-09 12:20:28 +00:00
Liam Thompson
996a4f8e7d
Update rejected-requests.asciidoc (#119547) (#119673)
I believe this is a typo, as in our 8.16.1 cluster this field appears to be called `combined_coordinating_and_primary`

Co-authored-by: Ian Lee <IanLee1521@gmail.com>
2025-01-09 11:01:46 +01:00
Lisa Cawley
385040d2b5
[DOCS] Fix link in repository-s3.asciidoc (#119719) 2025-01-08 12:02:36 -08:00
James Baiera
c3839e1f76
Add selector syntax to index expressions (#118614)
This PR introduces a new syntactical feature to index expression resolution: The selector.

Selectors, denoted with a :: followed by a recognized suffix will allow users to specify which component of 
an index abstraction they would like to operate on within an API call. In this case, an index abstraction is a 
concrete index, data stream, or alias; Any abstraction that can be resolved to a set of indices/shards. We 
define a component of an index abstraction to be some searchable unit of the index abstraction.
2025-01-08 11:37:03 -05:00
Ievgen Degtiarenko
fd1be8ce6f
Hash functions (#118938)
This change adds md5, sha1 and sha256 hash functions.
2025-01-08 16:44:15 +01:00
Albert Zaharovits
12eb1cfda1
Metrics for indexing failures due to version conflicts (#119067)
This exposes new OTel node and index based metrics for indexing failures due to version conflicts.

In addition, the /_cat/shards, /_cat/indices and /_cat/nodes APIs also expose the same metric, under the newly added column iifvc.

Relates: #107601
2025-01-08 12:34:37 +00:00
Benjamin Trent
c18b48dbd4
Apply default k for knn query eagerly (#118774)
When originally added, the knn query didn't apply `top-k` restrictions
to the query. Instead it would allow the resulting `num_candidate` to be
combined with sibling queries without restricting to `top-size` results
ahead of time.

This honestly is confusing behavior and leads to some bugs in understand
how it all works.

This commit addresses this by eagerly gathering only `size` results when
`k==null` before combining with other queries. 

To achieve the previous behavior, this can be done directly by setting
`k==num_candidates` in the query.
2025-01-08 07:40:34 +11:00
Joan Fontanals
53a16ee967
[docs] Add documentation for JinaAI service (#118782) 2025-01-07 18:04:12 +00:00
Svilen Mihaylov
93c349cc76
Add ability to set "max_analyzed_offset" implicitly to "index.highlight (#118895)
Add ability to set "max_analyzed_offet" implicitly to "index.highlight
.max_analyzed_offset", by setting it excplicitly to "-1".

Closes #112822
2025-01-07 11:19:07 -05:00
Liam Thompson
92bb091521
[DOCS] RAG overview (#119590) 2025-01-07 15:18:39 +01:00
Srikanth Manvi
8ded6c4568
Update search-application-api.asciidoc (#118494)
Fixing minor Typo
2025-01-07 13:30:14 +01:00
Liam Thompson
c7b61bdc63
[DOCS] Add full-text search overview (#119462) 2025-01-06 17:56:35 +00:00
shainaraskas
17111e1258
[DOCS] Concept cleanup 2 - ES settings (#119373) 2025-01-06 12:07:15 -05:00
Benjamin Trent
a5716c8f99
Add new experimental rank_vectors mapping for late-interaction second order ranking (#118804)
Late-interaction models are powerful rerankers. While their size and
overall cost doesn't lend itself for HNSW indexing, utilizing them as
second order "brute-force" reranking can provide excellent boosts in
relevance. At generally lower inference times than large cross-encoders.


This commit exposes a new experimental `rank_vectors` field that allows
for maxSim operations. This unlocks the initial, and most common use of
late-interaction dense-models. 

For example, this is how you would use it via the API:

```
PUT index
{
  "mappings": {
    "properties": {
      "late_interaction_vectors": {
        "type": "rank_vectors"
      }
    }
  }
}
```

Then to index:

```
POST index/_doc
{
  "late_interaction_vectors": [[0.1, ...],...]
}
```

For querying, scoring can be exposed with scripting:

```
POST index/_search
{
  "query": {
    "script_score": {
      "query": {
        "match_all": {}
      },
      "script": {
        "source": "maxSimDotProduct(params.query_vector, 'my_vector')",
        "params": {
          "query_vector": [[0.42, ...], ...]
        }
      }
    }
  }
}
```

Of course, the initial ranking should be done before re-scoring or
combining via the `rescore` parameter, or simply passing whatever first
phase retrieval you want as the inner query in `script_score`.
2025-01-07 04:06:59 +11:00
Dan Rubinstein
f5cffbf300
Deprecating data_frame_transforms roles (#117519)
* Deprecating data_frame_transforms roles

* Update docs/changelog/117519.yaml

* Update changelog

* Removing deprecation warning

* Cleaning up unused role retrieval function

* Update docs/changelog/117519.yaml

* Update changelog with breaking change details

* Revert ClusterPrivilegeResolver

* Remove Deprecated Transform Role permissions

* Update docs/changelog/117519.yaml

Co-authored-by: Nikolaj Volgushev <n1v0lg@users.noreply.github.com>

* Revert "Remove Deprecated Transform Role permissions"

This reverts commit 4599256dcf.

* Revert docs, update changelog

* Revert privilege tests

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Nikolaj Volgushev <n1v0lg@users.noreply.github.com>
Co-authored-by: Pat Whelan <pat.whelan@elastic.co>
2025-01-06 11:00:16 -05:00
István Zoltán Szabó
c32cec165a
[DOCS] Adds note on default ELSER and E5 endpoints to service pages (#119507)
* [DOCS] Adds note on default ELSER and E5 enpoints to service pages.

* [DOCS] Further edits.
2025-01-03 15:53:17 +01:00
Edoardo Tenani
171424727a
[DOCS] update recommended template priority for override to 500 (#119450) 2025-01-03 11:12:25 +01:00
Stanislav Malyshev
0292905ef6
Add ESQL telemetry collection (#119474)
* Add ESQL telemetry collection
2025-01-02 14:05:21 -07:00
shainaraskas
9862a43cb6
[DOCS] Concept cleanup (extracting conceptual docs from reference content pt I of ?) (#119016) 2025-01-02 15:05:08 +00:00
Lisa Cawley
8a821f0a96
[DOCS] More links to new API site (#119380) 2024-12-31 12:02:59 -08:00
Lisa Cawley
ba8beecdb0
[DOCS] More links to new API site (#119377) 2024-12-31 11:32:29 -08:00
Ryan Ernst
7fb6ca447a
Add ephemeral node id to shutdown metadata (#118722)
Shutdown metadata is keyed on node id. This makes sense since only one
node with a given node id can exist within a cluster. However, it is
possible that shutdown was initiated for one instance of a node, but
that node is restarted. This commit adds the ephemeral node id to
shutdown metadata so that nodes with the same id but different ephemeral
id can be distinguished.
2024-12-30 10:13:58 -08:00
Lisa Cawley
5e0fbef58b
[DOCS] Link to new API site (#119038)
Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2024-12-30 16:52:16 +00:00
Max Hniebergall
7e126d75a4
[Inference API] Update get-inference docs to use elasticsearch service (#119238)
* update get-inference docs to use elasticsearch service

* fix indentation
2024-12-30 10:11:40 -05:00
Jim Ferenczi
12e86b1cd0
Refactor semantic text field to align with text field behaviour (#119183)
Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>
2024-12-30 09:31:02 +01:00
Stef Nestor
ec35dc2049
(Doc+) Videos of rejected requests (#119311)
* (Doc+) Videos of rejected requests
2024-12-27 13:49:57 -07:00
Stef Nestor
c8f2703cdf
(Doc+) Flush out Slow Logs (#118518)
* (Doc+) Slow Logs

---------

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2024-12-27 10:12:00 -07:00
Stef Nestor
db1c41b41d
(Doc+) Enrich run on ingest+data nodes not coordinating-only (#119136)
* (Doc+) Enrich run on ingest+data nodes not coordinating-only

👋 howdy, team! I'm not otherwise finding it so documenting https://github.com/elastic/elasticsearch/issues/95969 in ES docs

> Currently we tell users of enrich that they should co-locate the nodes that perform the enrichment (ingest nodes) with the actual enrich data so that enrich operations don't require a remote search operation.

* feedback

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>

---------

Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>
2024-12-25 08:17:18 -07:00
Niels Bauman
9641c7623f
Run TransportGetComponentTemplateAction on local node (#116868)
This action solely needs the cluster state, it can run on any node.
Additionally, it needs to be cancellable to avoid doing unnecessary work
after a client failure or timeout.

The `?local` parameter becomes a no-op and is marked as deprecated.

Relates #101805
Relates #107984
2024-12-23 20:01:21 +00:00
Niels Bauman
dac3bfd52e
Fix usage API docs test (#119192)
This ensures the usage API docs tests are passing again. We achieve this
by: 1. ignoring the contents of `inference.models` because the models
might not yet have been initialized and 2. adding missing fields to the
`logsdb` usage.
2024-12-23 16:03:47 +01:00
Tanguy Leroux
103d29f0c8
Revert min_read_only_index_version in doc after #119083 (#119125) 2024-12-20 05:38:09 +11:00
Carlos Delgado
6ee641bdfd
ESQL - Update WHERE command docs with MATCH and full text functions examples (#118987) 2024-12-19 16:44:53 +01:00
Marci W
696ee806e7
Revise content to match new troubleshooting guidelines (#118033)
* Revise to match new guidelines

* Address review suggestions and comments

* Apply suggestions from review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Apply suggestions from review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Apply suggestions from review

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Apply suggestions from review

---------

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2024-12-19 10:09:14 -05:00
kosabogi
78bd9ec6f0
[DOCS] Updates SharePoint Online page (#118318) 2024-12-19 11:43:34 +01:00
Jim Ferenczi
6f261067f2
Add a generic rescorer retriever based on the search request's rescore functionality (#118585)
This pull request introduces a new retriever called `rescorer`, which leverages the `rescore` functionality of the search request.  
The `rescorer` retriever re-scores only the top documents retrieved by its child retriever, offering fine-tuned scoring capabilities.  

All rescorers supported in the `rescore` section of a search request are available in this retriever, and the same format is used to define the rescore configuration.  

<details>
<summary>Example:</summary>

```yaml
  - do:
      search:
        index: test
        body:
          retriever:
            rescorer:
              rescore:
                window_size: 10
                query:
                  rescore_query:
                    rank_feature:
                      field: "features.second_stage"
                      linear: { }
                  query_weight: 0
              retriever:
                standard:
                  query:
                    rank_feature:
                      field: "features.first_stage"
                      linear: { }
          size: 2
```

</details>

Closes #118327

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-12-18 19:47:12 +00:00
Bogdan Pintea
bc3b629d8d
ESQL: Docs: add example of date bucketing with offset (#116680)
Add an example of how to create date histograms with an offset.

Fixes #114167
2024-12-18 17:12:14 +01:00
Sean Story
5255bfb6fb
Replace 'ent-search-generic' with 'search-default' pipeline (#118899)
* Replace 'ent-search-generic' with 'search-default' pipeline

* missed one

* [CI] Auto commit changes from spotless

---------

Co-authored-by: elasticsearchmachine <infra-root+elasticsearchmachine@elastic.co>
2024-12-18 08:03:08 -06:00
Liam Thompson
54fa07450a
[DOCS] Make Wolfi hardened Docker option more prominent (#118755) 2024-12-18 13:54:17 +01:00
Ievgen Degtiarenko
7cf28a910e
ESQL Add esql hash function (#117989)
This change introduces esql hash(alg, input) function that relies on the Java MessageDigest to compute the hash.
2024-12-18 09:56:42 +01:00
Satyam Mishra
41c3dde414
Updated ilm docs as per the issue (#118148)
This PR updates the Elasticsearch ILM tutorial for the newer screenshot
and the primary shard update in the text.
2024-12-18 10:32:39 +11:00
Pius Fung
1054503ba8
Update start-trained-model-deployment.asciidoc (#118887)
Updating with changes in https://github.com/elastic/elasticsearch/pull/115041
2024-12-17 13:03:28 -08:00
Tanguy Leroux
f3a1664906
Add min. read-only index version compatible to DiscoveryNode (#118744)
#118443 added a new index version for indices that can be opened in read-only mode by Lucene. This change adds this information to the discovery node's VersionInformation and the transport serialization logic.

In a short future we'd like to use this information in methods like IndexMetadataVerifier#checkSupportedVersion and NodeJoineExecutor to allow opening indices in N-2 versions as read-only indices on ES V9.
2024-12-17 18:59:49 +01:00
Luigi Dell'Aquila
1d2840ece1
EQL: add support for partial search results (#116388)
Allow queries to succeed if some shards are failing
2024-12-17 17:13:52 +01:00
Adam Szaraniec
f64c05ac32
Update alias.asciidoc (#118553)
Add section about removing index
2024-12-17 10:09:22 -06:00
Rene Groeschke
6516a535ab
Add wolfi documentation from 8.16 branch (#118835)
port from https://github.com/elastic/elasticsearch/pull/118684
2024-12-17 23:45:27 +11:00