Commit graph

1356 commits

Author SHA1 Message Date
István Zoltán Szabó
9b7d808bf4
[DOCS] Fixes adaptive_allocations examples (#113248)
Co-authored-by: Jan Kuipers <148754765+jan-elastic@users.noreply.github.com>
2024-09-20 11:31:04 +02:00
Liam Thompson
163b2c781f
[DOCS] Fix reranking IA, move retrievers to search api overview (#112949) 2024-09-19 17:20:52 +02:00
István Zoltán Szabó
5153905e7e
[DOCS] Gives more details to the load data step of the semantic search tutorials (#113088)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-09-18 11:37:35 +02:00
István Zoltán Szabó
2371caeb15
[DOCS] Fixes response object indentation in semantic text tutorial (#112915) 2024-09-16 14:40:54 +02:00
István Zoltán Szabó
c2bd0de278
[DOCS] Simplifies semantic_text tutorial by removing copy_to field (#112864) 2024-09-13 16:50:41 +02:00
István Zoltán Szabó
a151bb0bfb
[DOCS] Rework semantic search main page (#112452)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>
2024-09-12 14:09:34 +02:00
Simon Cooper
a36d90cf34
Use CLDR locale provider on JDK 23+ (#110222)
JDK 23 removes the COMPAT locale provider, leaving CLDR as the only option. This commit configures Elasticsearch
to use the CLDR provider when on JDK 23, but still use the existing COMPAT provider when on JDK 22 and below.

This causes some differences in locale behaviour; this also adapts various tests to still work whether run on COMPAT or CLDR.
2024-09-04 13:42:40 +01:00
István Zoltán Szabó
adb23531f9
[DOCS] Adds Google Vertex AI tutorial (#112339)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-08-30 13:17:59 +02:00
Liam Thompson
aa57a1553e
[DOCS] Rewrite "What is Elasticsearch?" (Part 1) (#112213) 2024-08-29 10:13:30 +02:00
weizijun
b9dea69b5c
[Inference API] Add Docs for AlibabaCloud AI Search Support for the Inference API (#112273)
Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
2024-08-29 09:17:27 +02:00
Panagiotis Bailis
7563a724f0
Updating retriever documentation to better explain how filters are applied (#112201) 2024-08-26 16:15:31 +03:00
Panagiotis Bailis
785fe5384b
Adding support for allow_partial_search_results in PIT (#111516) 2024-08-26 12:56:08 +03:00
Liam Thompson
84ddd6c7af
[DOCS] Update rank_constant value in retriever example (#112056) 2024-08-21 15:11:19 +02:00
Aurélien FOUCRET
2d7724b3f8
LTR documentation - Remove tech preview note. (#111803) 2024-08-12 17:57:07 +02:00
Liam Thompson
06e24a19b7
[DOCS] Add retriever examples, semantic reranking step-by-step guide (#111793) 2024-08-12 17:45:25 +02:00
Quentin Pradet
7b7c310ea5
[DOCS] Fix elasticsearch-py helpers page link (#111789) 2024-08-12 09:38:02 +02:00
Liam Thompson
4173feefc2
[DOCS] Update CCS matrix for 8.15 (#111587) 2024-08-05 15:31:23 +02:00
Woody Walton
715449ec00
Update paginate-search-results.asciidoc (#111565)
Modified broken link to the Python docs page
2024-08-05 09:15:21 -04:00
Kathleen DeRusso
02c494963a
[Query rules] Add exclude query rule type (#111420)
* Cleanup: Remove pinned IDs from applied rules in favor of single applied docs

* Add support for query rules of type exclude, to exclude specified documents from result sets

* Support exluded documents that specify the _index as well as the _id

* Cleanup

* Update docs/changelog/111420.yaml

* Update docs

* Spotless

* PR feedback - docs updates

* Apply PR feedback

* PR feedback

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2024-08-02 08:03:54 -04:00
David Turner
586405d11f
Remove trappy timeout from ClusterSearchShardsRequest (#111442)
Exposes the `?master_timeout` parameter to the REST API and sets it
appropriately on internal/test requests.

Relates #107984
2024-07-31 08:53:24 +01:00
István Zoltán Szabó
3f7f5ac48a
[DOCS] Adds elser service to the inference tutorial (#111406)
* [DOCS] Adds elser service to the inference tutorial.

* [DOCS] Amends search intro text.

* [DOCS] Addresses feedback.
2024-07-30 14:49:58 +02:00
Liam Thompson
38f301a4de
[DOCS] Additional reranking docs updates (#111350)
* Simplify overview, specify available rerank options

* Update links

* Clarify eland-uploaded models is for hugging face
2024-07-29 14:37:45 +02:00
weizijun
2e0d0e2c90
fix text_similarity_reranker doc (#111256) 2024-07-29 12:17:25 +02:00
Liam Thompson
56f5dfb9a0
[DOCS] Update retriever reranker options (#111337)
* [DOCS] Update retriever reranker options

* Fix typo
2024-07-26 15:47:43 +02:00
Liam Thompson
3de980f8fc
[DOCS] Fix rendering bug (#111025)
Closes https://github.com/elastic/elasticsearch/issues/111023
2024-07-18 14:09:09 +02:00
Carlos Delgado
6191fe3b16
Clarify synonyms docs (#110822) 2024-07-18 10:20:26 +02:00
Liam Thompson
b535df78df
[DOCS] Retrievers and rerankers (#110007)
Co-authored-by: Adam Demjen <demjened@gmail.com>
2024-07-18 09:41:00 +02:00
Liam Thompson
6590894c99
[DOCS] Add note about ML model 502 timeout when using Create inference API (#110835)
* [DOCS] Add note about ml model 502 timeout

* Add note to API ref
2024-07-15 12:19:21 +02:00
Mark J. Hoy
560d4048d2
[Inference API] Add Docs for Amazon Bedrock Support for the Inference API (#110594)
* Add Amazon Bedrock Inference API to docs

* fix example errors

* update semantic search tutorial; add changelog

* fix typo

* fix error; accept suggestions
2024-07-12 10:14:54 -04:00
Benjamin Trent
ea9baab6c9
Fix search template examples by removing params on put (#110660) 2024-07-10 07:32:58 -04:00
Kathleen DeRusso
7a1d532ffb
Pass over Sparse Vector docs for correctness (#110282)
* Remove legacy mentions of text expansion queries

* Add missing query_vector param to sparse_vector query docs

* Fix formatting errors in sparse vector query dsl doc

* Remove unnecessary test setup block
2024-07-02 13:37:25 -04:00
George Wallace
dea593db3f
Update behavioral-analytics-start.asciidoc (#110271) 2024-06-28 09:01:48 -06:00
Kathleen DeRusso
19fc0d9cad
Deprecate text_expansion and weighted_tokens queries (#109880) 2024-06-27 13:24:57 -04:00
István Zoltán Szabó
31f0253b43
[DOCS] Adds link to ES-Cohere notebook and clarifies requirements. (#110195) 2024-06-26 17:22:40 +02:00
Pius
79623c7609
Update search-application-api.asciidoc (#110113)
Add a subsection about cross cluster search support (or the lack of).
2024-06-26 12:20:28 +02:00
Benjamin Trent
1c1733d823
Add some docs explaining filter performance and behavior for HNSW (#110108) 2024-06-25 08:42:24 -04:00
Kathleen DeRusso
41a61b069b
Mark Query Rules as GA (#110004)
* Mark query rules APIs as stable

* Remove preview label from docs

* Update docs/changelog/110004.yaml
2024-06-21 15:26:51 -04:00
Benjamin Trent
3aed0afb2b
Add new int4 quantization to dense_vector (#109317)
This adds a new quantization mechanism for HNSW and flat indices. Here
we add `int4` quantization via the `int4_hnsw` and `int4_flat` index
types. This quantization methodology further reduces the memory required
for fast HNSW, meaning that the memory required is 8x smaller than with
regular float32 values. 

8x reduction means that 1M 1024 dimension vectors goes from requiring
3.8GB to 477MB.

Recall continues to stay steady, there is some reduction that is
recoverable via slightly oversampling and reranking. For example over
500k CohereV3 vectors, only 5 extra vectors are required to be gathered
to achieve over 0.98 recall in a brute-force scenario.

![recall](b47a79d0-020d-4baa-8199-41a932df00f7)
2024-06-18 00:15:43 +10:00
Benjamin Trent
a5fbfe81b2 Merge remote-tracking branch 'upstream/main' into lucene_snapshot_9_11 2024-06-07 07:24:43 -04:00
Panagiotis Bailis
1c3b3d8f11
Adding support for explain in rrf (#108682) 2024-06-07 11:09:06 +03:00
Benjamin Trent
d3561f9cf3 Merge remote-tracking branch 'upstream/main' into lucene_snapshot_9_11 2024-06-06 18:22:08 -04:00
István Zoltán Szabó
d89dae2a32
[DOCS] Modifies semantic search-related docs to refer to the semantic_text workflow (#109418)
Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com>
2024-06-06 16:45:46 +02:00
Benjamin Trent
ac53d6020b Merge remote-tracking branch 'upstream/main' into lucene_snapshot_9_11 2024-06-05 12:38:23 -04:00
Mark J. Hoy
80a22ec046
[Inference API] Add Docs for Mistral Embedding Support for the Inference API (#109319)
* Initial docs for put-inference for Mistral

* adds mistral embeddings to tutorial; add changelog

* update mistral text and dimensions

* fix mistral spelling error

* fix azure AI studio; fix Mistral label

* fix auto-formatted items

* change pipeline button back to azure openai

* put proper Azure AI Studio include in

* fix missing azure-openai; fix huggingface hidden

* fix mistral tab for reindex

* re-add Mistral service settings to put inference
2024-06-05 11:23:29 -04:00
Benjamin Trent
9cd123d6cc Merge remote-tracking branch 'upstream/main' into lucene_snapshot_9_11 2024-06-02 16:46:19 -04:00
István Zoltán Szabó
95ce898436
[DOCS] Adds docs to semantic text (#108311)
Co-authored-by: Carlos Delgado <6339205+carlosdelest@users.noreply.github.com>
Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>
Co-authored-by: Kathleen DeRusso <kathleen.derusso@elastic.co>
2024-05-31 16:56:07 +02:00
elasticsearchmachine
7b5925f4b6 Merge remote-tracking branch 'origin/main' into lucene_snapshot 2024-05-30 10:01:52 +00:00
István Zoltán Szabó
1413c67d99
[DOCS] Amends inference reference docs and tutorials (#109159)
* [DOCS] Fixes inference tutorial widgets.

* [DOCS] Adds link to notebooks, rearranges sections in PUT inference API docs.
2024-05-29 17:43:10 +02:00
Liam Thompson
b6241711ef
[DOCS] Update CCS matrix for 8.14 (#109142) 2024-05-29 14:00:19 +02:00
ChrisHegarty
45a51d558c Merge branch 'main' into lucene_snapshot 2024-05-23 14:03:51 +01:00