elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-04-24 23:27:25 -04:00

Author	SHA1	Message	Date
Carlos Delgado	9362cafcf3	[Docs] kNN vector rescoring for quantized vectors (#118425 ) (#120407 )	2025-01-18 03:30:11 +11:00
Kathleen DeRusso	13c4f5d593	[Docs] Add docs for new semantic text query functionality (#119520 ) (#119883 ) * Update docs with new semantic text functionality * PR feedback * PR feedback * PR Feedback	2025-01-10 03:39:00 +11:00
Benjamin Trent	e5dc17313d	Apply default k for knn query eagerly (#118774 ) (#119700 ) When originally added, the knn query didn't apply `top-k` restrictions to the query. Instead it would allow the resulting `num_candidate` to be combined with sibling queries without restricting to `top-size` results ahead of time. This honestly is confusing behavior and leads to some bugs in understand how it all works. This commit addresses this by eagerly gathering only `size` results when `k==null` before combining with other queries. To achieve the previous behavior, this can be done directly by setting `k==num_candidates` in the query. (cherry picked from commit `c18b48dbd4`)	2025-01-08 09:11:03 +11:00
Benjamin Trent	96cc923dcf	Update knn-query.asciidoc (#112833 ) (#112868 )	2024-09-13 21:40:59 +10:00
Mayya Sharipova	405e39660b	Support k parameter for knn query (#110233 ) Introduce an optional k param for knn query If k is not set, knn query has the previous behaviour: - `num_candidates` docs is collected from each shard. This `num_candidates` docs are used for combining with results with other queries and aggregations on each shard. - docs from all shards are merged to produce the top global `size` results If k is set, the behaviour instead is following: - `k` docs is collected from each shard. This `k` docs are used for combining results with other queries and aggregations on each shard. - similarly, docs from all shards are merged to produce the top global `size` results. Having `k` param makes it more intuitive for users to address their needs. They also don't need to care and can skip `num_candidates` param for this query as it is of more internal details to tune how knn search operates. Closes #108473	2024-06-28 09:59:28 -04:00
Liam Thompson	33a71e3289	[DOCS] Refactor book-scoped variables in `docs/reference/index.asciidoc` (#107413 ) * Remove `es-test-dir` book-scoped variable * Remove `plugins-examples-dir` book-scoped variable * Remove `:dependencies-dir:` and `:xes-repo-dir:` book-scoped variables - In `index.asciidoc`, two variables (`:dependencies-dir:` and `:xes-repo-dir:`) were removed. - In `sql/index.asciidoc`, the `:sql-tests:` path was updated to fuller path - In `esql/index.asciidoc`, the `:esql-tests:` path was updated idem * Replace `es-repo-dir` with `es-ref-dir` * Move `:include-xpack: true` to few files that use it, remove from index.asciidoc	2024-04-17 14:37:07 +02:00
Tommaso Teofili	7bff3b3bec	Add modelId and modelText to KnnVectorQueryBuilder (#106068 ) * Add modelId and modelText to KnnVectorQueryBuilder Use QueryVectorBuilder within KnnVectorQueryBuilder to make it possible to perform knn queries also when a query vector is not immediately available. Supplying a text_embedding query_vector_builder with model_text and model_id instead of the query_vector will result in the generation of a query_vector by calling inference on the specified model_id with the supplied model_text (during query rewrite). This is consistent with the way query vectors are built from model_id / model_text in KnnSearchBuilder (DFS phase).	2024-03-18 16:13:38 +01:00
Panagiotis Bailis	d471ccb5bb	Adding support for hex-encoded byte vectors on knn-search (#105393 )	2024-03-13 09:24:51 +02:00
Panagiotis Bailis	7ce8d76559	Making k and num_candidates optional for knn search (#101209 )	2024-02-01 15:43:09 +02:00
Mayya Sharipova	669d4ae9b9	Add hybrid search to knn query documentation (#104562 ) Relates to PR #98916 Closes elastic/search-docs-team#39	2024-01-18 15:53:48 -05:00
Mayya Sharipova	61c7483fc9	Make knn search a query (#98916 ) This introduced a new knn query: - knn query is executed during the Query phase similar to all other queries. - No k parameter, k defaults to size - num_candidates is a size of queue for candidates to consider while search a graph on each shard - For aggregations: "size" results are collected with total = size * shards. Aggregations will see size * shards results. - All filters from DSL are applied as post-filters, except: 1) alias filter is applied as pre-filter or 2) a filter provided as a parameter inside knn query.	2023-11-01 14:21:40 -04:00

11 commits