elasticsearch/docs/reference/search
Benjamin Trent 3aed0afb2b
Add new int4 quantization to dense_vector (#109317)
This adds a new quantization mechanism for HNSW and flat indices. Here
we add `int4` quantization via the `int4_hnsw` and `int4_flat` index
types. This quantization methodology further reduces the memory required
for fast HNSW, meaning that the memory required is 8x smaller than with
regular float32 values. 

8x reduction means that 1M 1024 dimension vectors goes from requiring
3.8GB to 477MB.

Recall continues to stay steady, there is some reduction that is
recoverable via slightly oversampling and reranking. For example over
500k CohereV3 vectors, only 5 extra vectors are required to be gathered
to achieve over 0.98 recall in a brute-force scenario.

![recall](b47a79d0-020d-4baa-8199-41a932df00f7)
2024-06-18 00:15:43 +10:00
..
search-your-data Add new int4 quantization to dense_vector (#109317) 2024-06-18 00:15:43 +10:00
suggesters Update term-suggest.asciidoc (#86780) 2023-05-09 12:23:45 +02:00
async-search.asciidoc [Doc] Privileges required to retrieve the status of async searches 2024-04-02 09:35:02 +03:00
clear-scroll-api.asciidoc [DOCS] Fix case for 'Boolean' (#64299) 2020-10-29 09:04:43 -04:00
count.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
explain.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
field-caps.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
knn-search.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
multi-search-template-api.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
multi-search.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
point-in-time-api.asciidoc Add support for index_filter to open pit (#102388) 2023-11-21 15:35:49 +01:00
profile.asciidoc Adding aggregations support for the _ignored field (#101373) 2024-04-29 16:41:34 +02:00
rank-eval.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
render-search-template-api.asciidoc [DOCS] Overhaul search template docs (#72583) 2021-06-30 16:03:49 -04:00
retriever.asciidoc [DOCS] Add crosslink to update retriever.asciidoc (#108608) 2024-05-15 10:10:08 +02:00
rrf.asciidoc Adding support for explain in rrf (#108682) 2024-06-07 11:09:06 +03:00
scroll-api.asciidoc [DOCS] Update alias references (#73427) 2021-05-27 16:00:57 -04:00
search-shards.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
search-template-api.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
search-vector-tile-api.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
search.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00
suggesters.asciidoc Remove 'under development' note in suggester docs (#84366) 2022-02-24 13:27:03 -08:00
terms-enum.asciidoc Limit _terms_enum prefix size (#97488) 2023-07-10 12:21:07 +02:00
validate.asciidoc [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) 2024-04-17 14:37:07 +02:00