elasticsearch/docs/reference/mapping
Benjamin Trent 3aed0afb2b
Add new int4 quantization to dense_vector (#109317)
This adds a new quantization mechanism for HNSW and flat indices: `int4`
quantization, exposed through the `int4_hnsw` and `int4_flat` index
types. It further reduces the memory required for fast HNSW search, to
one eighth of what regular float32 values need.
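
As a rough illustration of how one of the new index types might be selected, the sketch below builds a mapping body for a `dense_vector` field with `index_options.type` set to `int4_hnsw` or `int4_flat`. The field name `embedding`, the dimension count, the `cosine` similarity, and the helper `int4_mapping` are assumptions chosen for the example, not part of this commit.

```python
import json


def int4_mapping(field="embedding", dims=1024, index_type="int4_hnsw"):
    """Build a mapping body for a dense_vector field using int4 quantization.

    The field name, dims and similarity are illustrative assumptions.
    """
    if index_type not in ("int4_hnsw", "int4_flat"):
        raise ValueError(f"unsupported int4 index type: {index_type}")
    return {
        "mappings": {
            "properties": {
                field: {
                    "type": "dense_vector",
                    "dims": dims,
                    "index": True,
                    "similarity": "cosine",
                    # The index type named in the commit message goes here.
                    "index_options": {"type": index_type},
                }
            }
        }
    }


# Print the JSON body that could be sent when creating an index.
print(json.dumps(int4_mapping(), indent=2))
```

The resulting JSON could be used as the request body when creating an index; switching `index_type` to `int4_flat` selects the brute-force variant instead of HNSW.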

An 8x reduction means that 1M 1024-dimension vectors go from requiring
3.8GB to 477MB.
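
The arithmetic behind these figures can be checked with a short sketch. It counts only the raw vector values (32 bits per dimension for float32, 4 bits for int4) and ignores the HNSW graph and any per-vector quantization metadata, which is an assumption of this example. The figures in the text line up with binary units: about 3.81 GiB and 0.477 GiB (roughly 488 MiB).

```python
GIB = 1024 ** 3
MIB = 1024 ** 2


def vector_storage_bytes(num_vectors, dims, bits_per_dim):
    """Raw storage for the vector values only, in bytes."""
    return num_vectors * dims * bits_per_dim // 8


float32_bytes = vector_storage_bytes(1_000_000, 1024, 32)
int4_bytes = vector_storage_bytes(1_000_000, 1024, 4)

print(f"float32: {float32_bytes / GIB:.2f} GiB")  # 3.81 GiB
print(f"int4:    {int4_bytes / GIB:.3f} GiB ({int4_bytes / MIB:.0f} MiB)")  # 0.477 GiB (488 MiB)
print(f"ratio:   {float32_bytes // int4_bytes}x")  # 8x
```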

Recall stays largely steady. There is some reduction, but it is
recoverable by slightly oversampling and reranking. For example, over
500k CohereV3 vectors, only 5 extra vectors need to be gathered
to achieve over 0.98 recall in a brute-force scenario.
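
The oversample-and-rerank idea can be sketched in a few lines: gather `k + extra` candidates using scores computed from the quantized vectors, then rerank those candidates with the original float values and keep the top `k`. This is a simplified stand-in, not the quantizer this commit adds: it uses a fixed value range, a plain 16-bucket scalar quantizer, small random vectors, and hypothetical helper names.

```python
import heapq
import random

LO, HI = -1.0, 1.0   # assumed value range used for quantization
LEVELS = 15          # int4 gives 16 buckets: 0..15


def quantize_int4(vec):
    """Map each float in [LO, HI] to an integer bucket in 0..15."""
    scale = LEVELS / (HI - LO)
    return [min(LEVELS, max(0, round((x - LO) * scale))) for x in vec]


def dequantize_int4(codes):
    """Map bucket indices back to approximate float values."""
    step = (HI - LO) / LEVELS
    return [LO + c * step for c in codes]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def exact_top_k(query, vectors, k):
    """Brute-force top-k by full-precision dot product."""
    return heapq.nlargest(k, range(len(vectors)),
                          key=lambda i: dot(query, vectors[i]))


def oversampled_top_k(query, vectors, codes, k, extra):
    """Gather k + extra candidates on quantized scores, rerank exactly."""
    candidates = heapq.nlargest(
        k + extra, range(len(vectors)),
        key=lambda i: dot(query, dequantize_int4(codes[i])))
    return heapq.nlargest(k, candidates,
                          key=lambda i: dot(query, vectors[i]))


random.seed(0)
vectors = [[random.uniform(LO, HI) for _ in range(16)] for _ in range(200)]
codes = [quantize_int4(v) for v in vectors]
query = [random.uniform(LO, HI) for _ in range(16)]

k = 10
truth = set(exact_top_k(query, vectors, k))
for extra in (0, 5):
    found = set(oversampled_top_k(query, vectors, codes, k, extra))
    print(f"extra={extra}: recall@{k} = {len(found & truth) / k:.2f}")
```

The recall printed here comes from toy random data and says nothing about the CohereV3 numbers above; the sketch only shows the mechanics of gathering a few extra candidates and reranking them.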

![recall](b47a79d0-020d-4baa-8199-41a932df00f7)
2024-06-18 00:15:43 +10:00
| Name | Last commit | Date |
| --- | --- | --- |
| dynamic | Add unmatch_mapping_type, and support array of types (#103171) | 2024-02-09 10:42:26 -05:00 |
| fields | Enable fallback synthetic source for token_count (#109044) | 2024-05-27 10:22:59 -07:00 |
| params | Add note about optional times and epochs (#105786) | 2024-03-05 08:44:03 -05:00 |
| types | Add new int4 quantization to dense_vector (#109317) | 2024-06-18 00:15:43 +10:00 |
| dynamic-mapping.asciidoc | [DOCS] Clean up index template xrefs (#67264) | 2021-01-11 12:38:09 -05:00 |
| explicit-mapping.asciidoc | [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) | 2024-04-17 14:37:07 +02:00 |
| fields.asciidoc | Support fetching _tier field value (#71379) | 2021-04-08 11:41:52 -07:00 |
| mapping-settings-limit.asciidoc | [DOCS] Refactor book-scoped variables in docs/reference/index.asciidoc (#107413) | 2024-04-17 14:37:07 +02:00 |
| params.asciidoc | [DOCS] Remove redirect pages (#88738) | 2023-05-24 12:32:46 +01:00 |
| removal_of_types.asciidoc | [DOCS] Fix double spaces (#71082) | 2021-03-31 09:57:47 -04:00 |
| runtime.asciidoc | Fix backwards description in runtime fields documentation (#94608) (#94642) | 2023-03-22 11:53:35 -04:00 |
| types.asciidoc | [DOCS] Adds docs to semantic text (#108311) | 2024-05-31 16:56:07 +02:00 |