mirror of
https://github.com/elastic/elasticsearch.git
synced 2025-04-22 06:07:55 -04:00
A Lucene limitation on doc values for UTF-8 fields does not allow us to write keyword fields larger than 32K. This limits the number of dimension fields we can map for time series indices. Before this change, the tsid was created by concatenating dimension field names and values into a keyword field. To overcome this limitation, we hash the tsid. This PR is intended as a draft to test different options. Note that, as a side effect, hashing reduces the size of the tsid field, since far less data is stored. However, we expect tsid hashing to affect compression of doc values and to result in a larger storage footprint. The effect on query latency needs to be evaluated too. Resolves #93564 |
||
---|---|---|
.. | ||
src/javaRestTest/java/org/elasticsearch/upgrades | ||
build.gradle |
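
The commit message above contrasts a concatenated tsid with a hashed one. The following is a minimal sketch of that idea, not the code from this change: the class `TsidSketch`, the `name=value;` encoding, the constant `LIMIT_BYTES`, and the choice of SHA-256 are all assumptions made for illustration. The actual encoding and hash function used by Elasticsearch may differ.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration; none of these names come from Elasticsearch.
class TsidSketch {
    // Stand-in for the "32K" limit mentioned in the commit message.
    static final int LIMIT_BYTES = 32 * 1024;

    // Old approach (as described): concatenate dimension names and values.
    // Sorting by name makes the result independent of insertion order.
    static byte[] concatenatedTsid(Map<String, String> dimensions) {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, String> e : new TreeMap<>(dimensions).entrySet()) {
            sb.append(e.getKey()).append('=').append(e.getValue()).append(';');
        }
        return sb.toString().getBytes(StandardCharsets.UTF_8);
    }

    // New approach (sketched): hash the concatenation to a fixed-size value.
    // SHA-256 is an assumption here; it always yields 32 bytes.
    static byte[] hashedTsid(Map<String, String> dimensions) {
        try {
            MessageDigest digest = MessageDigest.getInstance("SHA-256");
            return digest.digest(concatenatedTsid(dimensions));
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 is required on every Java platform", e);
        }
    }

    public static void main(String[] args) {
        Map<String, String> dims = new TreeMap<>();
        for (int i = 0; i < 5000; i++) {
            dims.put("dim" + i, "value" + i);
        }
        int plain = concatenatedTsid(dims).length;
        int hashed = hashedTsid(dims).length;
        System.out.println("concatenated bytes: " + plain + " (over limit: " + (plain > LIMIT_BYTES) + ")");
        System.out.println("hashed bytes: " + hashed);
    }
}
```

With many dimensions the concatenated form grows without bound, while the hashed form stays at a fixed size. The trade-off named in the commit message still applies: hash output looks random, so it is likely to compress worse than the structured concatenation.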