elasticsearch

mirror of https://github.com/elastic/elasticsearch.git synced 2025-06-28 17:34:17 -04:00

Author	SHA1	Message	Date
Pm Ching	d68f2fa4a6	fix a couple of docs typos (#112901 )	2024-09-20 18:34:24 +03:00
Bogdan Pintea	f7ff00f645	ESQL: Align year diffing to the rest of the units in DATE_DIFF: chronological (#113103 ) This will correct/switch "year" unit diffing from the current integer subtraction to a chronological subtraction. Consequently, two dates are now (at least) one year apart only if (at least) a full calendar year separates them. The previous implementation simply subtracted the year part of the dates. Note: this departs from ES SQL's implementation of the same function, which itself is aligned with MS SQL's implementation, which works equivalently to an integer subtraction. Fixes #112482.	2024-09-20 20:21:29 +10:00
István Zoltán Szabó	9b7d808bf4	[DOCS] Fixes adaptive_allocations examples (#113248 ) Co-authored-by: Jan Kuipers <148754765+jan-elastic@users.noreply.github.com>	2024-09-20 11:31:04 +02:00
Alexander Spies	2e1fccf07c	ESQL Docs: Mention Discover/Field Statistics in OOM known issue in 8.15.1/2 (#113196 )	2024-09-20 18:31:53 +10:00
Pius	3b854a6bee	Update 8.15.1.asciidoc (#113221 )	2024-09-20 10:02:00 +02:00
Liam Thompson	163b2c781f	[DOCS] Fix reranking IA, move retrievers to search api overview (#112949 )	2024-09-19 17:20:52 +02:00
Carlos Delgado	8d1b22e7bc	ESQL QSTR function (#112590 )	2024-09-19 16:34:42 +02:00
David Turner	33af77bcb3	Mention full-cluster restart in `initial_master_node` docs (#112986 ) Apparently some users consider "node is restarting" not to apply to a full-cluster restart. This commit further clarifies that you must not set `cluster.initial_master_nodes` in a full cluster restart.	2024-09-19 10:41:39 +01:00
Carlos Delgado	838b5a860d	ESQL - generate docs for snapshot functions (#113080 )	2024-09-19 07:46:43 +02:00
Stef Nestor	63e0897526	(Doc+) Update example SAML blog for Okta (#112934 )	2024-09-18 12:04:44 +02:00
István Zoltán Szabó	5153905e7e	[DOCS] Gives more details to the load data step of the semantic search tutorials (#113088 ) Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2024-09-18 11:37:35 +02:00
Jedr Blaszyk	b0b9b1f807	Add connector permissions to fleet server service account (#112556 ) * Add connector permissions to fleet server service account * [Security] Add permissions to manage connectors for fleet-server service account * Fix tests * Fix tests * Fix typ again (tm) * switch to connector/* vs manage_connectors @jakelandis pointed out that we don't need connector secrets, which is the only difference between these two. We don't have a pretty name for the narrower permissions, but we don't need one here. Co-authored-by: Artem Shelkovnikov <lavatroublebubble@gmail.com> --------- Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: Sean Story <sean.j.story@gmail.com> Co-authored-by: Artem Shelkovnikov <lavatroublebubble@gmail.com>	2024-09-17 15:35:45 -05:00
Martijn van Groningen	639a1e0891	Revert "Emit deprecation warning when executing one of the rollup APIs (#112807 )" (#113047 ) This reverts commit `3d6e9a2850`, because the original change is causing xpack rest test failures in other repositories.	2024-09-18 05:30:35 +10:00
Martijn van Groningen	3d6e9a2850	Emit deprecation warning when executing one of the rollup APIs (#112807 ) Currently none of the rollup APIs emit a deprecation warning. Only the reference docs indicate that rollup has been deprecated. Relates to #112690	2024-09-17 19:23:22 +02:00
Salvatore Campagna	f7880ae85f	LogsDB data migration integration testing (#112710 ) Here we test reindexing logsdb indices, creating and restoring snapshots. Note that logsdb uses synthetic source and restoring source only snapshots fails due to missing _source.	2024-09-17 16:26:48 +02:00
Alexander Spies	70fcc54db7	ESQL: Add known issue to 8.15 docs for OOM due to wide index pattern (#112926 )	2024-09-17 05:30:41 +10:00
elasticsearchmachine	9aa39ce835	Forward port release notes for v8.15.1 (#112568 )	2024-09-16 17:21:38 +02:00
István Zoltán Szabó	2371caeb15	[DOCS] Fixes response object indentation in semantic text tutorial (#112915 )	2024-09-16 14:40:54 +02:00
István Zoltán Szabó	ac27e735e8	[DOCS] Improves inference workflow tutorial. (#112870 )	2024-09-13 17:34:17 +02:00
István Zoltán Szabó	c2bd0de278	[DOCS] Simplifies semantic_text tutorial by removing copy_to field (#112864 )	2024-09-13 16:50:41 +02:00
Benjamin Trent	4e0b197f30	Update knn-query.asciidoc (#112833 )	2024-09-13 13:15:44 +02:00
Martijn van Groningen	661efa9d97	Remove zstd feature flag for index codec best compression. (#112665 ) ZStandard was added via #103374 a few months ago to snapshot builds of Elasticsearch only, and benchmark results have shown that zstd is a better trade-off than deflate when index.codec is set to best_compression. This change removes the feature flag for ZStandard stored field compression for indices with index.codec set to best_compression.	2024-09-13 10:49:49 +02:00
Luigi Dell'Aquila	f7a0196b45	ES|QL: Add 'preview' information to functions docs for Kibana (#112792 )	2024-09-12 16:49:55 +02:00
Stef Nestor	76ed892bc6	(Docs+) Flush out Resource+Task troubleshooting (#111773 ) * (Docs+) Flush out Resource+Task troubleshooting --------- Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com> Co-authored-by: David Turner <david.turner@elastic.co>	2024-09-12 07:46:40 -06:00
István Zoltán Szabó	a151bb0bfb	[DOCS] Rework semantic search main page (#112452 ) Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>	2024-09-12 14:09:34 +02:00
Stef Nestor	e6b15f4bf7	(Doc+) Inference Pipeline ignores Mapping Analyzers (#112522 ) * (Doc+) Inference Pipeline ignores Mapping Analyzers From internal Dev feedback (will cross-link after), this updates that inference processors within ingest pipelines run before mapping analyzers effectively ignoring them. So if users want analyzers to take effect, they would need to select the analyzer's ingest pipeline process equivalent and run it higher in flow than the inference processor. --------- Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>	2024-09-11 16:05:15 -06:00
Stef Nestor	bb6104eb02	(Doc+) Terminating Exit Codes (#112530 ) 👋 howdy, team! Mini PR to cross-replicate [this knowledge article](https://support.elastic.co/knowledge/6610ba83) about Elasticsearch's exit codes which expands [this ES doc section](https://www.elastic.co/guide/en/elasticsearch/reference/master/stopping-elasticsearch.html#fatal-errors).	2024-09-11 15:33:47 -06:00
Stef Nestor	5624d7d98f	(Doc+) CAT Nodes default columns (#112715 ) 👋 howdy, team! 1. Related to https://github.com/elastic/dev/issues/2631, highlights that customers are usually seeking `heap.percent` instead of `ram.percent` 2. Aligns the claimed "(Default)" columns in the doc with what is returned for a v8.15.1 test cluster	2024-09-11 15:32:40 -06:00
Lisa Cawley	db1916a6ec	[DOCS] Augment installation warnings (#112756 )	2024-09-11 14:18:17 -07:00
David Turner	f79fb8c25b	Introduce repository integrity verification API (#112348 ) Adds an API which scans all the metadata (and optionally the raw data) in a snapshot repository to look for corruptions or other inconsistencies. Closes https://github.com/elastic/elasticsearch/issues/52622 Closes ES-8560	2024-09-11 23:17:59 +10:00
Mary Gouseti	c1a2d390ef	Update data stream lifecycle telemetry to track global retention (#112451 ) Currently, the data stream lifecycle telemetry has the following structure: ``` { .... "data_lifecycle" : { "available": true, "enabled": true, "count": 0, "default_rollover_used": true, "retention": { "minimum_millis": 0, "maximum_millis": 0, "average_millis": 0.0 } }.... ``` In the snippet above you can see that we track: - The number of data streams managed by the data stream lifecycle by `count` - If the default rollover has been overwritten by `default_rollover_used` - The min, max and average of the `data_retention` configured on a data stream level. In this PR we propose the following extension: ``` .... "data_lifecycle" : { "available": true, "enabled": true, "count": 0, "default_rollover_used": true, "effective_retention": { #https://github.com/elastic/dev/issues/2537 "retained_data_streams": 5, "minimum_millis": 0, # Only if retained data streams > 1 "maximum_millis": 0, "average_millis": 0.0 }, "data_retention": { "configured_data_streams": 5, "minimum_millis": 0, # Only if retained data streams > 1 "maximum_millis": 0, "average_millis": 0.0 }, "global_retention": { "default": { "defined": true/false, "affected_data_streams": 0, "millis": 0 }, "max": { "defined": true/false, "affected_data_streams": 0, "millis": 0 } } ``` With this extension we are tracking: - The number of data streams managed by the data stream lifecycle by `count` - If the default rollover has been overwritten by `default_rollover_used` - The min, max and average of the `data_retention` configured on a data stream level and the number of data streams that have it configured. We add the min, max and avg only if there are data streams with data retention configuration to avoid messing with the stats in a dashboard. - The min, max and average of the `effective_retention` and the number of data streams that are retained. We add the min, max and avg only if there are retained data streams to avoid messing with the stats in a dashboard. - Global retention stats: whether they are defined, the number of affected data streams, and the actual value. The above metrics allow us to answer questions like: - How many data streams are affected by global retention. - How big the difference is between the longest data retention and the max global retention. - How much the effective retention diverges from the data retention; this will show the impact of the global retention.	2024-09-11 18:31:04 +10:00
kosabogi	6e7a9eb629	Adds details on Kibana access credentials (#112695 )	2024-09-11 06:20:08 +02:00
Stanislav Malyshev	9081a951d5	Implement CCS telemetry export as part of _cluster/stats (#112310 ) * Implement CCS telemetry export as part of _cluster/stats	2024-09-10 09:31:06 -06:00
István Zoltán Szabó	3636797cfe	[DOCS] Adds path params and available task types to the PUT inference page (#112696 ) Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>	2024-09-10 12:43:08 +02:00
Liam Thompson	c2d4543250	[DOCS][101] Refine mappings + documents/indices overviews (#112545 )	2024-09-10 12:17:10 +02:00
kosabogi	6da37658ad	#101472 Updates default index.translog.flush_threshold_size value (#112052 ) * #101472 Updates default index.translog.flush_threshold_size value * Update docs/reference/index-modules/translog.asciidoc Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co> * Updates the description --------- Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>	2024-09-10 11:08:53 +02:00
Fang Xing	e8569356ea	[ES|QL] explicit cast a string literal to date_period and time_duration in arithmetic operations (#109193 ) explicit cast to date_period and time_duration in arithmetic operations	2024-09-09 14:56:43 -04:00
Nik Everett	ef3a5a1385	ESQL: Fix CASE when conditions are multivalued (#112401 ) When CASE hits a multivalued field it was previously either crashing on fold or evaluating it to the first value. Since booleans are loaded in sorted order from lucene that usually means `false`. This changes the behavior to line up with the rest of ESQL - now multivalued fields are treated as `false` with a warning. You might say "hey wait! multivalued fields usually become `null`, not `false`!". Yes, dear reader, you are right. Very right. But! `CASE`'s contract is to immediately convert its values into `true` or `false` using the standard boolean tri-valued logic. So `null` just becomes `false` immediately. This is how PostgreSQL, MySQL, and SQLite behave: ``` > SELECT CASE WHEN null THEN 1 ELSE 2 END; 2 ``` They turn that `null` into a false. And we're right there with them. Except, of course, that we're turning `[false, false]` and the like into `null` first. See!? It's consistent. Consistently confusing, but sane at least. The warning message just says "treating multivalued field as false" rather than explaining all of that. This also fixes up a few of CASE's docs which I noticed were kind of busted while working on CASE. I think the docs generation is having a lot of trouble with CASE so I've manually hacked the right thing into place, but we should figure out a better solution eventually. Closes #112359	2024-09-10 02:32:19 +10:00
Nik Everett	cf98240950	Update docs from code	2024-09-09 11:28:31 -04:00
David Turner	1977a715df	Add links to network disconnect troubleshooting (#112330 ) Makes the docs added in #112271 more discoverable.	2024-09-10 00:59:39 +10:00
Chris Berkhout	fbaeb1ee61	[ESQL] Add `SPACE` function (#112350 ) Adds the SPACE(number) function, which is equivalent to REPEAT(" ", number).	2024-09-09 21:41:35 +10:00
Iván Cea Fontenla	fc2760cfd4	ESQL: mv_median_absolute_deviation function (#112055 ) - Added mv_median_absolute_deviation function - Added possibility of having a fixed param in Multivalue "ascending" functions - Add surrogate to MedianAbsoluteDeviation ### Calculations used to avoid overflows First, a quick recap of how the MAD is calculated: 1. Sort values, and get the median 2. Calculate the difference between each value and the median (`abs(median - value)`) 3. Sort the differences, and get their median Calculating a MAD may overflow when calculating the differences (Step 2), given the type is a signed number, as the difference is a positive value, with potentially the same value as `POSITIVE_MAX - NEGATIVE_MIN`. To solve this, some types are upcast as follows: - Int: Stored as longs, simple approach - Long: Stored as longs, but switched to unsigned long representation when calculating the differences - Unsigned long: No effect; the resulting range is the same - Doubles: Nothing. If the values overflow to +/-infinity, they're left that way, as we'll just use those outliers to sort Closes https://github.com/elastic/elasticsearch/issues/111590	2024-09-09 10:04:25 +02:00
Mayya Sharipova	bd28e77df0	Similarity is applicable for text & keyword fields (#112613 ) Note that similarity is only applicable for text and keyword field types.	2024-09-06 15:54:15 -04:00
Stef Nestor	a4dba7db8d	(Doc+) Sparse Vectors NA to mapping analyzers (#112523 ) * retry	2024-09-05 09:19:19 -06:00
Keith Massey	4aa3c3d7ee	Add support for templates when validating mappings in the simulate ingest API (#111161 )	2024-09-05 09:25:53 -05:00
Liam Thompson	04678e9a15	[DOCS][ESQL] Include bucket in agg functions list (#112513 )	2024-09-05 11:43:20 +02:00
István Zoltán Szabó	e54f46e4eb	[DOCS] Fixes indentation issue on PUT trained models docs page. (#112538 )	2024-09-05 10:46:41 +02:00
wajihaparvez	41f05348f8	[Docs] Update Monitoring docs with integration assets info (#112164 ) * [Docs] Update Monitoring docs with integration assets info * [Docs] Adding test commit to fix CLA error	2024-09-04 16:07:45 -04:00
Mike Barretta	d41885347f	Missing link added to fips-140-compliance.asciidoc (#112477 ) (#112516 ) completed a missing link to the support matrix	2024-09-04 23:34:01 +10:00
Simon Cooper	a36d90cf34	Use CLDR locale provider on JDK 23+ (#110222 ) JDK 23 removes the COMPAT locale provider, leaving CLDR as the only option. This commit configures Elasticsearch to use the CLDR provider when on JDK 23, but still use the existing COMPAT provider when on JDK 22 and below. This causes some differences in locale behaviour; this also adapts various tests to still work whether run on COMPAT or CLDR.	2024-09-04 13:42:40 +01:00
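
The DATE_DIFF change in f7ff00f645 contrasts integer subtraction of the year parts with chronological subtraction. The following is a minimal Python sketch of that difference, not the Elasticsearch implementation: the function names are hypothetical, it works on plain dates rather than timestamps, and its handling of edge cases such as leap days is an assumption.

```python
from datetime import date


def year_diff_integer(start: date, end: date) -> int:
    """Old-style behaviour as described in the commit: subtract the year parts only."""
    return end.year - start.year


def year_diff_chronological(start: date, end: date) -> int:
    """Count only full calendar years between the two dates (illustrative sketch)."""
    years = end.year - start.year
    # Going forward: drop the last year if its anniversary has not been reached yet.
    if years > 0 and (end.month, end.day) < (start.month, start.day):
        years -= 1
    # Going backward: same correction, mirrored for negative differences.
    elif years < 0 and (end.month, end.day) > (start.month, start.day):
        years += 1
    return years


if __name__ == "__main__":
    new_years_eve = date(2023, 12, 31)
    new_years_day = date(2024, 1, 1)
    print(year_diff_integer(new_years_eve, new_years_day))        # 1
    print(year_diff_chronological(new_years_eve, new_years_day))  # 0
```

Two dates one day apart across a year boundary differ by one "year" under integer subtraction but by zero under the chronological rule, which is the behaviour change the commit message describes.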

11958 commits
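
The three-step MAD recap in fc2760cfd4 (median, absolute differences from the median, median of those differences) can be sketched in Python as below. This is an illustration only: the function name and constants are hypothetical, and Python integers do not overflow, so the 64-bit bounds are included just to show why the commit switches longs to an unsigned representation for the differences.

```python
from statistics import median

# Bounds of a signed 64-bit long, used only to illustrate the overflow concern.
LONG_MAX = 2**63 - 1
LONG_MIN = -(2**63)
UNSIGNED_LONG_MAX = 2**64 - 1


def median_absolute_deviation(values):
    """Median of the absolute differences between each value and the median."""
    if not values:
        raise ValueError("values must not be empty")
    med = median(values)  # step 1: sort and take the median
    differences = [abs(med - v) for v in values]  # step 2: absolute differences
    return median(differences)  # step 3: median of the differences


if __name__ == "__main__":
    print(median_absolute_deviation([1, 2, 3, 4, 100]))  # 1
    # The widest possible difference exceeds the signed range
    # but still fits in an unsigned 64-bit value.
    widest = LONG_MAX - LONG_MIN
    print(widest > LONG_MAX, widest <= UNSIGNED_LONG_MAX)  # True True
```

The last two lines show the point made in the commit message: a difference as large as `POSITIVE_MAX - NEGATIVE_MIN` cannot be held in a signed long, but it does fit in an unsigned one.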