Commit graph

11958 commits

Author SHA1 Message Date
Pm Ching
d68f2fa4a6
fix a couple of docs typos (#112901) 2024-09-20 18:34:24 +03:00
Bogdan Pintea
f7ff00f645
ESQL: Align year diffing to the rest of the units in DATE_DIFF: chronological (#113103)
This will correct/switch "year" unit diffing from the current integer
subtraction to a chronological subtraction. Consequently, two dates are (at
least) one year apart now if (at least) a full calendar year separates
them. The previous implementation simply subtracted the year part of the
dates.

Note: this departs from ES SQL's implementation of the same function,
which itself is aligned with MS SQL's implementation, which works
equivalently to an integer subtraction.
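
To illustrate the difference between the two approaches, here is a minimal Python sketch. It is not the Elasticsearch implementation; the function names are hypothetical, and it works on plain dates only (no time of day or time zones).

```python
from datetime import date


def year_diff_integer(start: date, end: date) -> int:
    """Previous behaviour as described above: subtract the year parts only."""
    return end.year - start.year


def year_diff_chronological(start: date, end: date) -> int:
    """Count the full calendar years separating the two dates."""
    if end < start:
        # Mirror the computation so the sign follows the argument order.
        return -year_diff_chronological(end, start)
    years = end.year - start.year
    # Not a full year yet if the end has not reached the start's month/day.
    if (end.month, end.day) < (start.month, start.day):
        years -= 1
    return years


new_years_eve = date(2023, 12, 31)
new_years_day = date(2024, 1, 1)
print(year_diff_integer(new_years_eve, new_years_day))        # 1
print(year_diff_chronological(new_years_eve, new_years_day))  # 0
```

The two dates are one day apart, so the integer subtraction reports a year while the chronological one does not.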

Fixes #112482.
2024-09-20 20:21:29 +10:00
István Zoltán Szabó
9b7d808bf4
[DOCS] Fixes adaptive_allocations examples (#113248)
Co-authored-by: Jan Kuipers <148754765+jan-elastic@users.noreply.github.com>
2024-09-20 11:31:04 +02:00
Alexander Spies
2e1fccf07c
ESQL Docs: Mention Discover/Field Statistics in OOM known issue in 8.15.1/2 (#113196) 2024-09-20 18:31:53 +10:00
Pius
3b854a6bee
Update 8.15.1.asciidoc (#113221) 2024-09-20 10:02:00 +02:00
Liam Thompson
163b2c781f
[DOCS] Fix reranking IA, move retrievers to search api overview (#112949) 2024-09-19 17:20:52 +02:00
Carlos Delgado
8d1b22e7bc
ESQL QSTR function (#112590) 2024-09-19 16:34:42 +02:00
David Turner
33af77bcb3
Mention full-cluster restart in initial_master_node docs (#112986)
Apparently some users consider "node is restarting" not to apply to a
full-cluster restart. This commit further clarifies that you must not
set `cluster.initial_master_nodes` in a full cluster restart.
2024-09-19 10:41:39 +01:00
Carlos Delgado
838b5a860d
ESQL - generate docs for snapshot functions (#113080) 2024-09-19 07:46:43 +02:00
Stef Nestor
63e0897526
(Doc+) Update example SAML blog for Okta (#112934) 2024-09-18 12:04:44 +02:00
István Zoltán Szabó
5153905e7e
[DOCS] Gives more details to the load data step of the semantic search tutorials (#113088)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-09-18 11:37:35 +02:00
Jedr Blaszyk
b0b9b1f807
Add connector permissions to fleet server service account (#112556)
* Add connector permissions to fleet server service account

* [Security] Add permissions to manage connectors for fleet-server service account

* Fix tests

* Fix tests

* Fix typ again (tm)

* switch to connector/* vs manage_connectors

@jakelandis pointed out that we don't need connector secrets, which is the only difference between these two. We don't have a pretty name for the narrower permissions, but we don't need one here.

Co-authored-by: Artem Shelkovnikov <lavatroublebubble@gmail.com>

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: Sean Story <sean.j.story@gmail.com>
Co-authored-by: Artem Shelkovnikov <lavatroublebubble@gmail.com>
2024-09-17 15:35:45 -05:00
Martijn van Groningen
639a1e0891
Revert "Emit deprecation warning when executing one of the rollup APIs (#112807)" (#113047)
This reverts commit 3d6e9a2850, because
the original change is causing xpack rest test failures in other
repositories.
2024-09-18 05:30:35 +10:00
Martijn van Groningen
3d6e9a2850
Emit deprecation warning when executing one of the rollup APIs (#112807)
Currently none of the rollup APIs emit a deprecation warning. Only the reference docs indicate that rollup has been deprecated.

Relates to #112690
2024-09-17 19:23:22 +02:00
Salvatore Campagna
f7880ae85f
LogsDB data migration integration testing (#112710)
Here we test reindexing logsdb indices, creating and restoring
snapshots. Note that logsdb uses synthetic source and restoring
source-only snapshots fails due to missing _source.
2024-09-17 16:26:48 +02:00
Alexander Spies
70fcc54db7
ESQL: Add known issue to 8.15 docs for OOM due to wide index pattern (#112926) 2024-09-17 05:30:41 +10:00
elasticsearchmachine
9aa39ce835
Forward port release notes for v8.15.1 (#112568) 2024-09-16 17:21:38 +02:00
István Zoltán Szabó
2371caeb15
[DOCS] Fixes response object indentation in semantic text tutorial (#112915) 2024-09-16 14:40:54 +02:00
István Zoltán Szabó
ac27e735e8
[DOCS] Improves inference workflow tutorial. (#112870) 2024-09-13 17:34:17 +02:00
István Zoltán Szabó
c2bd0de278
[DOCS] Simplifies semantic_text tutorial by removing copy_to field (#112864) 2024-09-13 16:50:41 +02:00
Benjamin Trent
4e0b197f30
Update knn-query.asciidoc (#112833) 2024-09-13 13:15:44 +02:00
Martijn van Groningen
661efa9d97
Remove zstd feature flag for index codec best compression. (#112665)
ZStandard was added via #103374 a few months ago to snapshot builds of Elasticsearch only and benchmark results have shown that using zstd is a better trade off compared to deflate for when index.codec is set to best_compression.

This change removes the feature flag for ZStandard stored field compression for indices with index.codec set to best_compression.
2024-09-13 10:49:49 +02:00
Luigi Dell'Aquila
f7a0196b45
ES|QL: Add 'preview' information to functions docs for Kibana (#112792) 2024-09-12 16:49:55 +02:00
Stef Nestor
76ed892bc6
(Docs+) Flush out Resource+Task troubleshooting (#111773)
* (Docs+) Flush out Resource+Task troubleshooting

---------

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
Co-authored-by: David Turner <david.turner@elastic.co>
2024-09-12 07:46:40 -06:00
István Zoltán Szabó
a151bb0bfb
[DOCS] Rework semantic search main page (#112452)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
Co-authored-by: Mike Pellegrini <mike.pellegrini@elastic.co>
2024-09-12 14:09:34 +02:00
Stef Nestor
e6b15f4bf7
(Doc+) Inference Pipeline ignores Mapping Analyzers (#112522)
* (Doc+) Inference Pipeline ignores Mapping Analyzers

From internal Dev feedback (will cross-link after), this documents that inference processors within ingest pipelines run before mapping analyzers, effectively ignoring them. So if users want analyzers to take effect, they need to select the analyzer's ingest pipeline processor equivalent and run it earlier in the flow than the inference processor.

---------

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
2024-09-11 16:05:15 -06:00
Stef Nestor
bb6104eb02
(Doc+) Terminating Exit Codes (#112530)
👋 howdy, team! Mini PR to cross-replicate [this knowledge article](https://support.elastic.co/knowledge/6610ba83) about Elasticsearch's exit codes which expands [this ES doc section](https://www.elastic.co/guide/en/elasticsearch/reference/master/stopping-elasticsearch.html#fatal-errors).
2024-09-11 15:33:47 -06:00
Stef Nestor
5624d7d98f
(Doc+) CAT Nodes default columns (#112715)
👋 howdy, team!

1. Related to https://github.com/elastic/dev/issues/2631, highlights customers are usually seeking `heap.percent` instead of `ram.percent`
2. Aligns the claimed "(Default)" columns in the doc with what is returned for a v8.15.1 test cluster
2024-09-11 15:32:40 -06:00
Lisa Cawley
db1916a6ec
[DOCS] Augment installation warnings (#112756) 2024-09-11 14:18:17 -07:00
David Turner
f79fb8c25b
Introduce repository integrity verification API (#112348)
Adds an API which scans all the metadata (and optionally the raw data)
in a snapshot repository to look for corruptions or other
inconsistencies.

Closes https://github.com/elastic/elasticsearch/issues/52622
Closes ES-8560
2024-09-11 23:17:59 +10:00
Mary Gouseti
c1a2d390ef
Update data stream lifecycle telemetry to track global retention (#112451)
Currently, the data stream lifecycle telemetry has the following
structure:

```
{
....
  "data_lifecycle" : {
    "available": true,
    "enabled": true,
    "count": 0,
    "default_rollover_used": true,
    "retention": {
        "minimum_millis": 0,
        "maximum_millis": 0,
        "average_millis": 0.0
    }
  }....
```

In the snippet above you can see that we track:

- The amount of data streams managed by the data stream lifecycle by `count`
- If the default rollover has been overwritten by `default_rollover_used`
- The min, max and average of the `data_retention` configured on a data stream level.

In this PR we propose the following extension:

```
....
  "data_lifecycle" : {
    "available": true,
    "enabled": true,
    "count": 0,
    "default_rollover_used": true,
    "effective_retention": { #https://github.com/elastic/dev/issues/2537
        "retained_data_streams": 5,
        "minimum_millis": 0, # Only if retained data streams > 1
        "maximum_millis": 0,
        "average_millis": 0.0
    },
    "data_retention": {
        "configured_data_streams": 5,
        "minimum_millis": 0, # Only if retained data streams > 1
        "maximum_millis": 0,
        "average_millis": 0.0
    },
    "global_retention": {
      "default": {
         "defined": true/false,
         "affected_data_streams": 0,
         "millis": 0 
      },
      "max": {
         "defined": true/false,
         "affected_data_streams": 0,
         "millis": 0 
      }
    }
```

With this extension we are tracking:

- The amount of data streams managed by the data stream lifecycle by `count`
- If the default rollover has been overwritten by `default_rollover_used`
- The min, max and average of the `data_retention` configured on a data stream level and the number of data streams that have it configured. We add the min, max and avg only if there are data streams with data retention configuration to avoid messing with the stats in a dashboard.
- The min, max and average of the `effective_retention` and the number of data streams that are retained. We add the min, max and avg only if there are retained data streams to avoid messing with the stats in a dashboard.
- Global retention stats: whether they are defined, the number of affected data streams, and the actual value.

The above metrics allow us to answer questions like:

- How many data streams are affected by global retention.
- How big is the difference between the longest data retention compared to max global retention.
- How much the effective retention diverges from the data retention; this shows the impact of the global retention.
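
As a rough illustration of how such stats could be derived, here is a small Python sketch. It is not the code from this PR. The function names are hypothetical, and the way the effective retention is derived (the global default applies when a data stream has no retention configured, and the global max caps everything else) is an assumption. Following the description above, the min, max and average are emitted only when at least one data stream contributes.

```python
def retention_stats(values, count_key):
    """Summarise retention values; min/max/avg are added only if any exist."""
    stats = {count_key: len(values)}
    if values:
        stats["minimum_millis"] = min(values)
        stats["maximum_millis"] = max(values)
        stats["average_millis"] = sum(values) / len(values)
    return stats


def lifecycle_telemetry(configured, default_millis=None, max_millis=None):
    """`configured` holds one entry per data stream: millis, or None if unset."""
    effective = []
    affected_by_default = 0
    affected_by_max = 0
    for retention in configured:
        if retention is None:
            # Assumption: the global default fills in for a missing retention,
            # and the global max is used when no default is defined.
            if default_millis is not None:
                retention = default_millis
                affected_by_default += 1
            elif max_millis is not None:
                retention = max_millis
                affected_by_max += 1
        elif max_millis is not None and retention > max_millis:
            # Assumption: the global max caps a longer configured retention.
            retention = max_millis
            affected_by_max += 1
        if retention is not None:
            effective.append(retention)
    return {
        "count": len(configured),
        "effective_retention": retention_stats(effective, "retained_data_streams"),
        "data_retention": retention_stats(
            [r for r in configured if r is not None], "configured_data_streams"
        ),
        "global_retention": {
            "default": {
                "defined": default_millis is not None,
                "affected_data_streams": affected_by_default,
                "millis": default_millis,
            },
            "max": {
                "defined": max_millis is not None,
                "affected_data_streams": affected_by_max,
                "millis": max_millis,
            },
        },
    }


telemetry = lifecycle_telemetry([None, 1000, 5000], default_millis=2000, max_millis=3000)
print(telemetry["effective_retention"])
print(telemetry["data_retention"])
```

Comparing `effective_retention` with `data_retention` in the output shows the impact of the global retention on the three example data streams.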
2024-09-11 18:31:04 +10:00
kosabogi
6e7a9eb629
Adds details on Kibana access credentials (#112695) 2024-09-11 06:20:08 +02:00
Stanislav Malyshev
9081a951d5
Implement CCS telemetry export as part of _cluster/stats (#112310)
* Implement CCS telemetry export as part of _cluster/stats
2024-09-10 09:31:06 -06:00
István Zoltán Szabó
3636797cfe
[DOCS] Adds path params and available task types to the PUT inference page (#112696)
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-09-10 12:43:08 +02:00
Liam Thompson
c2d4543250
[DOCS][101] Refine mappings + documents/indices overviews (#112545) 2024-09-10 12:17:10 +02:00
kosabogi
6da37658ad
#101472 Updates default index.translog.flush_threshold_size value (#112052)
* #101472 Updates default index.translog.flush_threshold_size value

* Update docs/reference/index-modules/translog.asciidoc

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>

* Updates the description

---------

Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co>
2024-09-10 11:08:53 +02:00
Fang Xing
e8569356ea
[ES|QL] explicit cast a string literal to date_period and time_duration in arithmetic operations (#109193)
explicit cast to date_period and time_duration in arithmetic operations
2024-09-09 14:56:43 -04:00
Nik Everett
ef3a5a1385
ESQL: Fix CASE when conditions are multivalued (#112401)
When CASE hits a multivalued field it was previously either crashing on
fold or evaluating it to the first value. Since booleans are loaded in
sorted order from lucene that *usually* means `false`. This changes the
behavior to line up with the rest of ESQL - now multivalued fields are
treated as `false` with a warning.

You might say "hey wait! multivalued fields usually become `null`, not
`false`!". Yes, dear reader, you are right. Very right. But! `CASE`'s
contract is to immediately convert its values into `true` or `false`
using the standard boolean tri-valued logic. So `null` just becomes
`false` immediately. This is how PostgreSQL, MySQL, and SQLite behave:

```
> SELECT CASE WHEN null THEN 1 ELSE 2 END;
2
```

They turn that `null` into a false. And we're right there with them.
Except, of course, that we're turning `[false, false]` and the like into
`null` first. See!? It's consistent. Consistently confusing, but sane at
least.

The warning message just says "treating multivalued field as false"
rather than explaining all of that.
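
The rule described above can be sketched in Python roughly as follows. This is a simplified model, not the ESQL implementation: the function names are hypothetical, a multivalued field is modelled as a list with more than one element, and `None` stands in for `null`.

```python
def condition_is_true(value, warnings_out):
    """Reduce a CASE condition to True/False; null and multivalues count as False."""
    if isinstance(value, list) and len(value) > 1:
        # Multivalued condition: treated as false, with a warning.
        warnings_out.append("treating multivalued field as false")
        return False
    if isinstance(value, list):
        # Single-element or empty list: unwrap to the value or to null.
        value = value[0] if value else None
    # Tri-valued logic collapsed to two values: only a real True passes.
    return value is True


def case(branches, default=None):
    """Return (result, warnings) for the first branch whose condition is true."""
    warnings_out = []
    for condition, result in branches:
        if condition_is_true(condition, warnings_out):
            return result, warnings_out
    return default, warnings_out


print(case([(None, 1)], default=2))            # (2, [])
print(case([([False, False], 1)], default=2))  # (2, ['treating multivalued field as false'])
```

The first call mirrors the SQL snippet above: a `null` condition falls through to the `ELSE` value.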

This also fixes up a few of CASE's docs which I noticed were kind of
busted while working on CASE. I think the docs generation is having a
lot of trouble with CASE so I've manually hacked the right thing into
place, but we should figure out a better solution eventually.

Closes #112359
2024-09-10 02:32:19 +10:00
Nik Everett
cf98240950
Update docs from code 2024-09-09 11:28:31 -04:00
David Turner
1977a715df
Add links to network disconnect troubleshooting (#112330)
Makes the docs added in #112271 more discoverable.
2024-09-10 00:59:39 +10:00
Chris Berkhout
fbaeb1ee61
[ESQL] Add SPACE function (#112350)
Adds the SPACE(number) function, which is equivalent to REPEAT(" ", number).
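
The stated equivalence can be mirrored in a few lines of Python. This only illustrates the semantics for non-negative numbers; `space` and `repeat` are hypothetical stand-ins, not the ESQL functions themselves.

```python
def repeat(text: str, number: int) -> str:
    """Stand-in for REPEAT: the text concatenated `number` times."""
    return text * number


def space(number: int) -> str:
    """Stand-in for SPACE: a string made of `number` spaces."""
    return repeat(" ", number)


print(repr(space(3)))  # '   '
```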
2024-09-09 21:41:35 +10:00
Iván Cea Fontenla
fc2760cfd4
ESQL: mv_median_absolute_deviation function (#112055)
- Added mv_median_absolute_deviation function
- Added possibility of having a fixed param in Multivalue "ascending" functions
- Add surrogate to MedianAbsoluteDeviation

### Calculations used to avoid overflows
First, a quick recap of how the MAD is calculated:
1. Sort values, and get the median
2. Calculate the difference between each value with the median (`abs(median - value)`)
3. Sort the differences, and get their median

Calculating a MAD may overflow when calculating the differences (Step 2) if the type is signed: the difference is a positive value that can be as large as `POSITIVE_MAX - NEGATIVE_MIN`.
To solve this, some types are upcast as follows:
- Int: Stored as longs, simple approach
- Long: Stored as longs, but switched to unsigned long representation when calculating the differences
- Unsigned long: No effect; the resulting range is the same
- Doubles: Nothing. If the values overflow to +/-infinity, they're left that way, as we'll just use those outliers to sort
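
The recap and the overflow concern can be sketched in Python as follows. This is not the ESQL code: Python integers do not overflow, so the 64-bit limits are written out as constants, and `statistics.median` averages the two middle values for even-sized inputs, which may differ from what ESQL does for integer types.

```python
from statistics import median


def median_absolute_deviation(values):
    """Steps 1-3 above: median, absolute differences, median of the differences."""
    if not values:
        raise ValueError("at least one value is required")
    center = median(values)                              # Step 1
    differences = [abs(center - v) for v in values]      # Step 2
    return median(differences)                           # Step 3


print(median_absolute_deviation([1, 2, 3, 4, 100]))  # 1

# Why signed longs need care in Step 2: the largest possible difference
# does not fit in a signed 64-bit value, but does fit in an unsigned one.
LONG_MIN = -(2**63)
LONG_MAX = 2**63 - 1
UNSIGNED_LONG_MAX = 2**64 - 1
largest_difference = LONG_MAX - LONG_MIN
print(largest_difference > LONG_MAX)            # True
print(largest_difference <= UNSIGNED_LONG_MAX)  # True
```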

Closes https://github.com/elastic/elasticsearch/issues/111590
2024-09-09 10:04:25 +02:00
Mayya Sharipova
bd28e77df0
Similarity is applicable for text & keyword fields (#112613)
Note that similarity is only applicable for text and keyword field types.
2024-09-06 15:54:15 -04:00
Stef Nestor
a4dba7db8d
(Doc+) Sparse Vectors NA to mapping analyzers (#112523)
* retry
2024-09-05 09:19:19 -06:00
Keith Massey
4aa3c3d7ee
Add support for templates when validating mappings in the simulate ingest API (#111161) 2024-09-05 09:25:53 -05:00
Liam Thompson
04678e9a15
[DOCS][ESQL] Include bucket in agg functions list (#112513) 2024-09-05 11:43:20 +02:00
István Zoltán Szabó
e54f46e4eb
[DOCS] Fixes indentation issue on PUT trained models docs page. (#112538) 2024-09-05 10:46:41 +02:00
wajihaparvez
41f05348f8
[Docs] Update Monitoring docs with integration assets info (#112164)
* [Docs] Update Monitoring docs with integration assets info

* [Docs] Adding test commit to fix CLA error
2024-09-04 16:07:45 -04:00
Mike Barretta
d41885347f
Missing link added to fips-140-compliance.asciidoc (#112477) (#112516)
completed a missing link to the support matrix
2024-09-04 23:34:01 +10:00
Simon Cooper
a36d90cf34
Use CLDR locale provider on JDK 23+ (#110222)
JDK 23 removes the COMPAT locale provider, leaving CLDR as the only option. This commit configures Elasticsearch
to use the CLDR provider when on JDK 23, but still use the existing COMPAT provider when on JDK 22 and below.

This causes some differences in locale behaviour; this also adapts various tests to still work whether run on COMPAT or CLDR.
2024-09-04 13:42:40 +01:00