Commit graph

12605 commits

Author SHA1 Message Date
Liam Thompson
c85c2d9204
[DOCS][101] Update first quick start with mappings examples (#113558) 2024-10-02 20:00:09 +02:00
Simon Cooper
31d50eed0f
Update 9.0 with various locale changes from 8.x (#113787) (#113870)
Forward-port changes from #113787, and update the docs with similar information to #113587
2024-10-02 11:41:33 +01:00
Liam Thompson
51570a0b78
[DOCS] Update local dev instructions to use start-local (#113848)
https://www.elastic.co/start-local is live and will be our go-to local
dev setup.

This PR:

- Updates both the Elasticsearch root readme and `run-elasticsearch-locally.asciidoc`

🧹 Also try to keep as concise as possible by not mirroring _everything_
in readme
2024-10-02 19:12:58 +10:00
Chenhui Wang
7f2f0b8568
Update elastic-connectors docker namespace to integrations (#113839) 2024-10-02 10:20:01 +02:00
Liam Thompson
9b582c15ff
[DOCS] Fix typo in knn tuning guide (#113880) 2024-10-02 09:55:52 +02:00
Luigi Dell'Aquila
9a652829a3
ES|QL: provide snapshot_only info for functions (Kibana) (#113544) 2024-10-02 09:27:05 +02:00
Benjamin Trent
8ed0df42f6
Add more dense_vector details for cluster stats field stats (#113607)
This adds some more counts for dense_vector field mapping stats. This
allows for seeing the number of mappings with a given element type,
similarity, or index type.
2024-10-02 01:58:04 +10:00
john-wagster
0fbb3bcb45
Updated Date Range to Follow Documentation When Assuming Missing Values (#112258)
* updated rangetype to be more in line with the docs (https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-range-query.html) and added tests to reflect as much
2024-10-01 09:21:47 -05:00
Stef Nestor
aedbd5c911
(Doc+) Hotspotting link example troubleshooting video (#112932)
* (Doc+) Hotspotting link video

👋 howdy, team! Ongoing improvements for common support topics, this [links our example hotspotting video](https://www.youtube.com/watch?v=Q5ODJ5nIKAM&list=PL_mJOmq4zsHbQlfEMEh_30_LuV_hZp-3d&index=5).

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2024-10-01 07:35:29 -06:00
Benjamin Trent
0cf6a83cd9
Adjust the knn tuning guide (#113566) 2024-10-01 08:52:35 -04:00
David Turner
69fffdfecb
Misc ReferenceDocs improvements (#113667)
- Forbid ephemeral `_auto_gen.html` and `page.html#_auto_gen` links.
- Remove dangling/unused `BOOTSTRAP_CHECK_G1GC` link.
- Separate test suite into individual tests.
2024-10-01 13:44:00 +01:00
kosabogi
9568d9ced9
Updates 'Getting ready for production' page (#113679)
* Updates 'Getting ready for production' page

* Update docs/reference/intro.asciidoc

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Update docs/reference/intro.asciidoc

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Update docs/reference/intro.asciidoc

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Update docs/reference/intro.asciidoc

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Update docs/reference/intro.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* Update docs/reference/intro.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

* Update docs/reference/intro.asciidoc

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>

* Update docs/reference/intro.asciidoc

Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>

---------

Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com>
2024-10-01 13:42:37 +02:00
István Zoltán Szabó
44667d52a0
[DOCS] Documents completion task type for the AlibabaCloud AI Search inference service. (#113845) 2024-10-01 13:41:38 +02:00
Chris Hegarty
32dde26e49
Upgrade to Lucene 9.12.0 (#113333)
This commit upgrades to Lucene 9.12.0.

Co-authored-by: Adrien Grand <jpountz@gmail.com>
Co-authored-by: Armin Braun <me@obrown.io>
Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com>
Co-authored-by: Chris Hegarty <chegar999@gmail.com>
Co-authored-by: John Wagster <john.wagster@elastic.co>
Co-authored-by: Luca Cavanna <javanna@apache.org>
Co-authored-by: Mayya Sharipova <mayya.sharipova@elastic.co>
2024-10-01 08:39:27 +01:00
Michael Peterson
ddba47407d
Collect and display execution metadata for ES|QL cross cluster searches (#112595)
Enhance ES|QL responses to include information about `took` time (search latency), shards, and
clusters against which the query was executed.

The goal of this PR is to begin to provide parity between the metadata displayed for 
cross-cluster searches in _search and ES|QL.

This PR adds the following features:
- add overall `took` time to all ES|QL query responses. And to emphasize: "all" here 
means: async search, sync search, local-only and cross-cluster searches, so it goes
beyond just CCS.
- add `_clusters` metadata to the final response for cross-cluster searches, for both
async and sync search (see example below)
- tracking/reporting counts of skipped shards from the can_match (SearchShards API)
phase of ES|QL processing
- marking clusters as skipped if they cannot be connected to (during the field-caps
phase of processing)

Out of scope for this PR:
- honoring the `skip_unavailable` cluster setting
- showing `_clusters` metadata in the async response **while** the search is still running
- showing any shard failure messages (since any shard search failures in ES|QL are
automatically fatal and _cluster/details is not shown in 4xx/5xx error responses). Note that 
this also means that the `failed` shard count is always 0 in ES|QL `_clusters` section.

Things changed with respect to behavior in `_search`:
- the `timed_out` field in `_clusters/details/mycluster` was removed in the ESQL
response, since ESQL does not support timeouts. It could be added back later
if/when ESQL supports timeouts.
- the `failures` array in `_clusters/details/mycluster/_shards` was removed in the ESQL
response, since any shard failure causes the whole query to fail.

Example output from ES|QL CCS:

```es
POST /_query
{
  "query": "from blogs,remote2:bl*,remote1:blogs|\nkeep authors.first_name,publish_date|\n limit 5"
}
```

```json
{
  "took": 49,
  "columns": [
    {
      "name": "authors.first_name",
      "type": "text"
    },
    {
      "name": "publish_date",
      "type": "date"
    }
  ],
  "values": [
    [
      "Tammy",
      "2009-11-04T04:08:07.000Z"
    ],
    [
      "Theresa",
      "2019-05-10T21:22:32.000Z"
    ],
    [
      "Jason",
      "2021-11-23T00:57:30.000Z"
    ],
    [
      "Craig",
      "2019-12-14T21:24:29.000Z"
    ],
    [
      "Alexandra",
      "2013-02-15T18:13:24.000Z"
    ]
  ],
  "_clusters": {
    "total": 3,
    "successful": 2,
    "running": 0,
    "skipped": 1,
    "partial": 0,
    "failed": 0,
    "details": {
      "(local)": {
        "status": "successful",
        "indices": "blogs",
        "took": 43,
        "_shards": {
          "total": 13,
          "successful": 13,
          "skipped": 0,
          "failed": 0
        }
      },
      "remote2": {
        "status": "skipped",  // remote2 was offline when this query was run
        "indices": "remote2:bl*",
        "took": 0,
        "_shards": {
          "total": 0,
          "successful": 0,
          "skipped": 0,
          "failed": 0
        }
      },
      "remote1": {
        "status": "successful",
        "indices": "remote1:blogs",
        "took": 47,
        "_shards": {
          "total": 13,
          "successful": 13,
          "skipped": 0,
          "failed": 0
        }
      }
    }
  }
}
```
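As a rough illustration of how a client might read this metadata, the sketch below walks `_clusters.details` and groups cluster aliases by status. The `clusters_by_status` helper and the trimmed `sample` response are assumptions made for illustration; they are not part of this PR.

```python
from collections import defaultdict


def clusters_by_status(response: dict) -> dict:
    """Group cluster aliases by the status reported in `_clusters.details`."""
    grouped = defaultdict(list)
    details = response.get("_clusters", {}).get("details", {})
    for alias, info in details.items():
        grouped[info["status"]].append(alias)
    return dict(grouped)


# Trimmed copy of the example response above (hypothetical client-side data).
sample = {
    "took": 49,
    "_clusters": {
        "total": 3,
        "successful": 2,
        "skipped": 1,
        "details": {
            "(local)": {"status": "successful", "took": 43},
            "remote2": {"status": "skipped", "took": 0},
            "remote1": {"status": "successful", "took": 47},
        },
    },
}

print(clusters_by_status(sample))
```

A caller could use the `skipped` list to warn that results may be incomplete.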

Fixes https://github.com/elastic/elasticsearch/issues/112402 and https://github.com/elastic/elasticsearch/issues/110935
2024-09-30 16:03:39 -04:00
Benjamin Trent
5c840f72b7
Deprecate dutch_kp and lovins stemmer as they are removed in Lucene 10 (#113143)
Lucene 10 has upgraded its Snowball stemming support, as part of those
upgrades, two no longer supported stemmers were removed, `KpStemmer` and
`LovinsStemmer`. These are `dutch_kp` and `lovins`, respectively.

We will deprecate in 8.16 and will remove support for these in a future
version.
2024-10-01 04:03:44 +10:00
Stanislav Malyshev
b26d81c713
Implement remote cluster CCS telemetry (#112478)
* Add remote cluster stats to _cluster/stats
* Implement remote cluster stats polling
* Add docs for the include_remotes part
2024-09-30 11:50:22 -06:00
Liam Thompson
55078d4c5e
[DOCS] Fix heading level (#113800) 2024-09-30 16:11:46 +02:00
Luke Whiting
b1b249d26b
#101193 Preserve Step Info Across ILM Auto Retries (#113187)
* Add new Previous Step Info field to LifecycleExecutionState

* Add new field to IndexLifecycleExplainResponse

* Add new field to TransportExplainLifecycleAction

* Add logic to IndexLifecycleTransition to keep previous step info

* Switch tests to use Java standard Clock class

for any time based testing, this is the recommended method

* Fix tests for new field

Also refactor tests to newer style

* Add test to ensure step info is preserved

Across auto retries

* Add docs for new field

* Changelog Entry

* Update docs/changelog/113187.yaml

* Revert "Switch tests to use Java standard Clock class"

This reverts commit 241074c735.

* PR Changes

* PR Changes - Improve docs wording

Co-authored-by: Mary Gouseti <mgouseti@gmail.com>

* Integration test for new ILM explain field

* Use ROOT locale instead of default toLowerCase

* PR Changes - Switch to block strings

* Remove forbidden API usage

---------

Co-authored-by: Mary Gouseti <mgouseti@gmail.com>
2024-09-30 11:44:46 +01:00
Liam Thompson
6e400c12a7
[DOCS] Port connector docs from Enterprise Search guide (#112953) 2024-09-30 10:22:37 +02:00
István Zoltán Szabó
436c6c85ff
[DOCS] Adds an admonition to the transform painless examples. (#113706) 2024-09-30 09:28:28 +02:00
Kostas Krikellas
c9f378da29
Revert "Apply auto-flattening to subobjects: auto (#112092)" (#113692)
* Revert "Apply auto-flattening to `subobjects: auto` (#112092)"

This reverts commit fffe8844

* fix DataGenerationHelper
2024-09-30 10:11:15 +03:00
Sam Xiao
6917f1679a
Tag redacted document in ingest pipeline (#113552)
Adds a new option trace_redact in redact processor to indicate a document has been redacted in the ingest pipeline. If a document is processed by a redact processor AND any field is redacted, ingest metadata _ingest._redact._is_redacted = true will be set.
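The behavior described above can be sketched roughly as follows. This is a simplified model, not the processor's implementation: it uses plain regular expressions where the real redact processor uses Grok patterns, and the `redact` function and its arguments are hypothetical names chosen for illustration.

```python
import copy
import re


def redact(doc: dict, field: str, patterns: dict, trace_redact: bool = False) -> dict:
    """Replace pattern matches in `field`; flag the document only if something changed."""
    out = copy.deepcopy(doc)
    original = out[field]
    redacted = original
    for name, regex in patterns.items():
        redacted = re.sub(regex, f"<{name}>", redacted)
    out[field] = redacted
    # Assumption: the flag is set only when trace_redact is on AND a field was redacted.
    if trace_redact and redacted != original:
        out.setdefault("_ingest", {}).setdefault("_redact", {})["_is_redacted"] = True
    return out


ip_pattern = {"IP": r"\b\d{1,3}(?:\.\d{1,3}){3}\b"}
print(redact({"message": "login from 10.0.0.1"}, "message", ip_pattern, trace_redact=True))
print(redact({"message": "nothing sensitive"}, "message", ip_pattern, trace_redact=True))
```

The second call leaves the document untouched, so no `_ingest._redact` metadata is added.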

Closes #94633
2024-09-27 12:24:24 -04:00
Mike Pellegrini
8344d3a6ac
Add inner hits support to semantic query (#111834)
Adds inner hits support to the semantic query through a restricted inner_hits parameter, which exposes from and size from the inner_hits options
2024-09-27 10:51:11 -04:00
Luke Whiting
db632ee3cd
Note in docs about interpreting IO stats when running in docker (#113676)
* Note in docs about incorrect IO stats when running in docker

* Update docs/reference/cluster/nodes-stats.asciidoc

Co-authored-by: David Turner <david.turner@elastic.co>

* Requested PR changes to wording

* Update docs/reference/cluster/nodes-stats.asciidoc

Co-authored-by: David Turner <david.turner@elastic.co>

---------

Co-authored-by: David Turner <david.turner@elastic.co>
2024-09-27 13:32:23 +01:00
Carson Ip
4932697142
[docs] Fix typo in repository-s3.asciidoc (#113678)
Same as #113572 but targeting `8.x` and `main`.
2024-09-27 20:36:22 +10:00
Mark Tozzi
122e728820
[ESQL] Add TO_DATE_NANOS conversion function (#112150)
Resolves #111842

This adds a conversion function that yields DATE_NANOS. Mostly this is straightforward.

It is worth noting that when converting a millisecond date into a nanosecond date, the conversion function truncates it to 0 nanoseconds (i.e. first nanosecond of that millisecond). This is, of course, a bit of an assumption, but I don't have a better assumption we can make. I'd thought about adding a second, optional, parameter to control this behavior, but it's important that TO_DATE_NANOS extend AbstractConvertFunction, which itself extends UnaryScalarFunction, so that it will work correctly with union types. Also, it's unlikely the user will have any better guess than we do for filling in the nanoseconds.

Making that assumption does, however, create some weirdness. Consider two comparisons:

TO_DATETIME("2023-03-23T12:15:03.360103847") == TO_DATETIME("2023-03-23T12:15:03.360") will return true while TO_DATE_NANOS("2023-03-23T12:15:03.360103847") == TO_DATE_NANOS("2023-03-23T12:15:03.360") will return false. This is akin to casting between longs and doubles, where things may compare equal in one type that are not equal in the other. This seems fine, and I can't think of a better way to do it, but it's worth being aware of.
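To make the two comparisons concrete, here is a small sketch that models both types as integer counts since the epoch. It is not ES|QL code: the helper names are hypothetical, UTC is assumed, and only the truncation behavior described above is modeled.

```python
from datetime import datetime, timezone

NANOS_PER_SECOND = 1_000_000_000
NANOS_PER_MILLI = 1_000_000


def parse_epoch_nanos(ts: str) -> int:
    """Parse 'YYYY-MM-DDTHH:MM:SS[.fraction]' (UTC assumed) into epoch nanoseconds."""
    base, _, frac = ts.partition(".")
    dt = datetime.strptime(base, "%Y-%m-%dT%H:%M:%S").replace(tzinfo=timezone.utc)
    seconds = int(dt.timestamp())
    nanos = int(frac.ljust(9, "0")[:9]) if frac else 0
    return seconds * NANOS_PER_SECOND + nanos


def to_datetime_millis(ts: str) -> int:
    """Millisecond-resolution value: sub-millisecond digits are dropped."""
    return parse_epoch_nanos(ts) // NANOS_PER_MILLI


def to_date_nanos(ts: str) -> int:
    """Nanosecond-resolution value: all nine fractional digits are kept."""
    return parse_epoch_nanos(ts)


def millis_to_nanos(millis: int) -> int:
    """Millisecond date converted to nanoseconds: first nanosecond of that millisecond."""
    return millis * NANOS_PER_MILLI


a = "2023-03-23T12:15:03.360103847"
b = "2023-03-23T12:15:03.360"
print(to_datetime_millis(a) == to_datetime_millis(b))
print(to_date_nanos(a) == to_date_nanos(b))
```

In this model the millisecond values compare equal while the nanosecond values do not, which mirrors the long/double analogy in the text.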

---------

Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>
2024-09-26 12:03:01 -04:00
Stef Nestor
0ea8a78ca7
(Doc+) Avoid search pile up by setting default timeout (#112846)
👋! Mini doc PR to say you can avoid search task pile-ups by setting [`search.default_search_timeout`](https://www.elastic.co/guide/en/elasticsearch/reference/current/search-your-data.html#search-timeout) under [High JVM > avoid expensive searches](https://www.elastic.co/guide/en/elasticsearch/reference/master/high-jvm-memory-pressure.html#reduce-jvm-memory-pressure).
2024-09-26 09:05:21 -06:00
kosabogi
6e73c1423b
Adds text_similarity task type to inference processor documentation (#113517) 2024-09-26 16:12:28 +02:00
István Zoltán Szabó
5e019998ef
[DOCS] Improves semantic text documentation. (#113606) 2024-09-26 16:09:28 +02:00
Kostas Krikellas
fffe8844e9
Apply auto-flattening to subobjects: auto (#112092)
* Introduce mode `subobjects=auto` for objects

* Update docs/changelog/110524.yaml

* compilation error

* tests and fixes

* refactor

* spotless

* more tests

* fix nested objects

* fix test

* update fetch test

* add QA coverage

* update tests

* update tests

* update tests

* Apply auto-flattening to `subobjects: auto`

* Update docs/changelog/112092.yaml

* sync

* dont flatten subobjects auto

* refine test

* fix path for nested flattened objects and dynamic

* document `subobjects: auto`

* Apply suggestions from code review

Co-authored-by: Felix Barnsteiner <felixbarny@users.noreply.github.com>

* comment updates

* restore indentation in comment

* update comment

* update comment

* update comment

* update comment

* rename isFlattenable

* add test for dynamic template

* fix copy_to and noop dynamic updates

* tests

* update comment

* fix tests

* update cluster feature in yaml test

* address comments

---------

Co-authored-by: Felix Barnsteiner <felixbarny@users.noreply.github.com>
2024-09-26 11:42:40 +03:00
Keith Massey
cd950bb2fa
Adding component template substitutions to the simulate ingest API (#113276) 2024-09-25 15:30:22 -05:00
Mayya Sharipova
c18c531d72
Deprecate legacy params from range query (#113286)
Deprecate to, from, include_lower, include_upper range query params.
These params were removed from our documentation in v0.90.4 (d6ecdecc19),
but did not go through a deprecation cycle.

These params are to be removed in v9.0.
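For anyone migrating queries, the sketch below rewrites the legacy params into `gt`/`gte`/`lt`/`lte`. The `modernize_range` helper is hypothetical, and it assumes `include_lower` and `include_upper` default to `true` when omitted; verify that against your own queries before relying on it.

```python
LEGACY_KEYS = {"from", "to", "include_lower", "include_upper"}


def modernize_range(params: dict) -> dict:
    """Rewrite legacy range params for one field as gt/gte/lt/lte."""
    # Keep everything that is not a legacy key (e.g. "format", "boost").
    out = {key: value for key, value in params.items() if key not in LEGACY_KEYS}
    if params.get("from") is not None:
        lower_key = "gte" if params.get("include_lower", True) else "gt"
        out[lower_key] = params["from"]
    if params.get("to") is not None:
        upper_key = "lte" if params.get("include_upper", True) else "lt"
        out[upper_key] = params["to"]
    return out


print(modernize_range({"from": 10, "to": 20, "include_upper": False}))
```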

Related to #81276

Closes #48538
2024-09-25 14:48:45 -04:00
Liam Thompson
4f666310c7
[DOCS] Create Elasticsearch basics section, refactor quickstarts section (#112436)
Co-authored-by: shainaraskas <58563081+shainaraskas@users.noreply.github.com>
2024-09-25 17:30:01 +02:00
Smriti
0638d3977a
Update index-templates.asciidoc (#113461)
Adding `security_solution-*-*` to the list of index names to avoid pattern collisions.
2024-09-25 13:55:17 +02:00
Luigi Dell'Aquila
7ba26892f3
ES|QL: make CSV date tests more friendly for Java 23 (#113472)
Following [this
suggestion](https://github.com/elastic/elasticsearch/pull/113376#issuecomment-2370817089),
switching date patterns from week years to calendar years, that have the
same behavior in java <=22 and java 23.
2024-09-25 02:57:22 +10:00
Valeriy Khakhutskyy
7b7dd91f62
[ML] Add documentation for post calendar events API (#113188)
This PR updates the documentation for the extension of the POST calendar events API implemented in #112837.
2024-09-24 09:46:42 +02:00
Sam Xiao
80dd56398f
ILM: Add total_shards_per_node setting to searchable snapshot (#112972)
Allows setting index total_shards_per_node in the SearchableSnapshot action of ILM to remediate hot spots in shard allocation for searchable snapshot indices.

Closes #112261
2024-09-23 13:37:58 -04:00
Nik Everett
58021c3405
ESQL: TOP support for strings (#113183)
Adds support to the `TOP` aggregation for `keyword` and `text` field
types.

Closes #109849
2024-09-24 03:00:18 +10:00
Salvatore Campagna
208a1fe571
Introduce an ignore_above index-level setting (#113121)
Here we introduce a new index-level setting, `ignore_above`, similar to what we have
for `ignore_malformed`. The setting will apply to all `keyword`, `wildcard` and `flattened`
fields. Each field mapping will still be allowed to override the index-level setting using a
mapping-level `ignore_above` value.
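The override rule can be sketched as below. This is an illustrative model only: the function names are hypothetical, and it assumes the usual `ignore_above` meaning, where strings longer than the limit are not indexed.

```python
from typing import Optional


def effective_ignore_above(index_level: Optional[int], field_level: Optional[int]) -> Optional[int]:
    """A mapping-level value, when present, overrides the index-level setting."""
    return field_level if field_level is not None else index_level


def is_indexed(value: str, limit: Optional[int]) -> bool:
    """Strings longer than the limit are skipped; no limit means everything is indexed."""
    return limit is None or len(value) <= limit


index_setting = 8
print(is_indexed("elasticsearch", effective_ignore_above(index_setting, None)))
print(is_indexed("elasticsearch", effective_ignore_above(index_setting, 32)))
```

With an index-level limit of 8, the 13-character value is skipped unless the field raises its own limit.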
2024-09-23 18:05:02 +02:00
Pm Ching
e7bbcb9883
fix typos (#113329) 2024-09-23 17:38:51 +02:00
David Kyle
884196ced3
[ML] Add deployment threading details and memory usage to telemetry (#113099)
Adds deployment threading options and a new memory section reporting 
the memory usage for each of the ml features
2024-09-23 16:08:46 +01:00
Nik Everett
3de17b6d44
ESQL: Document esql_worker threadpool (#113203)
Documents the thread pool we use to run ESQL operations. It's the same
size and queue depth as the `search` thread pool.

Closes #113130
2024-09-23 11:03:37 -04:00
Liam Thompson
5750696069
[DOCS] Add snippet tests to retriever API docs (#113289) 2024-09-23 16:59:01 +02:00
Jedr Blaszyk
ce79fa4847
Add view_index_metadata connector permission for fleet-server account (#113262)
* Add view_index_metadata to fleet-server for elastic_connectors package

* Fix typo
2024-09-23 10:19:00 +02:00
Felix Barnsteiner
8d223cbf7a
Add support for multi-value dimensions (#112645)
Closes https://github.com/elastic/elasticsearch/issues/110387

Having this in now affords us not having to introduce version checks in
the ES exporter later. We can simply use the same serialization logic
for metric attributes as we do for other signals. This also enables us
to properly map `*.ip` fields to the ip field type as ip fields
containing a list of IPs are not converted to a comma-separated list.
2024-09-23 17:31:18 +10:00
Martijn van Groningen
c21342a499
Added known issue entry for synthetic source bug. (#113269)
Added known issue entry for synthetic source bug.

Co-authored-by: Oleksandr Kolomiiets <olkolomiiets@gmail.com>
2024-09-23 07:06:55 +02:00
Pm Ching
d68f2fa4a6
fix a couple of docs typos (#112901) 2024-09-20 18:34:24 +03:00
Bogdan Pintea
f7ff00f645
ESQL: Align year diffing to the rest of the units in DATE_DIFF: chronological (#113103)
This will correct/switch "year" unit diffing from the current integer
subtraction to a chronological subtraction. Consequently, two dates are (at
least) one year apart now if (at least) a full calendar year separates
them. The previous implementation simply subtracted the year part of the
dates.

Note: this departs from ES SQL's implementation of the same function,
which itself is aligned with MS SQL's implementation, which works
equivalently to an integer subtraction.
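The difference between the two approaches can be sketched as follows. This is a simplified model on calendar dates only (no time of day), and the function names are hypothetical rather than taken from the ES|QL code.

```python
from datetime import date


def year_diff_integer(start: date, end: date) -> int:
    """Previous behavior: subtract the year parts and ignore month and day."""
    return end.year - start.year


def year_diff_chronological(start: date, end: date) -> int:
    """New behavior: count only full calendar years between the two dates."""
    years = end.year - start.year
    if years > 0 and (end.month, end.day) < (start.month, start.day):
        years -= 1  # the last year is not yet complete
    elif years < 0 and (end.month, end.day) > (start.month, start.day):
        years += 1  # same adjustment when end precedes start
    return years


start, end = date(2023, 12, 31), date(2024, 1, 1)
print(year_diff_integer(start, end), year_diff_chronological(start, end))
```

For dates one day apart across New Year, the integer version reports a year while the chronological version reports none.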

Fixes #112482.
2024-09-20 20:21:29 +10:00
István Zoltán Szabó
9b7d808bf4
[DOCS] Fixes adaptive_allocations examples (#113248)
Co-authored-by: Jan Kuipers <148754765+jan-elastic@users.noreply.github.com>
2024-09-20 11:31:04 +02:00