Adds a new health indicator that reports problems if indexes have a block placed on them, or if
any nodes in the cluster are running low on disk space.
This PR expands the approximate kNN docs to clarify that the filter is applied during
the kNN search, not after it, and explains the downsides of post-filtering.
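For example, a filtered approximate kNN search looks roughly like this (the index, field names, and values are only illustrative):
```
POST image-index/_search
{
  "knn": {
    "field": "image-vector",
    "query_vector": [54, 10, -2],
    "k": 5,
    "num_candidates": 50,
    "filter": {
      "term": {
        "file-type": "png"
      }
    }
  }
}
```
Because the `filter` is applied during the approximate kNN search, the query can still return up to `k` matching hits, whereas post-filtering could leave fewer than `k` results.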
These stats are exposed in NodeIndicesStats only at the node and index (but not shard) levels, and are also visible in the _cat/nodes table. An exact-count YAML REST test is added as well.
Introduce max headroom settings for the low, high, and flood-stage disk watermarks, similar to the existing max headroom setting for the flood stage of the frozen tier. Introduce the new max headrooms in HealthMetadata and in ReactiveStorageDeciderService. Add multiple tests in DiskThresholdDeciderUnitTests, DiskThresholdDeciderTests and DiskThresholdMonitorTests. Also add addition, subtraction, and min operations for ByteSizeValue.
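For illustration, assuming the new settings follow the existing disk watermark naming, they could be updated dynamically roughly like this (the values are only examples):
```
PUT _cluster/settings
{
  "persistent": {
    "cluster.routing.allocation.disk.watermark.low.max_headroom": "200gb",
    "cluster.routing.allocation.disk.watermark.high.max_headroom": "150gb",
    "cluster.routing.allocation.disk.watermark.flood_stage.max_headroom": "100gb"
  }
}
```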
This troubleshooting guide is what will be returned from the SLM health indicator
when an SLM policy has suffered too many repeated failures without a successful
execution.
This warning was lost in #83489, but it's important we have it in these
docs since users keep on trying this kind of invalid upgrade. This
commit reinstates the lost warning.
I've been hacking on synthetic source for a while now and not seen any
need to break backwards compatibility or any major bugs. I think it's
time to remove the `preview` marker from it so folks can use it without
fear.
It seems that for now we don't have a good use for the histogram and summary metric types.
They had been left as placeholders for a while, but at this point there is no concrete plan forward for them.
This PR removes the histogram and summary metric types. We may add them back in the future.
It also completely removes the time_series_metric mapping parameter from the histogram field type and only allows the gauge metric type for aggregate_metric_double fields.
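For example, after this change the only valid shape looks roughly like the following sketch (index and field names are made up):
```
PUT metrics-index
{
  "mappings": {
    "properties": {
      "cpu_usage": {
        "type": "aggregate_metric_double",
        "metrics": ["min", "max", "sum", "value_count"],
        "default_metric": "max",
        "time_series_metric": "gauge"
      }
    }
  }
}
```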
Categorization of strings which break down into a huge number of tokens can cause the C++ backend process to choke - see elastic/ml-cpp#2403.
This PR adds a limit filter to the default categorization analyzer which caps the number of tokens passed to the backend at 100.
Unfortunately this isn't a complete panacea for all the issues surrounding categorization of large, many-token messages, as verification checks on the frontend can also fail due to calls to the datafeed _preview API returning an excessive amount of data.
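For illustration, the cap is the same idea as adding a `limit` token filter to a custom categorization analyzer; a simplified sketch of a job's `analysis_config` (not the literal default analyzer definition) might look like:
```
"analysis_config": {
  "categorization_field_name": "message",
  "categorization_analyzer": {
    "tokenizer": "ml_standard",
    "filter": [
      { "type": "limit", "max_token_count": 100 }
    ]
  }
}
```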
ES_JAVA_OPTS is still the correct way to pass options to
the Elasticsearch process; CLI_JAVA_OPTS affects only the
command-line tools. CLI_JAVA_OPTS is therefore the correct way
to pass options for plugin installation or other tools.
The get snapshot status API currently returns a `state` value of `STARTED` for a snapshot that is running, while the documentation says that the `state` value for a running snapshot is `IN_PROGRESS`. This documentation change aligns the docs with the actual result of the get snapshot status API.
Co-authored-by: Austin Smith <76973609+asmith-elastic@users.noreply.github.com>
Part of the stable master history health indicator's results (the
`cluster_formation` section within `details`) used dynamic keys in a
map. This change gets rid of that, so instead of:
```
"details": {
"current_master": {
"node_id": null,
"name": null
},
"recent_masters": [
{
"node_id": "31WBm9iTTRuMyWnBhWNUGA",
"name": "master-node-3"
}
],
"cluster_formation": {
"31WBm9iTTRuMyWnBhWNUGA": "master not discovered or elected yet, an election requires at least 2 nodes with ids from [nADkAeGsT-q12gw89Ga1FA, 31WBm9iTTRuMyWnBhWNUGA, w8v48JvuRsuDCjwBn8KbRw], have only discovered non-quorum [{master-node-3}{31WBm9iTTRuMyWnBhWNUGA}{lJmGYiTPS_W7AJU7csG_gQ}{master-node-3}{127.0.0.1}{127.0.0.1:9301}{dm}]; discovery will continue using [127.0.0.1:9300, 127.0.0.1:9302, 127.0.0.1:9303, 127.0.0.1:9304, 127.0.0.1:9305, [::1]:9300, [::1]:9302, [::1]:9303, [::1]:9304, [::1]:9305] from hosts providers and [{master-node-2}{nADkAeGsT-q12gw89Ga1FA}{logzEHuuTpqwJp-RWssBPw}{master-node-2}{127.0.0.1}{127.0.0.1:9300}{dm}, {master-node-3}{31WBm9iTTRuMyWnBhWNUGA}{lJmGYiTPS_W7AJU7csG_gQ}{master-node-3}{127.0.0.1}{127.0.0.1:9301}{dm}] from last-known cluster state; node term 39, last-accepted version 461 in term 39"
}
}
```
We will have:
```
"details": {
"current_master": {
"node_id": null,
"name": null
},
"recent_masters": [
{
"node_id": "31WBm9iTTRuMyWnBhWNUGA",
"name": "master-node-3"
}
],
"cluster_formation": [
{
"node_id": "31WBm9iTTRuMyWnBhWNUGA",
"cluster_formation_message": "master not discovered or elected yet, an election requires at least 2 nodes with ids from [nADkAeGsT-q12gw89Ga1FA, 31WBm9iTTRuMyWnBhWNUGA, w8v48JvuRsuDCjwBn8KbRw], have only discovered non-quorum [{master-node-3}{31WBm9iTTRuMyWnBhWNUGA}{lJmGYiTPS_W7AJU7csG_gQ}{master-node-3}{127.0.0.1}{127.0.0.1:9301}{dm}]; discovery will continue using [127.0.0.1:9300, 127.0.0.1:9302, 127.0.0.1:9303, 127.0.0.1:9304, 127.0.0.1:9305, [::1]:9300, [::1]:9302, [::1]:9303, [::1]:9304, [::1]:9305] from hosts providers and [{master-node-2}{nADkAeGsT-q12gw89Ga1FA}{logzEHuuTpqwJp-RWssBPw}{master-node-2}{127.0.0.1}{127.0.0.1:9300}{dm}, {master-node-3}{31WBm9iTTRuMyWnBhWNUGA}{lJmGYiTPS_W7AJU7csG_gQ}{master-node-3}{127.0.0.1}{127.0.0.1:9301}{dm}] from last-known cluster state; node term 39, last-accepted version 461 in term 39"
}
]
}
```
The cross_fields scoring type can produce negative scores when some documents
are missing fields. When blending term document frequencies, we take the maximum
document frequency across all fields. If a field appears in fewer documents than
that blended maximum, its IDF can become negative, because IDF is calculated as
`Math.log(1 + (docCount - docFreq + 0.5) / (docFreq + 0.5))`. For example, with
`docCount = 5` and a blended `docFreq = 10`, the ratio is (5 - 10 + 0.5) / (10 + 0.5) ≈ -0.43,
giving log(0.57), which is negative.
This change adjusts the docFreq for each field to `Math.min(docCount, docFreq)`
so that the IDF can never become negative. It makes sense that the term document
frequency should never exceed the number of documents containing the field.
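For context, the scoring type in question is used by `multi_match` queries like the following sketch (index and field names are made up):
```
GET my-index/_search
{
  "query": {
    "multi_match": {
      "query": "Will Smith",
      "type": "cross_fields",
      "fields": ["first_name", "last_name"],
      "operator": "and"
    }
  }
}
```
If many documents have `first_name` but only a few have `last_name`, the blended document frequency taken from `first_name` can exceed the document count of `last_name`, which is exactly the case the `Math.min` clamp guards against.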
Requests to the bulk API comprise a sequence of items, each of which
starts with a JSON object describing the item. This object includes the
type of action to perform with the item, which should be one of `create`,
`update`, `index`, or `delete`. In earlier versions Elasticsearch would
ignore items with an unrecognized type, skipping the next line in the
request, but this lenient behaviour means that there is no way for the
client to associate the items in the response with the items in the
request, and in some cases it would cause the remainder of the request
to be parsed incorrectly.
With this commit, requests to the bulk API must comprise only items with
recognized types. Elasticsearch will reject requests containing any
items with an unrecognized type with a `400 Bad Request` error response.
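For reference, a well-formed `_bulk` request looks roughly like this (the index, ids, and fields are made up); each item starts with one of the recognized action types:
```
POST _bulk
{ "index": { "_index": "my-index", "_id": "1" } }
{ "message": "first document" }
{ "delete": { "_index": "my-index", "_id": "2" } }
```
Any other action name on an item's first line now results in a `400 Bad Request` instead of being silently skipped.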
This change adds the filter query for a filtered alias to the knn query during the dfs phase on the
shard. This ensures that the correct number of results (k) is returned, rather than results being
removed by a post filter.
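To illustrate, a filtered alias attaches a filter query to an index; a rough sketch (the alias, index, and filter values are made up):
```
POST _aliases
{
  "actions": [
    {
      "add": {
        "index": "image-index",
        "alias": "png-images",
        "filter": { "term": { "file-type": "png" } }
      }
    }
  ]
}
```
A kNN search against `png-images` now applies that `term` filter during the kNN search on each shard, so it can still return up to k hits.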
Fixes: #89561
* [Doc] Release notes for v8.4.1 (#89636)
* [Doc] Release notes for v8.4.1
Gradle generated release notes for v8.4.1
* address feedback
* [DOCS] Remove coming tag for 8.4.1 RNs
Co-authored-by: Yang Wang <yang.wang@elastic.co>
This adds support for synthetic _source to the `version` field type. It
works very similarly to `keyword` but with an extra decode step.
I modified the decoder to return a `BytesRef` instead of a `String`
because many of the callers seemed to be converting that string directly
back into bytes. Synthetic source would have needed to do that as well,
and so does the query infrastructure.
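For example, a mapping along these lines now supports synthetic source (index and field names are made up):
```
PUT my-index
{
  "mappings": {
    "_source": { "mode": "synthetic" },
    "properties": {
      "software_version": { "type": "version" }
    }
  }
}
```
With synthetic `_source` enabled, values of the `version` field are reconstructed from doc values via the decode step described above.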
* Create restart-cluster.asciidoc
As per https://github.com/elastic/elasticsearch/issues/49972 and https://github.com/elastic/elasticsearch/issues/56578, if a node is above the low disk watermark when it is restarted (rolling restart, network disruption, or crash), the disk threshold decider prevents reusing the shard content on the restarted node.
As a consequence, the node may take a long time to start.
* Update docs/reference/setup/restart-cluster.asciidoc
Co-authored-by: Adam Locke <adam.locke@elastic.co>