Your window into the Elastic Stack
Find a file
Marco Antonio Ghiani c3fdb39dfb
[Streams 🌊] Enrichment simulation behaviour improvements (#209985)
## 📓 Summary

Part of https://github.com/elastic/streams-program/issues/127
Closes https://github.com/elastic/streams-program/issues/114

This update overhauls the internal logic of our processing simulation
endpoint. It now runs parallel simulations (pipeline and, conditionally,
ingest) to extract detailed document reports and processor metrics,
while also handling a host of edge cases.

The key improvements include:

- **Parallel Simulation Execution**  
Executes both pipeline and ingest simulations concurrently. The pipeline
simulation always runs to extract per-document reports and metrics. The
ingest simulation runs conditionally when detected fields are provided,
enabling fast failures on mapping mismatches.

- **Document Reporting & Metrics**  
Extracts granular differences between source and simulated documents.
Reports include:
- Field-level diffs indicating which processor added or updated fields.
- Detailed error messages (e.g., generic processor failure, generic
simulation failure, non-additive processor failure).
- Calculation of overall success and failure rates, as well as
per-processor metrics.

- **Sequential Processors & Field Overriding**  
Supports multiple sequential processors. In cases where later processors
override fields produced by earlier ones, the logic bypasses
non-additive checks to accept the new value.

- **Robust Handling of Partial & Failed Simulations**  
  Simulations now correctly mark documents as:
  - **Parsed** when all processors succeed.
  - **Partially parsed** when some processors fail.
- **Failed** when none of the processors processing the document
succeed.

- **Mapping Validation & Non-Additive Detection**  
The simulation verifies that the detected field mappings are compatible.
If a processor introduces non-additive changes—updating an existing
field rather than appending—the simulation flags the error and sets a
dedicated `is_non_additive_simulation` flag. Additionally, a failed
ingest simulation (e.g., due to incompatible mapping types) results in
an immediate failure.

The final returned API response adheres to the following TypeScript
type:

```typescript
interface SimulationResponse {
  detected_fields: DetectedField[];
  documents: SimulationDocReport[];
  processors_metrics: Record<string, ProcessorMetrics>;
  failure_rate: number;
  success_rate: number;
  is_non_additive_simulation: boolean;
}
```

## Updated tests
```
Processing Simulation
├── Successful simulations
│   ├── should simulate additive processing
│   ├── should simulate with detected fields
│   ├── should simulate multiple sequential processors
│   ├── should simulate partially parsed documents
│   ├── should return processor metrics
│   ├── should return accurate success/failure rates
│   ├── should allow overriding fields detected by previous simulation processors (skip non-additive check)
│   ├── should gracefully return the errors for each partially parsed or failed document
│   ├── should gracefully return failed simulation errors
│   ├── should gracefully return non-additive simulation errors
│   └── should return the is_non_additive_simulation simulation flag
└── Failed simulations
    └── should fail with incompatible detected field mappings
```

## 🚨 API Failure Conditions & Handler Corner Cases

The simulation API handles and reports the following corner cases:

- **Pipeline Simulation Failures** _(Gracefully reported)_
- Syntax errors in processor configurations (e.g., malformed grok
patterns) trigger a pipeline-level failure with detailed error
information (processor ID, error type, and message).

- **Non-Additive Processor Behavior**  _(Gracefully reported)_
- If a processor modifies fields already present in the source document
rather than strictly appending new fields, the simulation flags this as
a non-additive change.
- The error is recorded both at the document level (resulting in a
"partially_parsed" or "failed" status) and within per-processor metrics,
with the global flag `is_non_additive_simulation` set to true.
  
- **Partial Document Processing**  _(Gracefully reported)_
- In scenarios with sequential processors where the first processor
succeeds (e.g., a dissect processor) and the subsequent grok processor
fails, documents are marked as "partially_parsed."
- These cases are reflected in the overall success/failure rates and
detailed per-document error lists.

- **Field Overriding**  
- When a later processor intentionally overrides fields (for instance,
reassigning a previously calculated field), the simulation bypasses the
non-additive check, and detected fields are aggregated accordingly,
noting both the original and overridden values.

- **Mapping Inconsistencies**  _(API failure bad request)_
- When the ingest simulation detects incompatibility between the
provided detected field mappings (such as defining a field as a boolean
when it should be a date) and the source document, it immediately fails.
- The failure response includes an error message explaining the
incompatibility.

## 🔜 Follow-up Work

- **Integrate Schema Editor**  
Given the improved support for detected fields, a follow up PR will
introduced the Schema Editor and will allow mapping along the data
enrichment.
- **Granular filtering and report**
Having access to more granular details such as status, errors and
detected fields for each documents, we could enhance the table with
additional information and better filters. cc @LucaWintergerst @patpscal

## 🎥 Demo recordings


https://github.com/user-attachments/assets/29f804eb-6dd4-4452-a798-9d48786cbb7f

---------

Co-authored-by: Jean-Louis Leysens <jloleysens@gmail.com>
Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>
2025-02-19 11:25:41 +01:00
.buildkite Deprecate universal entity 2025-02-18 16:49:32 +02:00
.devcontainer Removing experimental for the FIPS mode config (#200734) 2024-11-19 15:23:20 -05:00
.github SKA: Fix kebab-case issues in security-threat-hunting packages (#211349) 2025-02-19 11:21:45 +01:00
api_docs [api-docs] 2025-02-19 Daily api_docs build (#211680) 2025-02-19 08:19:06 +01:00
config Enable the new Borealis theme (#210468) 2025-02-14 09:39:01 +01:00
dev_docs Revert "[ResponseOps] Document creating task-manager serverless monitoring assets - adding to kibana dev docs navigation" (#211030) 2025-02-13 18:09:06 +01:00
docs SKA: Fix kebab-case issues in observability-ui packages (#211248) 2025-02-14 19:34:14 +01:00
examples chore(NA): upgrade to webpack 5 (#191106) 2025-02-14 03:01:36 +00:00
kbn_pm Sustainable Kibana Architecture: Move CodeEditor related packages #205587 (#205738) 2025-01-08 15:25:47 +01:00
legacy_rfcs remove timeslider RFC (#211305) 2025-02-14 11:39:48 -07:00
licenses Adds AGPL 3.0 license (#192025) 2024-09-06 19:02:41 -06:00
oas_docs Deprecate universal entity 2025-02-18 16:49:32 +02:00
packages SKA: Fix kebab-case issues in security-threat-hunting packages (#211349) 2025-02-19 11:21:45 +01:00
plugins
scripts SKA: Update broken references and URLs (#206836) 2025-01-28 03:32:48 +00:00
src [Streams 🌊] Enrichment simulation behaviour improvements (#209985) 2025-02-19 11:25:41 +01:00
test Consolidate all language definitions used in monaco within the @kbn/monaco package (#208950) 2025-02-18 19:27:36 +01:00
typings make emotion typing global (#200958) 2024-12-05 12:20:43 -06:00
x-pack [Streams 🌊] Enrichment simulation behaviour improvements (#209985) 2025-02-19 11:25:41 +01:00
.backportrc.json chore(NA): adds 9.0 into backportrc (#208999) 2025-01-31 06:04:50 +00:00
.bazelignore Remove references to deleted .ci folder (#177168) 2024-02-20 19:54:21 +01:00
.bazeliskversion chore(NA): upgrade bazelisk into v1.11.0 (#125070) 2022-02-09 20:43:57 +00:00
.bazelrc chore(NA): use new and more performant BuildBuddy servers (#130350) 2022-04-18 02:01:38 +01:00
.bazelrc.common Transpile packages on demand, validate all TS projects (#146212) 2022-12-22 19:00:29 -06:00
.bazelversion chore(NA): revert bazel upgrade for v5.2.0 (#135096) 2022-06-24 03:57:21 +01:00
.browserslistrc Add Firefox ESR to browserlistrc (#184462) 2024-05-29 17:53:18 -05:00
.editorconfig
.eslintignore SKA: Fix outdated eslint rules (#206961) 2025-01-28 08:49:39 +01:00
.eslintrc.js [scout] enable eslint plugin playwright (#210666) 2025-02-14 10:21:49 +01:00
.gitattributes
.gitignore [Investigate App] add MVP evaluation framework for AI root cause analysis integration (#204634) 2025-01-17 12:16:10 -05:00
.i18nrc.json SKA: Fix kebab-case issues in security-threat-hunting packages (#211349) 2025-02-19 11:21:45 +01:00
.node-version Upgrade Node.js to 20.18.2 (#207431) 2025-01-22 12:00:14 -06:00
.npmrc [npmrc] Fix puppeteer_skip_download configuration (#177673) 2024-02-22 18:59:01 -07:00
.nvmrc Upgrade Node.js to 20.18.2 (#207431) 2025-01-22 12:00:14 -06:00
.prettierignore
.prettierrc
.puppeteerrc Add .puppeteerrc (#179847) 2024-04-03 09:14:39 -05:00
.stylelintignore
.stylelintrc Bump stylelint to ^14 (#136693) 2022-07-20 10:11:00 -05:00
.telemetryrc.json Sustainable Kibana Architecture: Move modules owned by @elastic/kibana-core (#201653) 2025-01-04 11:47:24 -07:00
.yarnrc
BUILD.bazel Transpile packages on demand, validate all TS projects (#146212) 2022-12-22 19:00:29 -06:00
catalog-info.yaml Configures PagerDuty Backstage Integration for kbn (#208440) 2025-01-27 23:29:11 +00:00
CODE_OF_CONDUCT.md
CONTRIBUTING.md
FAQ.md Fix small typos in the root md files (#134609) 2022-06-23 09:36:11 -05:00
fleet_packages.json [main] Sync bundled packages with Package Storage (#211272) 2025-02-17 13:18:22 +01:00
github_checks_reporter.json
kibana.d.ts Adds AGPL 3.0 license (#192025) 2024-09-06 19:02:41 -06:00
LICENSE.txt Adds AGPL 3.0 license (#192025) 2024-09-06 19:02:41 -06:00
NOTICE.txt [api-docs] 2025-01-01 Daily api_docs build (#205342) 2025-01-01 01:37:13 -06:00
package.json SKA: Fix kebab-case issues in security-threat-hunting packages (#211349) 2025-02-19 11:21:45 +01:00
preinstall_check.js Adds AGPL 3.0 license (#192025) 2024-09-06 19:02:41 -06:00
README.md
renovate.json [scout] enable eslint plugin playwright (#210666) 2025-02-14 10:21:49 +01:00
RISK_MATRIX.mdx
run_fleet_setup_parallel.sh Sustainable Kibana Architecture: Move modules owned by @elastic/fleet (#202422) 2024-12-24 15:32:43 +01:00
SECURITY.md
sonar-project.properties [sonarqube] update memory, cpu (#190547) 2024-09-09 16:16:30 -05:00
STYLEGUIDE.mdx [styleguide] update path to scss theme (#140742) 2022-09-15 10:41:14 -04:00
tsconfig.base.json SKA: Fix kebab-case issues in security-threat-hunting packages (#211349) 2025-02-19 11:21:45 +01:00
tsconfig.browser.json
tsconfig.browser_bazel.json
tsconfig.json Transpile packages on demand, validate all TS projects (#146212) 2022-12-22 19:00:29 -06:00
TYPESCRIPT.md Fix small typos in the root md files (#134609) 2022-06-23 09:36:11 -05:00
updatecli-compose.yaml deps(updatecli): bump all policies (#195865) 2024-10-15 07:37:12 -05:00
versions.json chore(NA): update versions after v8.17.3 bump (#210648) 2025-02-12 01:23:33 +00:00
WORKSPACE.bazel Upgrade Node.js to 20.18.2 (#207431) 2025-01-22 12:00:14 -06:00
yarn.lock SKA: Fix kebab-case issues in security-threat-hunting packages (#211349) 2025-02-19 11:21:45 +01:00

Kibana

Kibana is your window into the Elastic Stack. Specifically, it's a browser-based analytics and search dashboard for Elasticsearch.

Getting Started

If you just want to try Kibana out, check out the Elastic Stack Getting Started Page to give it a whirl.

If you're interested in diving a bit deeper and getting a taste of Kibana's capabilities, head over to the Kibana Getting Started Page.

Using a Kibana Release

If you want to use a Kibana release in production, give it a test run, or just play around:

Building and Running Kibana, and/or Contributing Code

You might want to build Kibana locally to contribute some code, test out the latest features, or try out an open PR:

Documentation

Visit Elastic.co for the full Kibana documentation.

For information about building the documentation, see the README in elastic/docs.

Version Compatibility with Elasticsearch

Ideally, you should be running Elasticsearch and Kibana with matching version numbers. If your Elasticsearch has an older version number or a newer major number than Kibana, then Kibana will fail to run. If Elasticsearch has a newer minor or patch number than Kibana, then the Kibana Server will log a warning.

Note: The version numbers below are only examples, meant to illustrate the relationships between different types of version numbers.

Situation Example Kibana version Example ES version Outcome
Versions are the same. 7.15.1 7.15.1 💚 OK
ES patch number is newer. 7.15.0 7.15.1 ⚠️ Logged warning
ES minor number is newer. 7.14.2 7.15.0 ⚠️ Logged warning
ES major number is newer. 7.15.1 8.0.0 🚫 Fatal error
ES patch number is older. 7.15.1 7.15.0 ⚠️ Logged warning
ES minor number is older. 7.15.1 7.14.2 🚫 Fatal error
ES major number is older. 8.0.0 7.15.1 🚫 Fatal error

Questions? Problems? Suggestions?

  • If you've found a bug or want to request a feature, please create a GitHub Issue. Please check to make sure someone else hasn't already created an issue for the same topic.
  • Need help using Kibana? Ask away on our Kibana Discuss Forum and a fellow community member or Elastic engineer will be glad to help you out.