### Gemini connector: makes `maxOutputTokens` an optional, configurable request parameter
This PR updates the Gemini connector to make the `maxOutputTokens` request parameter optional and configurable. Previously, it was included in all requests with a non-configurable value of `8192`.
- `maxOutputTokens` is no longer included in requests to Gemini models by default
- It may optionally be included in requests by passing the new `maxOutputTokens` parameter to the Gemini connector's `invokeAI`, `invokeAIRaw`, and `invokeStream` APIs
The driver for this change was that some requests to newer models, e.g. Gemini 2.5 Pro, were failing because all requests from the Gemini connector were previously sent with the value `8192` [here](f3115c6746/x-pack/platform/plugins/shared/stack_connectors/server/connector_types/gemini/gemini.ts (L397)).
The previous, non-configurable limit was less than the output token limit of newer Gemini models, as illustrated by the following table:
| Model | Supported `maxOutputTokens` range |
|--------------------|------------------------------------|
| Gemini 1.5 Flash | 1 (inclusive) to 8193 (exclusive) |
| Gemini 1.5 Pro | 1 (inclusive) to 8193 (exclusive) |
| Gemini 1.5 Pro 002 | 1 (inclusive) to 32769 (exclusive) |
| Gemini 2.5 Pro | 1 (inclusive) to 65536 (exclusive) |
When newer Gemini models generated a response with more than `8192` output tokens, the API response from Gemini would:
- Not contain the generated output (because the `maxOutputTokens` limit was reached)
- Include a finish reason of `MAX_TOKENS`,
- Fail response validation in the connector, because the `usageMetadata` response metadata was missing the required `candidatesTokenCount` property, per the _Details_ below.
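The failing check can be illustrated with a simplified stand-in for the connector's response validation (a sketch; the real connector uses Kibana's schema utilities, and the interface below is an assumption):

```typescript
// Simplified shape of the usageMetadata block returned by Gemini.
// candidatesTokenCount is absent when finishReason is MAX_TOKENS.
interface UsageMetadata {
  promptTokenCount: number;
  candidatesTokenCount?: number;
  totalTokenCount: number;
}

// Stand-in for the schema check that previously rejected MAX_TOKENS
// responses: returns the validation error message, or null when valid.
function validateUsageMetadata(usage: UsageMetadata): string | null {
  if (typeof usage.candidatesTokenCount !== 'number') {
    return '[usageMetadata.candidatesTokenCount]: expected value of type [number] but got [undefined]';
  }
  return null;
}
```

A response truncated at the limit omits `candidatesTokenCount`, so the check above returns the error message seen in the traces below.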
#### Details
Attack discovery requests to Gemini 2.5 Pro were failing with errors like the following:
```
ActionsClientLlm: action result status is error: an error occurred while running the action - Response validation failed (Error: [usageMetadata.candidatesTokenCount]: expected value of type [number] but got [undefined])
```
A full example error trace appears in the _Details_ below (click to expand):
<details>
<summary>Full error trace</summary>

```
ActionsClientLlm: action result status is error: an error occurred while running the action - Response validation failed (Error: [usageMetadata.candidatesTokenCount]: expected value of type [number] but got [undefined])
Response validation failed (Error: [usageMetadata.candidatesTokenCount]: expected value of type [number] but got [undefined]): ActionsClientLlm: action result status is error: an error occurred while running the action - Response validation failed (Error: [usageMetadata.candidatesTokenCount]: expected value of type [number] but got [undefined])
at ActionsClientLlm._call (/Users/$USER/git/kibana/x-pack/platform/packages/shared/kbn-langchain/server/language_models/llm.ts:112:21)
at async ActionsClientLlm._generate (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:375:29)
at async ActionsClientLlm._generateUncached (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:176:26)
at async ActionsClientLlm.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:34:24)
at async RunnableSequence.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/runnables/base.cjs:1291:27)
at async RunnableCallable.generate [as func] (/Users/$USER/git/kibana/x-pack/solutions/security/plugins/elastic_assistant/server/lib/attack_discovery/graphs/default_attack_discovery_graph/nodes/generate/index.ts:61:27)
at async RunnableCallable.invoke (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/utils.cjs:82:27)
at async RunnableSequence.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/runnables/base.cjs:1285:33)
at async _runWithRetry (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/retry.cjs:70:22)
at async PregelRunner._executeTasksWithRetry (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/runner.cjs:211:33)
at async PregelRunner.tick (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/runner.cjs:48:40)
at async CompiledStateGraph._runLoop (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/index.cjs:1018:17)
at async createAndRunLoop (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/index.cjs:907:17)
at ActionsClientLlm._generateUncached (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:176:37)
at ActionsClientLlm._generate (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:374:20)
at ActionsClientLlm._generateUncached (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:176:37)
at async ActionsClientLlm.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:34:24)
at async RunnableSequence.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/runnables/base.cjs:1291:27)
at async RunnableCallable.generate [as func] (/Users/$USER/git/kibana/x-pack/solutions/security/plugins/elastic_assistant/server/lib/attack_discovery/graphs/default_attack_discovery_graph/nodes/generate/index.ts:61:27)
at async RunnableCallable.invoke (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/utils.cjs:82:27)
at async RunnableSequence.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/runnables/base.cjs:1285:33)
at async _runWithRetry (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/retry.cjs:70:22)
at async PregelRunner._executeTasksWithRetry (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/runner.cjs:211:33)
at async PregelRunner.tick (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/runner.cjs:48:40)
at async CompiledStateGraph._runLoop (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/index.cjs:1018:17)
at async createAndRunLoop (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/index.cjs:907:17)
at ActionsClientLlm._generateUncached (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:142:51)
at CallbackManager.handleLLMStart (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/callbacks/manager.cjs:464:25)
at ActionsClientLlm._generateUncached (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:142:51)
at ActionsClientLlm._generateUncached (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:136:73)
at ActionsClientLlm._generateUncached (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:136:73)
at ActionsClientLlm._generateUncached (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:129:28)
at ActionsClientLlm.generate (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:281:25)
at ActionsClientLlm.generatePrompt (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:97:21)
at ActionsClientLlm.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/language_models/llms.cjs:34:35)
at RunnableSequence.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/runnables/base.cjs:1291:43)
at async RunnableCallable.generate [as func] (/Users/$USER/git/kibana/x-pack/solutions/security/plugins/elastic_assistant/server/lib/attack_discovery/graphs/default_attack_discovery_graph/nodes/generate/index.ts:61:27)
at async RunnableCallable.invoke (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/utils.cjs:82:27)
at async RunnableSequence.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/runnables/base.cjs:1285:33)
at async _runWithRetry (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/retry.cjs:70:22)
at async PregelRunner._executeTasksWithRetry (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/runner.cjs:211:33)
at async PregelRunner.tick (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/runner.cjs:48:40)
at async CompiledStateGraph._runLoop (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/index.cjs:1018:17)
at async createAndRunLoop (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/pregel/index.cjs:907:17)
at RunnableSequence.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/runnables/base.cjs:1285:70)
at raceWithSignal (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/utils/signal.cjs:4:30)
at RunnableSequence.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/runnables/base.cjs:1285:70)
at async RunnableCallable.generate [as func] (/Users/$USER/git/kibana/x-pack/solutions/security/plugins/elastic_assistant/server/lib/attack_discovery/graphs/default_attack_discovery_graph/nodes/generate/index.ts:61:27)
at async RunnableCallable.invoke (/Users/$USER/git/kibana/node_modules/@langchain/langgraph/dist/utils.cjs:82:27)
at async RunnableSequence.invoke (/Users/$USER/git/kibana/node_modules/@langchain/core/dist/runnables/base.cjs:1285:33)
```
</details>
The root cause of the validation error above was that models like Gemini 2.5 Pro were generating responses whose output tokens reached the `maxOutputTokens` limit sent in the request.
When this happens, the response from the model:
- Includes a `finishReason` of `MAX_TOKENS`
- Omits the `candidatesTokenCount` property from `usageMetadata`, which causes the validation error
as illustrated by the following (partial) response:
```json
{
"candidates": [
{
"content": {
"role": "model",
"parts": [
{
"text": ""
}
]
},
"finishReason": "MAX_TOKENS",
// ...
}
],
"usageMetadata": {
"promptTokenCount": 82088,
"totalTokenCount": 90280,
"trafficType": "ON_DEMAND",
"promptTokensDetails": [
{
"modality": "TEXT",
"tokenCount": 82088
}
],
"thoughtsTokenCount": 8192
},
"modelVersion": "gemini-2.5-pro-preview-03-25",
"createTime": "2025-05-06T18:05:44.475008Z",
"responseId": "eE8aaID_HJXj-O4PifyReA"
}
```
To resolve this issue, the `maxOutputTokens` request parameter is now only sent when it is explicitly specified via the Gemini connector's `invokeAI`, `invokeAIRaw`, and `invokeStream` APIs.
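The fix can be sketched as a conditional spread when building the request's generation config (a simplified shape; the real connector's payload is richer, and the function name here is an assumption):

```typescript
// Simplified generation config sent to Gemini.
interface GenerationConfig {
  temperature: number;
  maxOutputTokens?: number;
}

// Build the config: maxOutputTokens is only present in the serialized
// request when the caller explicitly passes it.
function buildGenerationConfig(
  temperature: number,
  maxOutputTokens?: number
): GenerationConfig {
  return {
    temperature,
    // Conditional spread: omit the key entirely when undefined
    ...(maxOutputTokens !== undefined ? { maxOutputTokens } : {}),
  };
}
```

When the argument is omitted, the key is absent from the request body, so Gemini applies its own model-specific default instead of a hard-coded `8192`.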
#### Alternatives considered
All of the Gemini models tested fail with errors like the following when a request specifies a `maxOutputTokens` value outside their model-specific supported range:
```
Status code: 400. Message: API Error: INVALID_ARGUMENT: Unable to submit request because it has a maxOutputTokens value of 4096000 but the supported range is from 1 (inclusive) to 65536 (exclusive). Update the value and try again.: ActionsClientLlm: action result status is error: an error occurred while running the action - Status code: 400. Message: API Error: INVALID_ARGUMENT: Unable to submit request because it has a maxOutputTokens value of 4096000 but the supported range is from 1 (inclusive) to 65536 (exclusive). Update the value and try again.
```
For example, passing a `maxOutputTokens` value of `32768` works for `Gemini 1.5 Pro 002` and `Gemini 2.5 Pro`, but fails for `Gemini 1.5 Flash` and `Gemini 1.5 Pro`:
| Model | Supported `maxOutputTokens` range |
|--------------------|------------------------------------|
| Gemini 1.5 Flash | 1 (inclusive) to 8193 (exclusive) |
| Gemini 1.5 Pro | 1 (inclusive) to 8193 (exclusive) |
| Gemini 1.5 Pro 002 | 1 (inclusive) to 32769 (exclusive) |
| Gemini 2.5 Pro | 1 (inclusive) to 65536 (exclusive) |
As an alternative to removing `maxOutputTokens` from requests, we could have made it a required parameter to the `invokeAI`, `invokeAIRaw`, and `invokeStream` APIs, but that would require callers of these APIs to know the correct model-specific value to provide.
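The rejected alternative would force every caller to maintain something like the following lookup table (hypothetical; limits are the highest valid values implied by the exclusive upper bounds in the table above):

```typescript
// Hypothetical model-to-limit map callers would need to maintain.
const MAX_OUTPUT_TOKEN_LIMITS: Record<string, number> = {
  'gemini-1.5-flash': 8192,
  'gemini-1.5-pro': 8192,
  'gemini-1.5-pro-002': 32768,
  'gemini-2.5-pro': 65535,
};

// Callers would have to clamp requested values per model before every
// request; unknown models would still risk a 400 INVALID_ARGUMENT.
function clampMaxOutputTokens(model: string, requested: number): number {
  const limit = MAX_OUTPUT_TOKEN_LIMITS[model];
  return limit === undefined ? requested : Math.min(requested, limit);
}
```

Keeping such a table current as new Gemini models ship is exactly the burden this PR avoids by omitting the parameter and letting the model apply its own default.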
#### Desk testing
This PR was desk tested:
- Interactively in Attack discovery with the following models:
| Model |
|--------------------|
| Gemini 1.5 Flash |
| Gemini 1.5 Pro |
| Gemini 1.5 Pro 002 |
| Gemini 2.5 Pro |
- Interactively in Attack discovery with non-Gemini models, e.g. a subset of the GPT and Claude family of models (to test for unintended side effects)
- Via evaluations using `Eval AD: All Scenarios` for the Gemini connectors above (and with the Claude / OpenAI models for regression testing)
- Interactively via the security assistant, to answer requests in tool-calling and non-tool-calling scenarios