kibana/dev_docs/tutorials
Garrett Spong e57663a0cf
[Security Assistant] Adds BuildKite pipeline for running Security GenAI Evaluations weekly (#215254)
## Summary

Introduces a new `security_solution/gen_ai_evals.yml` BuildKite pipeline
for automatically running our Assistant and Attack Discovery evaluation
suites weekly.

### To Run Locally:
Ensure you are authenticated with vault for LLM + LangSmith creds:

> See [internal
docs](https://github.com/elastic/infra/blob/master/docs/vault/README.md#login-with-your-okta)
for setup/login instructions.

Fetch Connectors and LangSmith creds:

> [!NOTE]
> In discussion with @elastic/kibana-operations, the ci-prod secrets vault
was preferred, so we cannot self-manage the secrets. To test this locally,
you can grab the secrets by following the instructions in this [paste
bin](https://p.elstc.co/paste/q7k+zYOc#PN0kasw11u2J0XWC2Ls5PMNWreKzKTpgWA1wtsPzeH+).

```
cd x-pack/test/security_solution_api_integration
node scripts/genai/vault/retrieve_secrets.js
```


Navigate to the API integration directory, load the env vars, and start
the server:
```
cd x-pack/test/security_solution_api_integration
export KIBANA_SECURITY_TESTING_AI_CONNECTORS=$(base64 -w 0 < scripts/genai/vault/connector_config.json) && export KIBANA_SECURITY_TESTING_LANGSMITH_KEY=$(base64 -w 0 < scripts/genai/vault/langsmith_key.txt)
yarn genai_evals:server:ess
```

Then in another terminal, load vars and run the tests:
```
cd x-pack/test/security_solution_api_integration
export KIBANA_SECURITY_TESTING_AI_CONNECTORS=$(base64 -w 0 < scripts/genai/vault/connector_config.json) && export KIBANA_SECURITY_TESTING_LANGSMITH_KEY=$(base64 -w 0 < scripts/genai/vault/langsmith_key.txt)
yarn genai_evals:run:ess
```
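
Both terminals repeat the same `export` line, so a small shell function can hold it in one place. The following is a minimal sketch, not part of this PR: the function name `load_genai_env` is hypothetical, and piping through `tr -d '\n'` is an assumption made here to get single-line output without relying on `base64 -w 0`, which some non-GNU `base64` builds may not accept. The file names and variable names are the ones used in the commands above.

```shell
# Hypothetical helper (not part of the PR): exports both secrets as
# single-line base64 strings, mirroring the export commands above.
load_genai_env() {
  # Directory holding the files written by retrieve_secrets.js;
  # defaults to the relative path used in the commands above.
  genai_dir="${1:-scripts/genai/vault}"

  # Fail early with a clear message if either secret file is missing.
  for genai_file in "$genai_dir/connector_config.json" "$genai_dir/langsmith_key.txt"; do
    if [ ! -r "$genai_file" ]; then
      echo "missing secret file: $genai_file" >&2
      return 1
    fi
  done

  # Assumption: stripping newlines with tr replaces the GNU-only `-w 0` flag.
  KIBANA_SECURITY_TESTING_AI_CONNECTORS="$(base64 < "$genai_dir/connector_config.json" | tr -d '\n')"
  KIBANA_SECURITY_TESTING_LANGSMITH_KEY="$(base64 < "$genai_dir/langsmith_key.txt" | tr -d '\n')"
  export KIBANA_SECURITY_TESTING_AI_CONNECTORS KIBANA_SECURITY_TESTING_LANGSMITH_KEY
}
```

After defining the function in each terminal, run `load_genai_env` from `x-pack/test/security_solution_api_integration` before the `yarn` command. A non-zero return means the secrets have not been retrieved yet.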

### To manually run on BuildKite:
Navigate to
[BuildKite](https://buildkite.com/elastic?filter=ftr-security-solution-gen-ai-evaluations)
and run the `ftr-security-solution-gen-ai-evaluations` pipeline.

### To manually run on BuildKite for a specific PR:
In `.buildkite/ftr_security_stateful_configs.yml`, temporarily move the
`genai/evaluations/trial_license_complete_tier/configs/ess.config.ts`
line down to the `enabled` section. I will see if we can do this without
requiring a commit. @elastic/kibana-operations, is it possible to set a
BuildKite env var that FTR tests can read when a specific GitHub
label is added to the PR? I.e., can I create a `SecurityGenAI:Run Evals`
label that, when added, runs this suite as part of the build?

> [!NOTE]
> Currently the connector secrets only include `gpt-4o` and
`gpt-4o-mini`. We are waiting on the finalized list with credentials from
@jamesspi and @peluja1012; ops can then update the secrets using the
scripts included in this PR.

---------

Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>
Co-authored-by: Patryk Kopycinski <patryk.kopycinski@elastic.co>
2025-04-24 11:46:57 -06:00

| Name | Last commit | Date |
|---|---|---|
| data | SKA: Update broken references and URLs (#206836) | 2025-01-28 03:32:48 +00:00 |
| performance | [Performance] Refactor TTFMP query from, to fields (#213911) | 2025-03-20 11:40:24 +01:00 |
| screenshotting | [Screenshotting] Organize dev docs for screenshotting/chromium (#198100) | 2024-10-30 11:55:12 -05:00 |
| advanced_settings.mdx | [Advanced settings] Update migration guidance in advanced settings tutorial (#208010) | 2025-01-29 14:42:29 +01:00 |
| apm_ui.png | Debugging with apm - fixes and tutorial (#127892) | 2022-03-22 14:56:37 +02:00 |
| building_a_kibana_distributable.mdx | [api-docs] follow the correct schema for frontmatter (#138348) | 2022-08-10 17:17:50 -05:00 |
| ci.mdx | [Security Assistant] Adds BuildKite pipeline for running Security GenAI Evaluations weekly (#215254) | 2025-04-24 11:46:57 -06:00 |
| configuring_cross_cluster_search.mdx | [dev docs] How to set up cross cluster search locally (#182025) | 2024-04-30 12:46:21 -05:00 |
| data_views.mdx | [api-docs] follow the correct schema for frontmatter (#138348) | 2022-08-10 17:17:50 -05:00 |
| debugging.mdx | [Dev Docs] Add VS Code configurations to Dev Docs Debugging Tutorial (#212807) | 2025-03-06 10:14:19 -05:00 |
| debugging_fips_test_failures.mdx | SKA: Relocate "platform" packages that remain on /packages (#208704) | 2025-02-24 11:03:30 +00:00 |
| development_windows.mdx | [api-docs] follow the correct schema for frontmatter (#138348) | 2022-08-10 17:17:50 -05:00 |
| endpoints.mdx | Deprecated authRequired in favor of security.authc.enabled (#202414) | 2024-12-10 11:24:41 +01:00 |
| expressions.mdx | [api-docs] follow the correct schema for frontmatter (#138348) | 2022-08-10 17:17:50 -05:00 |
| fips_test_failure_license_override_example.png | Adding documentation to aid team in triaging FIPS Test Failures (#203008) | 2025-01-14 07:53:28 -05:00 |
| generating_oas_for_http_apis.mdx | [OAS] More guidance about publishing docs (#217524) | 2025-04-09 12:39:25 +02:00 |
| kibana_page_template.mdx | Upgrade EUI to v63.0.6 - updates to new EuiPageTemplate and deprecates old page components (#139524) | 2022-09-07 15:35:23 -05:00 |
| saved_objects.mdx | Document that index/enable: false SO mappings is an anti-pattern (#201969) | 2024-11-28 17:50:34 -06:00 |
| submit_a_pull_request.mdx | [ci] Remove duplicate backport action (#190643) | 2024-08-20 08:23:25 -05:00 |
| testing_plugins.mdx | Updates internal dev docs for Saved Objects (#178058) | 2024-03-07 08:16:28 -07:00 |
| versioning_http_apis.mdx | [dev-docs] Adds section explaining internal and public APIs (#195840) | 2024-10-11 16:16:47 +00:00 |
| versioning_interfaces.mdx | [Versioned APIs] Versioned interfaces dev docs guide (#150705) | 2023-03-01 16:45:43 +01:00 |