A new study of 27 senior infrastructure leaders finds that AI in observability has crossed the credibility threshold, but mostly as an alert engine. The teams running production want it to investigate, diagnose, and fix. Most of their tools cannot yet.
Adoption is no longer the question. 96% of respondents are at "Significant," "Mature," or "Growing" stages of AI in observability, and every team in the study is using AI for at least one workflow today. But only 37% trust AI to act beyond low-risk tasks. The capability gap is what these leaders raised again and again: they want AI that investigates, diagnoses, and fixes, not just flags and summarizes. That gap is what will decide where the next round of observability dollars goes.
96% of respondents are at "Significant," "Mature," or "Growing" stages of AI in observability. Only one team described themselves as "Early." But adoption clusters around the easiest, lowest-risk uses: pattern recognition in logs, anomaly detection, alert noise reduction. The harder work (root cause analysis across systems, correlating signals into a verdict, taking action without human review) is exactly where these leaders say their tooling falls short.
Log summarization is near-universal at 85%. Anomaly detection (63%) and alert correlation (52%) follow. Automated remediation and change impact analysis trail by more than 30 points.
Asked to pick the single workflow where AI delivers the biggest impact, 33% named root cause analysis. Anomaly detection (22%) and alert correlation (15%) round out the top three.
Asked in an open-ended question where AI falls short, leaders named four capabilities most often: end-to-end automation (30%), reliability and consistency (26%), accurate root cause analysis (22%), and business-context understanding (19%). Only 15% said there was no real gap.
The gap between AI's most-used capability (log summarization, 85%) and its most-impactful capability (root cause analysis, 33%) defines the shape of the problem in this dataset. Today's AI flags signals well, but respondents say it does not connect them. The capabilities respondents most often described as missing were end-to-end automation and accurate root cause analysis across systems: a tool that takes the signals AI already surfaces and produces a verdict an engineer can act on.
Until that capability exists, advances in alert noise reduction concentrate the human triage step rather than reduce it: AI gets faster at flagging, but the engineer is still the one who has to figure out why.
The gap is the distance between a tool that acts like a smoke detector and one that acts like a lead investigator. We have plenty of detectors; we need the investigator.
70% deliver AI through native platforms or a combination model. Native (built into the observability platform they already use) leads at 44%, with another 26% taking a combination approach. Only 22% rely primarily on bolt-on integrations. Just 7% built it in-house. The reasoning given by respondents is consistent across roles and regions: speed to value, reduced architectural complexity, and AI that already understands their telemetry.
Native AI built into the existing observability platform leads at 44%, with another 26% taking a combination approach. Just 7% of teams have built AI internally.
85% favor either a hybrid model or full customization. They want turnkey defaults they can extend or override. Only one respondent wanted out-of-the-box AI with predefined workflows. Only three said they had no strong preference.
Among the 41% of teams who say their monitoring has gaps or limited impact, the consequences are concrete: missed incidents, wasted engineer time, and false alarms that erode trust. Strong monitoring is the precondition for AI to be useful at all.
Three findings move together. 70% of teams choose native or combination delivery. 85% want hybrid or fully customizable AI. And 41% say monitoring quality has at least some impact on how their AI tools perform. The pattern points to one architectural preference: AI grounded in unified telemetry, with platform defaults that engineers can override. Bolt-on AI living outside the data plane is not where buyers are spending. Out-of-the-box AI with no customization is also not where they are spending.
For vendors, this is a positioning gate. The buyers signaling they are ready to spend more next year are not asking for more AI features. They are asking for AI that is already plugged into their telemetry and tunable to their environment.
Native AI tools are already integrated with our telemetry data, ensuring better security and more accurate insights without the high overhead of building and maintaining an internal LLM infrastructure.
Asked to walk through the most recent time AI fell short, whether it was a hallucinated root cause, a missed alert, or a false positive, leaders described real, quantifiable downstream cost. 59% named slower incident response and stretched MTTR. 41% pointed to wasted engineer time. 19% described customer-facing impact: checkout errors, two-hour service degradations, customer experience hits. These are not theoretical AI safety concerns. They are the receipts from the most recent incidents these teams worked through.
When forced to name the single hardest part, the answers cluster around four roughly equal frustrations: data quality, model drift, context and interpretability, and trust and accuracy. Notably, "data quality" and "signal-to-noise tuning" registered as distinct problems and were not grouped together by respondents.
From open-ended descriptions of recent AI-related incidents, six business-impact themes emerged. Slower incident response and wasted engineer time dominate. Customer-facing impact and trust erosion appeared often enough to be reportable.
The most consistent pattern across the incident stories: when AI is wrong, the response time gets longer, not shorter. One Director of Engineering described AI flagging a planned mitigation as a critical incident, pulling senior engineers into a 45-minute investigation of a non-issue. A false alarm pulls a team into a non-incident. A missed signal lets a real outage stretch into customer impact. Trust erosion sits underneath all of it. Three respondents described AI failures that had affected how willing their team was to rely on AI again, citing concerns like looking "incompetent" and ongoing engineering "stress."
Practitioners are now able to put hours, dollars, and customer impact against specific AI failures their teams have lived through. That concreteness is what makes the trust prerequisites in the next section so specific.
The AI hallucinated a root cause during a database spike — wasting hours of senior engineer time and delaying resolution, which directly impacted customer experience.
Asked which two qualities most matter for trusting AI more broadly, leaders gave a sharply differentiated answer. 70% named a proven track record of accurate, reliable outputs. Explainability followed at 52%. Consistent performance over time at 48%. Visibility into model behavior trailed at 26%. Regulatory oversight at 19%. The top two describe a single concept: show me the receipts.
Picking the top two prerequisites yields clear differentiation. Track record (70%) and explainability (52%) lead. Consistent performance over time follows at 48%. Visibility into model behavior and regulatory oversight are clearly secondary.
Despite high adoption, the autonomy bar is conservative. 63% limit AI to either recommendations only or low-risk well-defined actions such as auto-scaling and service restarts. Only 11% allow AI to operate autonomously across production workflows.
Splitting the same autonomy question by adoption stage exposes a sharp threshold. 100% of Early and Growing teams keep AI at "recommendations only" or "low-risk only." Among Significant and Mature teams, half have crossed into letting AI act on most operational tasks or operate autonomously. Earning the next bracket of autonomy is what differentiates an advanced AI-observability practice from a growing one.
The trust prerequisites question splits cleanly by maturity. Among Early and Growing teams, explainability leads at 71% — they want to see how AI thinks before they trust it. Among Significant and Mature teams, track record leads at 75% — they have lived with AI long enough to want evidence it works, not theory of how. The shift is the practitioner's journey from "I want to understand it" to "I want it to be right."
A second cut, by region, surfaces a directional signal worth flagging for the next fielding wave. 67% of EMEA respondents keep AI at low-risk actions only, compared to 29% in AMER. None of the six EMEA respondents allow AI to act independently on most operational tasks; 33% of AMER respondents do. With only six EMEA respondents in this round, this is a directional read rather than a finding, but the absolute zero on "act independently" warrants follow-up at scale.
Two questions answered together describe the entire trust ladder in this market. What earns trust? A proven track record, ahead of explainability. What does AI get to do today? Mostly recommendations and low-risk actions. The connective tissue is one word both groups used: proof. Several respondents proposed concrete metrics. "False positives under 5% across our monitoring stack, measured against known incidents we've manually validated." "A traceability log linking every AI action to specific telemetry markers." "99% success rate on automated low-risk remediations over six months without false positives."
The cross-cuts sharpen the picture. Earlier-stage teams want to understand AI; advanced teams want it to be right. Explainability dominates the trust list at 71% among Early and Growing teams; track record dominates at 75% among Significant and Mature teams. And the autonomy ceiling moves with maturity: not a single Early or Growing team has crossed past low-risk action, while half of Significant and Mature teams have.
In these respondents' own words, the answer to "how do we earn more autonomy?" runs through audit. The proof they describe ("this AI was right, this many times, in environments like ours, with this exact log") is what they say would unlock the next bracket of autonomy.
"Proof" would be a 99% success rate on automated low-risk remediations over a six-month period without any false positives.
Eighty-nine percent of respondents are increasing their AI-for-observability investment in the next year. 26% by more than 30%, another 63% by 10 to 30%. Asked where the dollars are going, the top answer was not "buy more AI tools." It was data quality, pipelines, and telemetry work to support AI, named by 52%. Training engineers (41%) and buying additional AI-capable observability platforms (37%) followed.
89% are increasing. 7% (two respondents) are holding steady or scaling back. One is too early to say. The direction is unambiguous. The question is what the money is being spent on.
Asked to pick the top two destinations, data quality, pipeline, and telemetry work leads at 52%. Training existing engineering staff is second at 41%. Vendor tools are third at 37%. Hiring AI and ML engineers, infrastructure costs, and internal AI tooling cluster together in a third tier.
Three drivers dominate the open-ended responses: executive and board interest (41%), pressure to do more with less (37%), and competitive pressure (30%). Specific incidents and reliability improvements are secondary motivators. Leadership-level interest is the most-cited driver in the open-ended responses.
The headline statistic, 89% increasing AI investment, is direction. What matters is where the money lands. The top two spending priorities (data quality at 52%, training at 41%) are not AI features at all. They are the foundation that makes AI work in the first place. Only 22% are spending on infrastructure and compute. Only 22% on building internal AI tooling. Only 19% on consolidating onto a single platform.
Leadership pressure shows up clearly. 41% of respondents cited executive or board-level interest as a top driver, ahead of cost-cutting (37%) and competitive pressure (30%). The dollars are going to the engineering substrate as often as to the AI ribbon on top of it. Vendors who can show their AI is grounded in clean telemetry, and who can help teams clean it up, are aligned with where these 27 buyers are spending next.
It's pressure to do more with less. We need AI to handle increasing system complexity and reduce the manual troubleshooting burden on our engineers.
Datadog Bits AI is built where your logs, metrics, traces, and incident data already live. That architectural choice is what 70% of respondents in this study chose for their own AI delivery, and what 85% said they prefer when given the option. When AI sits on a unified data plane, the leap from detector to investigator becomes a software problem rather than an architecture problem. Root cause across systems. Audit trails any senior engineer can review. Native, customizable, grounded.
Explore Datadog Bits AI →

Research conducted via conversational AI-led interviews with 27 senior infrastructure leaders (CTO, VP, Director-level and above) at companies of 500+ employees. Respondents were screened for direct involvement in observability budget approval and AI tooling decisions. The field instrument was a hybrid structured plus open-ended conversation guide containing 14 main questions and 13 follow-ups, all asked of every respondent regardless of branching logic.
The sample skews AMER (78%), with EMEA representation at 22% and no APAC respondents in this round. Industry composition is heavily concentrated in SaaS / Tech (89%), reflecting the panel quotas at recruitment for both supplier panels: Cint panelists were sourced primarily from Information Technology and Computer Software, and Pure Spectrum panelists were screened to "Science / Technology / Programming." Healthcare, Financial Services, and Retail are each represented by a single respondent. Vertical breadth is a known limitation and is being addressed in the next fielding wave with explicit quotas across additional industries.

All percentages are calculated as a share of the 27 unique respondents (not total mentions). Multi-mention questions, including workflows in use, business-impact themes, and investment destinations, may sum to more than 100% as respondents could cite multiple categories. Open-ended responses were coded into themes by a single analyst using pre-defined keyword categories fixed before coding began; each respondent counted once per primary theme.

Quotes are verbatim from respondents who passed both screener and post-fielding fraud review (42 of 69 completers were excluded for low-effort, off-topic, templated-AI, or paste-artifact responses; only the remaining 27 contribute to this report).
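For readers who want the counting convention made explicit, a minimal sketch follows; the respondent IDs and theme labels are hypothetical, not drawn from the survey file. A respondent who mentions the same theme twice counts once, and every percentage is taken over the 27 respondents rather than over total mentions, which is why multi-mention questions can exceed 100% in aggregate.

```python
from collections import Counter

N_RESPONDENTS = 27

# Hypothetical coded open-ends: respondent ID -> themes mentioned (duplicates possible).
coded = {
    "r01": ["slower_response", "wasted_time", "wasted_time"],
    "r02": ["customer_impact"],
    "r03": ["slower_response", "customer_impact"],
}

# Each respondent counts at most once per theme, so a multi-mention question can sum past 100%.
theme_counts = Counter(theme for themes in coded.values() for theme in set(themes))
percentages = {theme: round(100 * count / N_RESPONDENTS) for theme, count in theme_counts.items()}
print(percentages)  # e.g. {'slower_response': 7, 'wasted_time': 4, 'customer_impact': 7} (order may vary)
```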
Questions about data quality and signal-to-noise tuning are reported as distinct categories rather than grouped together, reflecting feedback from the prior round. The single-select trust prerequisites question was changed to a top-2 multi-select to surface meaningful differentiation; the prior round's flat distribution would not have supported a clear narrative.