How Mezmo Cuts AI Observability Costs by 90% With Context Engineering | Tucker Callaway

Tucker Callaway, CEO of Mezmo, explains how context engineering achieves 90% cost savings in AI observability without training models. Learn about headless observability.

AI promises to transform observability and free SREs from endless incident firefighting. But most approaches demand lengthy model training, massive data volumes, and costs that spiral as quickly as the problems they’re meant to solve. Tucker Callaway, CEO of Mezmo, has a different answer: stop training models and start engineering context.

At KubeCon and CloudNativeCon in Atlanta, Callaway outlined how Mezmo is challenging the conventional wisdom around AI-driven observability. Rather than following the industry trend of training custom models on customer data, Mezmo takes what Callaway calls a “context engineering” approach. The company processes, embeds, and vectorizes telemetry data offline before feeding it to foundation models. This preprocessing dramatically reduces the volume of data sent to the LLM—by 90%—while simultaneously improving accuracy and speed.

“We looked at the ‘train the model’ approach, and it just took too long, and the accuracy wasn’t there, and the cost was too high,” Callaway explained. “We scrapped that approach about six months ago and started looking at what are the bottlenecks inside of the LLM that are causing accuracy problems.”

The result is an AI SRE solution that customers can spin up in five minutes without training, without sharing their data for model fine-tuning, and at a fraction of the cost. Mezmo is so confident in the efficiency gains that the company plans to roll out its AI SRE agent to all 3,500 weekly active users at no additional charge.

Detection, Diagnosis, Remediation

Callaway is particular about terminology. He doesn’t love the term “AI SRE” because it suggests replacing a multifaceted job role with software. Instead, he frames Mezmo’s solution around three core functions: detection, diagnosis, and remediation of incidents.

“Our focus is really on the diagnosis, because we believe that you really have to deeply trust the diagnosis if we’re ever going to get to automation of the remediation and closing that full loop,” Callaway said. The goal isn’t to eliminate SREs but to free them from incident management minutiae so they can focus on designing, scaling, and building systems.

By offloading the majority of inference work—what Callaway describes as 80% deterministic processing—Mezmo drastically reduces the probabilistic nature of LLMs. This minimizes hallucinations, context confusion, and context poisoning. The model spends its attention on output and accuracy rather than parsing massive volumes of raw telemetry data.

The Path to Headless Observability

When asked about the future of observability, Callaway painted a provocative picture. He predicts the industry will move toward “headless observability” where dashboards and investigatory UIs become largely irrelevant.

“Today, the process is we identify an incident, and then we go through an investigative process in a UI trying to find the root cause,” he said. “All that’s going to go away in the years to come. There’s really no purpose in having this visualization layer in observability.”

Instead, analysis capabilities will happen agentically, and remediation will occur automatically. This shift is accelerated by parallel disruption in how applications are built. As AI-driven coding and agentic development transform the application layer, the combination of smarter apps and autonomous observability could arrive faster than most people expect.

Cost, Complexity, and the Data Problem

Kubernetes and cloud environments are notorious for complexity and cost overruns. Callaway acknowledged that left unchecked, AI observability could make both worse. The answer, he argues, is sophisticated data management.

Mezmo’s real-time data pipelining foundation allows it to structure, normalize, and focus on what Callaway calls “active telemetry”—the data needed to drive outcomes. Everything else gets archived in cold storage like an S3 bucket. This agentic data engineering happens behind the scenes, keeping the overall system simple and cost-effective.

“We can make this very complex, or we can make this very simple,” Callaway said. “The answer to that is really a data management problem.”

Customers and Feedback

Mezmo has been working with design partners and beta testers ahead of the KubeCon announcement. Callaway said the feedback has centered on one thing: speed to value. Customers struggled with the long onboarding cycles and training requirements of traditional AI observability tools. Mezmo’s approach, which requires no training and delivers value within an hour, has been transformative.

“People can spin it up in a morning and within an hour be getting value out of it and finding new outcomes very easy to get to,” Callaway said.

The solution is available through Mezmo’s UI, via MCP server integration, and within native customer environments. The company is not targeting any specific industry or workload, focusing instead on the SRE role and the universal challenge of managing complex systems efficiently.

Tucker Callaway’s vision for observability isn’t just incremental improvement. It’s a fundamental rethinking of how telemetry data is processed, how intelligence is applied, and what role humans play in keeping modern infrastructure running. If context engineering delivers on its promise, the dashboards SREs spend hours staring at today may soon become relics of a different era.

Next news
You're viewing our latest news item.
Previous news
You're viewing our oldest news item.
How Mezmo Cuts AI Observability Costs by 90% With Context Engineering | Tucker Callaway
Code Story: Insights from Startup Tech Leaders with Mezmo's Tucker Callaway
2026 Observability Predictions - Part 3
2026 Observability Predictions - Part 1
How AI-Driven Observability Is Transforming SRE: Insights from Mezmo CEO Tucker Callaway
2026: The End of the Dashboard as We Know It?
The Importance of Context Engineering in the AI Era
Mezmo: Named One of The Top 50 Software Companies of 2025
Why Synthetic Tracing Delivers Better Data, Not Just More Data
Why Agentic SREs Require Active Telemetry in Kubernetes
5 Startups Defining AI SRE
Mezmo Launches AI SRE Agent for Root Cause Analysis
AI-Driven Observability with Tucker Callaway | The Software With Podcast
Mezmo CEO Tucker Callaway on Active Telemetry, Context Engineering, and the Fastest AI SRE for Kubernetes | 10KMedia Podcast
Mezmo Launches Fast & Precise AI SRE for Kubernetes Ahead of KubeCon
Mezmo Wins 2025 Digital Innovator Award from Intellyx
Mezmo Announces Cost Optimization Workflow to Reduce Observability Spend for Datadog Users
Mezmo Disrupts Market by Reducing Observability Cost Structure by 90%
Building trust in telemetry data [Q&A]
2025 Observability Predictions - Part 1
Mezmo Simplifies Management of Telemetry Data to Reduce Observability Costs
At KubeCon/CloudNativeCon 2024, AI hype gives way to real application concerns
Mezmo Unveils Mezmo Flow for Guided Data Onboarding and One-Click Log Volume Optimization
Mezmo Flow Released
What’s new from KubeCon + Cloud Native Con North America 2024
Mezmo Unveils Mezmo Flow for Guided Data Onboarding and One-Click Log Volume Optimization - Yahoo Finance
Real-time Analytics News for the Week Ending November 16
Analytics and Data Science News for the Week of November 15; Updates from Alteryx, DataRobot, ThoughtSpot & More
Modern Observability Through Application Development
Mezmo Unveils Mezmo Flow for Guided Data Onboarding and One-Click Log Volume Optimization
Mezmo CEO Tucker Callaway Shares Observability Insights and KubeCon + CloudNativeCon 2024 Plans
Telemetry Data: The Puzzle Pieces of Observability
Q&A with Tucker Callaway, CEO of Mezmo
Mezmo Makes Inc. 5000’s List of Fastest Growing Companies in the Nation for Third Consecutive Year
7 Ways Telemetry Pipelines Unlock Data Confidence
The 2024 SD Times 100: 'Best in Show' in Software Development
Mezmo Hires Former StackHawk, New Relic Leader as Vice President of Product
Inside the VP of Sales' Journey: Financial Software to AI Startups - Craig McAndrews Spills it all!
Mezmo: Adding In-Stream Alert Capabilities to Telemetry Pipeline Platform
An IT Manager's (Re)View of the RSA Conference
Real-time Analytics News for the Week Ending May 11
Mezmo Adds Industry-First Stateful Processing in Telemetry Pipelines
SalesTechStar Interview with Craig McAndrews, Vice President of Sales at Mezmo
Mezmo Ranks No. 82 on Inc. Magazine’s List of the Pacific Region’s Fastest-Growing Private Companies
How To Break Down Silos To Get More Benefit From Your Data
Mezmo Bolsters Sales Leadership With New Hires From Chef and Apptio
How Metric Normalization Enhances Data Observability
KubeCon 2023: Telemetry and Data Management
Telemetry Data’s Role in Cybersecurity – Tucker Callaway – Enterprise Security Weekly
Breaking data silos between observability and security empowers organizations
2024 Application Performance Management Predictions - Part 3: Observability
Data Management News for the Week of November 10; Updates from AWS, Monte Carlo, Satori & More
Real-time Analytics News for the Week Ending November 11
At KubeCon NA 2023, finding cloud independence on the edges of Kubernetes
Mezmo Introduces Data Profiling and Responsive Telemetry Pipelines for Kubernetes
Data Profiling & Responsive Telemetry Pipelines For Kubernetes | Mezmo
KubeCon: GKE Enterprise gets release date, Mezmo adds data profiling feature, and more
Data Profiling & Responsive Telemetry Pipelines For Kubernetes | Mezmo
Data Profiling & Responsive Telemetry Pipelines For Kubernetes | Mezmo
Optimize Your Observability Spending in 5 Steps
Take Control of Your Kubernetes Telemetry Data
The Role of Observability Engineers in Managing Complex IT Systems
Mezmo Launches Welcome Pipeline to Unlock Kubernetes Insights Faster
Mezmo Ranks #1,386 on Inc. 5000’s List of Fastest Growing Companies in the Nation
Mezmo Simplifies Management of DevOps Telemetry Data
Mezmo Empowers Enterprises to Extract Business Insights from Telemetry Data
How DevOps Teams Can Manage Telemetry Data Complexity
Mezmo Wins the 2023 Digital Innovator Award from Intellyx
Tucker Callaway, Mezmo | RSA Conference 2023
Mezmo: Cloud Native Telemetry Pipeline
Mezmo Adds Free Community Plan for Managing Observability Data
Mezmo Announces Free Access to Telemetry Pipeline
Tame Telemetry Data With Mezmo Observability Pipeline
Mezmo Named 2023 Log Analytics Solution of the Year In Data Breakthrough Awards
Down the Observability Pipeline with Mezmo
How Developers, SRE Teams, and Security Engineers Use Telemetry Data
Data Pipeline Feeds IT's Observability Beast
How to Maximize Telemetry Data Value With Observability Pipelines
Mezmo Ranks #53 on Inc. Magazine’s List of Fastest-Growing Companies in the Pacific Region
Mezmo 2023 Predictions: More Organizations Adopt OpenTelemetry
Understanding Observability Data's Impact Across an Organization
Solutions Review Names 6 Data Observability Vendors to Watch, 2023
DevSecOps Accelerates Incident Detection, Response Efforts
2023 Application Performance Management Predictions - Part 3
Mezmo-Harris Poll Report Explores the Impact of Observability Data
Mezmo Wins Intellyx 2022 Digital Innovator Award
Mezmo Ranked No. 164 on Deloitte Technology Fast 500
Mezmo Wins 2022 Reworked IMPACT Award
Mezmo Unveils Observability Pipeline to Enhance the Value of Data
Launching a podcast? Try these 14 tips for greater exposure
DevSecOps Expedites Incident Detection and Response Time
Mezmo Named A Fastest Growing Company On Inc. 5000
DevSecOps Adoption Lags Despite Incident Detection Impact
Implementing DevSecOps Means Fewer Incidents
DevSecOps Reduces Security Incidents Research Finds
What is challenging successful DevSecOps adoption?
Fewer than one-quarter of organizations have a DevSecOps strategy
DevSecOps delivers significant results but take up remains low
DevSecOps adoption is low but packing a punch in user organizations
DevSecOps Drives Results, ESG Research Finds