Human-Governed Automation Loops for AI at Planet Scale | Suganya Nagarajan | Conf42 SRE 2026

Presenters Suganya Nagarajan Source Conf42 SRE 2026 🚀 Beyond Blind Automation: The Power of Human-Governed AI Loops In the high-stakes world of Site Reliability Engineering (SRE), we face a recurring dilemma: as systems scale, manual interventions become a bottleneck, yet blind automation remains a dangerous liability. I am Suganya Nagarajan, an engineering manager with a decade of experience in large-scale distributed systems. Today, I want to share a framework to bridge this gap: Human-Governed Automation Loops (HAL). This approach ensures that our AI systems remain reliable, accountable, and safe, even as they operate at breakneck speeds. ...

March 19, 2026 · 4 min

Reducing On-Call Pain in Hybrid Platforms | Shruthi Rajashekar | Conf42 SRE 2026

Presenters Shruthi Rajashekar Source Conf42 SRE 2026 Unifying the Hybrid Cloud: How VM Service is Revolutionizing VM and Container Management 🚀 Hey tech enthusiasts! Shruthi Rajashekar, an engineering manager at Broadcom, is here to shed some light on a game-changer for hybrid cloud environments. For the past decade at VMware Broadcom, Shruthi has been instrumental in developing foundational technologies like vMotion and VM service, bridging the gap between traditional virtualization and modern cloud-native infrastructure. Today, she’s diving deep into how a unified control plane for virtual machines (VMs) and container-based workloads can be achieved using VM service, a VCF offering, and why this approach is absolutely critical for platforms demanding high availability and operational excellence. ...

March 19, 2026 · 6 min

The Failures You Don’t See on Dashboards | Abhimanyu Narwal | Conf42 SRE 2026

Presenters Abhimanyu Narwal Source Conf42 SRE 2026 The Silent Killers: Unmasking Failures Beyond Your Dashboards 🕵️‍♀️ We’ve all been there. Alarms blaring, graphs spiking, the adrenaline rush of an incident. As engineers, we excel at fighting outages, diving into the chaos, and emerging victorious. But what if the most expensive reliability failures aren’t the ones that make noise? What if they’re the quiet, insidious ones that slow us down, all while our dashboards gleam with a reassuring green? ...

March 19, 2026 · 5 min

When the Stack Lies: Root Cause in Distr. Systems | Daniel Raskin & David McNerney | Conf42 SRE 2026

Presenters Daniel Raskin David McNerney Source Conf42 SRE 2026 🕵️‍♂️ When the Stack Lies: Unmasking Root Cause in Distributed Systems In the high-stakes world of modern enterprise technology, the “stack” often tells half-truths. As we migrate from classic three-tiered architectures to hyper-distributed environments, the complexity of our systems is outstripping our ability to monitor them. Daniel Raskin (CMO at Vertana) and David McNerney (Director of Product Management at Vertana) recently sat down to dissect why traditional observability is failing and how a system-aware approach can save the day. ...

March 19, 2026 · 5 min

SW Design, Architecture & Clarity at Scale • Sam Newman, Jacqui Read & Simon Rohrer

Presenters Sam Newman Jacqui Read Simon Rohrer Source GOTO podcast Unlocking Software Design: More Than Just Code – It’s a Conversation! 🚀💡 Ever felt like “software design” is this elusive, shape-shifting concept that everyone talks about but no one quite defines the same way? You’re not alone! In a recent GoTo podcast panel, industry luminaries Sam Newman, Jacqui Read, and Simon Rohrer dove deep into the heart of software design, its intricate dance with architecture, and how we can make it a more collaborative, effective, and less-overlooked part of our daily work. Prepare to rethink how you approach building software! ...

February 27, 2026 · 9 min