Compute-Sharded Stream Processing for Cybersecurity Analytics | Abhishek Suman | Conf42 SRE 2026

Presenters: Abhishek Suman · Source: Conf42 SRE 2026
Outrunning the Adversary: Mastering Compute Sharded Stream Processing at Petabyte Scale 🚀
In the high-stakes world of cybersecurity, time is the only currency that truly matters. When a bad actor breaches a system, they don’t wait for your scheduled batch jobs to finish. They move with incredible speed, often completing lateral movement, harvesting credentials, and establishing persistence within mere minutes. I am Abhishek Suman, a Senior Software Engineer at Microsoft. My daily mission involves building the fault-tolerant, large-scale distributed systems that power real-time data pipelines and cybersecurity infrastructure. Today, I am diving deep into how we handle petabyte-scale telemetry using compute-sharded stream processing to close the detection gap once and for all. 🛡️ ...

March 19, 2026 · 4 min

Correlation Over Collection: A Layered Observability Framework | Khushboo Nigam | Conf42 SRE 2026

Presenters: Khushboo Nigam · Source: Conf42 SRE 2026
Unraveling the Chaos: Why Correlating Telemetry is the New Superpower for Cloud-Native Observability ✨
Hey tech enthusiasts! Khushboo Nigam, a Cloud Architect specializing in observability for cloud-native systems, recently shed light on a pervasive challenge facing SRE teams today. It’s not about collecting more data; it’s about making sense of the massive amounts of telemetry modern distributed systems generate. The core message is clear: Correlation over Collection. ...

March 19, 2026 · 5 min

FLEX2 Analytics in AMBR250 Upstream Biologics Workflows | Amogha Tenneti | Conf42 SRE 2026

Presenters: Amogha Tenneti · Source: Conf42 SRE 2026
From Lab Bench to High-Throughput: How SRE Principles Revolutionized Biologics Automation 🚀🔬
Good morning, everyone! Amogha Tenneti here, and I’m thrilled to share a fascinating journey with you. We often associate Site Reliability Engineering (SRE) with the world of software, protecting our digital systems from outages and ensuring seamless user experiences. But what if I told you that the very same SRE principles can unlock incredible reliability, compliance, and scalability in a highly regulated, hands-on environment like a biological process development lab? ...

March 19, 2026 · 5 min

Human-Governed Automation Loops for AI at Planet Scale | Suganya Nagarajan | Conf42 SRE 2026

Presenters: Suganya Nagarajan · Source: Conf42 SRE 2026
🚀 Beyond Blind Automation: The Power of Human-Governed AI Loops
In the high-stakes world of Site Reliability Engineering (SRE), we face a recurring dilemma: as systems scale, manual interventions become a bottleneck, yet blind automation remains a dangerous liability. I am Suganya Nagarajan, an engineering manager with a decade of experience in large-scale distributed systems. Today, I want to share a framework to bridge this gap: Human-Governed Automation Loops (HAL). This approach ensures that our AI systems remain reliable, accountable, and safe, even as they operate at breakneck speeds. ...

March 19, 2026 · 4 min

Program Leadership in AI-Enabled Platform Systems | Sonali Galhotra | Conf42 SRE 2026

Presenters: Sonali Galhotra · Source: Conf42 SRE 2026
🌐 Beyond the Dashboard: Why Reliability is an Organizational System Problem
In the world of Site Reliability Engineering (SRE), we often obsess over uptime, latency, error budgets, and system telemetry. While these metrics are vital, they don’t tell the whole story. According to Sonali Galhotra, a leader at the intersection of technical program leadership and platform engineering, the most critical reliability signals don’t always appear on a monitoring dashboard. Instead, they emerge from how an organization structures itself and where it chooses to invest. ...

March 19, 2026 · 4 min