Reliability-First Architectures for AI Analytics | Ajay Srinivas Kiran Gemidi | Conf42 SRE 2026

Presenters Ajay Srinivas Kiran Gemidi Source Conf42 SRE 2026 Reliability First: Building AI Systems That Don鈥檛 Break the Bank (or the Internet!) 馃殌 Hey tech enthusiasts! Ever wondered how those lightning-fast fraud detection systems or those eerily accurate recommendation engines actually work without bringing everything crashing down? We鈥檙e diving deep into the world of reliable AI with insights from Ajay Srinivas and Kiran Gemidi, system engineers with a combined 20 years of experience keeping large-scale production environments humming. ...

March 19, 2026 路 6 min

Migration from On-Prem Messaging System to The Cloud: What, How and Why | Ran Tao | Conf42 SRE 2026

Presenters Ran Tao Source Conf42 SRE 2026 Migrating Your Messaging Systems to the Cloud: A Journey to Scalability and Efficiency 馃殌 Ever felt the growing pains of your on-premise messaging systems? You鈥檙e not alone! Today, we鈥檙e diving deep into the exciting world of cloud migration for messaging systems, exploring how to ditch the on-prem headaches and embrace the power of the cloud. Get ready to unlock scalability, reduce costs, and focus on what truly matters: your business logic! ...

March 19, 2026 路 6 min

Operationalizing LLMs at Scale in Grocery Retail | Sanjay Basu | Conf42 SRE 2026

Presenters Sanjay Basu Source Conf42 SRE 2026 From Experiment to Mission-Critical: Operationalizing LLMs at Scale in Retail with Sanjay Basu Hey tech enthusiasts! Ever wonder what happens when cutting-edge AI moves beyond the lab and into the bustling world of retail? It鈥檚 a game-changer, but not without its unique challenges. Today, we鈥檙e diving deep with Sanjay Basu from TCS, a brilliant mind in the retail and AI vertical, as he unravels the complexities of operationalizing Large Language Models (LLMs) at scale, especially within the dynamic realm of cognitive commerce. ...

March 19, 2026 路 6 min

Risk-Aware Decision-Making for Scalable Reliability Systems | Oreoluwa Omoike | Conf42 SRE 2026

Presenters Oreoluwa Omoike Source Conf42 SRE 2026 馃殌 AI-Driven Risk-Aware Decisions: Transforming Scalable Reliability Systems Hey tech enthusiasts! Ever found yourself jolted awake at 2 AM by an alert, only to spend precious minutes (or more!) hunting down a root cause that feels eerily familiar? Oreoluwa Omoike, who has spent years building reliability systems at scale, shared some groundbreaking insights about moving beyond reactive problem-solving to smarter, proactive risk management. This isn鈥檛 just about detecting issues; it鈥檚 about making intelligent decisions before things go south. Let鈥檚 dive into the world of AI-driven risk-aware decision-making for scalable reliability systems! ...

March 19, 2026 路 5 min

Scale or Fail as Spotify's Growth Exposed the Abstraction Paradox | Stuart Clark | Conf42 SRE 2026

Presenters Stuart Clark Source Conf42 SRE 2026 Scale or Fail: How Spotify Solved the Abstraction Paradox 馃殌 In the high-stakes world of software engineering, we often treat abstraction as our greatest ally. We build layers to hide complexity, simplify workflows, and help our teams move faster. But what happens when those very abstractions become your biggest enemy during a 3:00 a.m. critical incident? Stuart Clark, Senior Developer Advocate at Spotify, recently shared a compelling story about how Spotify nearly fell into the abstraction trap and how they engineered their way out of it. This isn鈥檛 just a story about code; it鈥檚 about scaling operational knowledge across thousands of engineers. ...

March 19, 2026 路 4 min