Inside the Voyage AI Platform

Presenters Angel Lim Andrew Gaut Source MongoDB.local San Francisco 2026 🚀 Scaling AI Inference: A Deep Dive into the Boyi Inference Platform 🤖 The demand for real-time AI is exploding. But serving those models – especially complex embedding and re-ranking models – at scale while maintaining lightning-fast response times is a monumental challenge. Angel Lim and Andrew Gaut, engineers from the Boyi inference and research teams (formerly at Voyage, now part of a larger organization), recently shared their insights into how they built the Boyi inference platform to tackle this head-on. Let’s explore the key strategies and technologies they’re using to deliver high-performance AI in production. ...

January 22, 2026 · 4 min