From nothing to everything: One single alert to managing datacenters with Crusoe AI

Jeff Ferland (Staff Site Reliability Engineer, Crusoe) traces the full arc of Crusoe's infrastructure automation story, from a single Temporal Workflow that opened a JIRA ticket when a GPU dropped off the PCIe bus, to a persistent Lifecycle Workflow that supervises every server in the fleet from racking to workload deployment. The talk covers what broke along the way and how they arrived at an autonomous system capable of managing a datacenter from a single alert.

Build invincible apps

Ready to learn why companies like Netflix, Doordash, and Stripe trust Temporal as their secure and scalable way to build and innovate?