From the course: Cloud Computing: BC/DR Best Practices

Unlock this course with a free trial

Join today to access over 24,700 courses taught by industry experts.

Machine Learning Operations (MLOps) and BC/DR

Machine Learning Operations (MLOps) and BC/DR

From the course: Cloud Computing: BC/DR Best Practices

Machine Learning Operations (MLOps) and BC/DR

- [Instructor] In this video, we're taking a close look at machine learning operations, or MLOps, and how they tie into business continuity and disaster recovery. MLOps is all about applying automation, tracking, and robust management across every stage of the machine learning lifecycle, which spans from data preparation and training through model deployment and real-time monitoring. The aim is to make developing and running AI systems more reliable, repeatable, and efficient at scale. This has a big impact on BC/DR strategy. Traditionally, business continuity and disaster recovery plans focused mostly on keeping data and applications available if outages or disasters hit. But with MLOps, what's at stake grows to include datasets, train models, pipeline definitions, code repositories, and the infrastructure where everything runs, which are often distributed across multiple cloud platforms or regions. Automation is one of the foundational practices in a resilient MLOps environment. By…

Contents