From the course: Cloud Computing: BC/DR Best Practices
Unlock this course with a free trial
Join today to access over 24,700 courses taught by industry experts.
Machine Learning Operations (MLOps) and BC/DR
From the course: Cloud Computing: BC/DR Best Practices
Machine Learning Operations (MLOps) and BC/DR
- [Instructor] In this video, we're taking a close look at machine learning operations, or MLOps, and how they tie into business continuity and disaster recovery. MLOps is all about applying automation, tracking, and robust management across every stage of the machine learning lifecycle, which spans from data preparation and training through model deployment and real-time monitoring. The aim is to make developing and running AI systems more reliable, repeatable, and efficient at scale. This has a big impact on BC/DR strategy. Traditionally, business continuity and disaster recovery plans focused mostly on keeping data and applications available if outages or disasters hit. But with MLOps, what's at stake grows to include datasets, train models, pipeline definitions, code repositories, and the infrastructure where everything runs, which are often distributed across multiple cloud platforms or regions. Automation is one of the foundational practices in a resilient MLOps environment. By…
Contents
-
-
-
-
-
(Locked)
Identifying processes for backup2m 53s
-
(Locked)
BC/DR for platform instances2m 40s
-
(Locked)
BC/DR for other cloud services2m 14s
-
(Locked)
BC/DR for AI/ML inference processing3m 8s
-
(Locked)
Machine Learning Operations (MLOps) and BC/DR3m 19s
-
(Locked)
Challenge: BC/DR AI processing for Atlas Inc.1m 56s
-
(Locked)
Solution: BC/DR AI processing for Atlas Inc.2m 18s
-
(Locked)
-
-