From the course: AWS Certified Machine Learning Engineer Associate (MLA-C01) Cert Prep

Unlock this course with a free trial

Join today to access over 24,700 courses taught by industry experts.

Amazon SageMaker Serverless Inference

Amazon SageMaker Serverless Inference

- [Instructor] Hello, guys, and welcome. So, in today's session, we are going to talk about Amazon SageMaker Serverless Inference. So, Amazon SageMaker Serverless Inference, it's actually a purpose-built inference which allows you to deploy machine learning models for inference without the need to either configure or manage the underlying infrastructure, and this totally simplifies the deployment process and reduces the operational overhead. It's ideal for workloads with idle periods, so it's actually ideal for workloads that have the idle periods between traffic bursts, and could tolerate the cold starts. So, for example, it's useful for applications with unpredictable traffic, such as certain web applications or batch processing tasks. So, it's actually good for scaling, so if there are no requests, then the serverless endpoint scales down to zero, making it a cost-effective solution, so you only pay for what you use, and this significantly reduces the costs during the idle periods,…

Contents