From the course: AWS Certified Machine Learning Engineer Associate (MLA-C01) Cert Prep
Unlock this course with a free trial
Join today to access over 24,700 courses taught by industry experts.
Hands-on learning: Loading data into model training resource
From the course: AWS Certified Machine Learning Engineer Associate (MLA-C01) Cert Prep
Hands-on learning: Loading data into model training resource
(somber music) - [Narrator] Hello guys and welcome again. In today's lesson, we're going to walk you through how you would load the training data into the model training resource. So when you would create a training job, you would specify the location of the training data sets in a data storage that you want, and also the data input mode for this job. For Amazon SageMaker AI, it supports the Amazon S3, Amazon Elastic file system, which is the EFS and the Amazon FSx for Lustre. You can then choose one of those input modes in order to stream the dataset in real time, or even download the whole dataset at the start of the training job. But there is one note you would need to take care about that your dataset must reside in the same AWS region as your training job. We've mentioned that there are different modes for loading the data into the model training resource. The first mode that we have is the default Amazon S3 file mode. So SageMaker in this mode downloads the entire dataset to the…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
(Locked)
Intro: Data storage and ingestion1m 10s
-
(Locked)
The three Vs1m 54s
-
(Locked)
Types of data3m 27s
-
(Locked)
Batch versus streaming1m 32s
-
(Locked)
OLTP vs. OLAP2m 11s
-
Data formats4m 10s
-
(Locked)
Data modeling3m 19s
-
(Locked)
Data warehouses1m 17s
-
(Locked)
Data lakes3m 1s
-
(Locked)
Data ingestion scenarios3m 5s
-
(Locked)
Amazon FSx4m 9s
-
(Locked)
Hands-on learning: Loading data into model training resource8m 24s
-
(Locked)
Amazon Kinesis Data Streams9m 18s
-
(Locked)
Hands-on learning: Create a data stream3m 30s
-
(Locked)
Using EFS with Lambda1m 25s
-
(Locked)
Hands-on learning: Create an AWS Lambda function to consume a Kinesis Data Stream3m 50s
-
(Locked)
Amazon Kinesis Client Library (KCL)2m 52s
-
(Locked)
Apache Kafka7m 32s
-
Amazon MSK6m 33s
-
(Locked)
Kinesis vs. MSK4m 1s
-
(Locked)
Amazon Data Firehose4m 9s
-
(Locked)
Hands-on learning: Configure an Amazon Data Firehose stream5m 33s
-
(Locked)
Amazon Managed Service for Apache Flink2m 22s
-
(Locked)
Amazon Kinesis Analytics5m 22s
-
(Locked)
Amazon Kinesis Video Streams5m 47s
-
(Locked)
Amazon Redshift5m 14s
-
(Locked)
Amazon Redshift Serverless5m 4s
-
(Locked)
Storage platforms4m 14s
-
(Locked)
Aligning to access patterns8m 35s
-
(Locked)
Cost and performance comparisons3m 4s
-
(Locked)
Extracting data from storage6m 56s
-
Summary of storage options7m 43s
-
(Locked)
Exam cram11m 34s
-
(Locked)
-
-
-
-
-
-
-
-
-
-