From the course: Data Science Foundations: Data Mining in R
Unlock the full course today
Join today to access over 24,700 courses taught by industry experts.
Challenge: K-nn
From the course: Data Science Foundations: Data Mining in R
Challenge: K-nn
(bright music) - [Narrator] Now that we've covered several methods of classification within a data mining context. I want to give you the opportunity to try doing this with a new dataset. To do this we're going to use a data set that comes from a package called mlbench which stands for machine learning benchmark. I'm going to load that package along with a few others. And then I'll set the random seed. Now I'm going to do a little bit of data preparation here. So you have things ready to go for the actual classification task. We're going to use a dataset that's about breast cancer that comes from the mlbench package. So let's get information on that one. The Wisconsin Breast Cancer Database and what it does is it looks at cells and it classifies them by clump thickness, uniformity of cell size, bare nuclei, mitoses and so on. And then finally class is whether it is benign or malignant. And let's start by loading…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.