From the course: Intermediate SQL for Data Scientists (2021)
Unlock the full course today
Join today to access over 24,700 courses taught by industry experts.
Use SOUNDEX with misspelled text
From the course: Intermediate SQL for Data Scientists (2021)
Use SOUNDEX with misspelled text
- [Instructor] Sometimes when we work with texts, we have mistakes or variations in spelling that make exact matching difficult. Now, when that's the case, we can use what's known as approximate matching. And in Postgres, there is an extension or an additional module called fuzzystrmatch, which we can use. So to use that we need to actually install the extension because it's not included by default in the Postgres installation. So to use an extension, we have to create it, so I'm going to specify, create extension. And then we specify the name of the extension. And these are documented in the Postgres documentation. And the extension we're going to install is called fuzzystrmatch. Now I'm going to execute this to show an error message that happens when the extension already exists. So it doesn't hurt anything, this doesn't change anything. It's just, if you are going to use this create extension command in a script, if…
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
(Locked)
Reformat character data8m 24s
-
Extract strings from character data6m 26s
-
(Locked)
Filter with regular expressions7m 14s
-
(Locked)
Reformat numeric data4m 11s
-
(Locked)
Use SOUNDEX with misspelled text7m 50s
-
(Locked)
Challenge: Prepare a data set for analysis26s
-
(Locked)
Solution: Prepare a data set for analysis32s
-
(Locked)
-
-
-
-