From the course: Hands-On Analytics Engineering Project
Unlock the full course today
Join today to access over 24,700 courses taught by industry experts.
What is data modeling?
From the course: Hands-On Analytics Engineering Project
What is data modeling?
- Now that we've built our pipeline and brought our data into our database, we can start modeling our data. Data modeling is the step where we start to add value to the data. After our raw data is available, but before we can start visualizing our data, we can model our data. If you remember our water well analogy, when we built our pipeline, we took a resource, our data, and we pipelined that data into our warehouse or database. Now within our warehouse, we can start to refine and distill our data. This modeling or refining process is what adds value to the resource we have. Pipelining the data alone is not enough, we should make sure the data is clean and safe for use by our customers or stakeholders, and in a structure that can provide value to the company. There are many tools in which one can model their data. Some popular tools include Dbt, BigQuery, SQL Server, and Snowflake. The great news is that at their core, each tool uses a version of SQL. As you know, SQL stands for…