Setting up dbt Core involves installing the tool, configuring your environment, and running your first dbt project. Below is a…
Category: Data Engineering
DBT Tutorial for Beginners: What is DBT and Why Use It?
This is DBT Tutorial and you can download pdf material at the end of this article. Let’s start by stating…
What is Data Fabric?
Data fabric is a unified layer that seamlessly connects diverse data sources, including relational databases, data warehouses, data lakes, cloud…
Data Warehouse vs Data Lake vs Data Lakehouse
“Data Warehouse vs Data Lake vs Data Lakehouse” is a topic that often confuses many people. In this blog, we…
What are Transactional Databases?
Transactional databases are designed to manage and facilitate transactions in a way that ensures data integrity and consistency. They are…
Understanding DAG Scheduling in Apache Airflow
Hey there! Ready to dive into the world of Apache Airflow and make your workflows run like a charm? Let’s…
What are the differences between ETL, ELT, and ELTP?
The differences between ETL, ELT, and ELTP methodologies highlight the evolution of data integration strategies over the past few decades.…
Apache Airflow Tutorial: Architecture, Concepts, and How to Run Airflow Locally With Docker
Apache Airflow is an open-source platform that provides a way to programmatically author, schedule, and monitor workflows. It is widely used…
How Uber Harnesses the Power of Presto and Apache Kafka for Data Analysis
At Uber, data is at the heart of everything we do. From optimizing routes for drivers to predicting rider demand,…
How Uber Uses Kafka in Its Dynamic Pricing Model
In the fast world of ride-sharing, Uber stands out because of its large network, easy-to-use app, and smart dynamic pricing,…