This project implements an end-to-end ETL (Extract, Transform, Load) pipeline using Python. The pipeline extracts data from a CSV file, transforms it using Pandas, and loads the cleaned data into a ...
In 2026, entry-level and transition roles often expect more than passive course completion. Employers want to see that you can write queries that answer business questions, work with browser logic, ...
Implemented pandas-based cleaning rules in data_preprocessing.py, transformations for salesorder.csv → clean_salesorder.csv, pipeline testing via multiple DAG runs.