This project implements an end-to-end data engineering pipeline that collects, processes, and annotates software engineering job descriptions, producing data for AI recruitment models.
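
As a rough illustration of the collect, process, annotate flow, the sketch below chains three stages over a single in-memory record. It is not the project's actual code: the `JobPosting` class, the `collect`, `process`, `annotate`, and `run_pipeline` functions, the `SKILLS` vocabulary, and the sample posting are all hypothetical names and data chosen for this example.

```python
from __future__ import annotations

import re
from dataclasses import dataclass, field

# Hypothetical skill vocabulary; a real annotation step would likely
# use a much larger taxonomy or a trained model.
SKILLS = ("python", "sql", "kubernetes")


@dataclass
class JobPosting:
    """Hypothetical record passed between pipeline stages."""
    raw_text: str
    clean_text: str = ""
    skills: list[str] = field(default_factory=list)


def collect() -> list[JobPosting]:
    # Stand-in for a real collector (scraper, API client, file reader).
    return [JobPosting("<p>Senior   Engineer: Python, SQL</p>")]


def process(posting: JobPosting) -> JobPosting:
    # Strip HTML tags, then collapse runs of whitespace.
    text = re.sub(r"<[^>]+>", " ", posting.raw_text)
    posting.clean_text = re.sub(r"\s+", " ", text).strip()
    return posting


def annotate(posting: JobPosting) -> JobPosting:
    # Tag each posting with the vocabulary terms it mentions as whole words.
    lowered = posting.clean_text.lower()
    posting.skills = [
        s for s in SKILLS if re.search(rf"\b{re.escape(s)}\b", lowered)
    ]
    return posting


def run_pipeline() -> list[JobPosting]:
    # Each stage consumes the previous stage's output.
    return [annotate(process(p)) for p in collect()]


if __name__ == "__main__":
    for job in run_pipeline():
        print(job.clean_text, job.skills)
```

Keeping each stage as a small function that takes and returns a record makes it easy to test stages in isolation and to swap a stand-in (such as the hard-coded collector here) for a real implementation.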