+4 years of industry experience as a Data Engineer
Experience developing large scale batch & streaming ETLs in spark, preferably in Databricks
Expertise developing in Python. Scala knowledge is a plus
Good command of Apache Airflow for workflow orchestration
Experience working in cloud environments, preferably GCP
Strong hands-on experience with modern, open-source table formats such as Apache Iceberg or Delta Lake to manage large-scale data lakehouse architectures
Proven track record managing data access controls, CLS, RLS and cataloging workflows. Experience with Unity Catalog is a plus
Hands-on experience working with containerized applications, preferably orchestrated in Kubernetes
Experience working with Google Pub/Sub is preferred
Knowledge of the software development cycle and its best practices (e.g. CI/CD, DevOps, testing)