Design, build, and maintain cloud-based data pipelines and workflows that support analytics and operational systems.
Integrate data from various sources using APIs and cloud services.
Develop clean, efficient, and test-driven code in Python for data ingestion and processing.
Optimize data storage and retrieval using big data formats like Apache Parquet and ORC.
Implement robust data models, including relational, dimensional, and NoSQL models.
Collaborate with cross-functional teams to gather and refine requirements and deliver high-quality solutions.
Deploy infrastructure using Infrastructure as Code (IaC) tools such as AWS
Qualifications
At least 3 years of experience in a data engineering role, working on data integration, processing, and transformation use cases with open-source languages (e.g., Python) and cloud technologies.