Location: Dubai
🕒 Experience: 5+ Years

🔎 About the Role
We are looking for a highly skilled Data Engineer with strong expertise in PySpark and Cloudera Data Platform (CDP) to build and manage scalable data pipelines. The ideal candidate should have hands-on experience in big data ecosystems, data lakes, and advanced data processing techniques.

🛠️ Key Responsibilities
Design, develop, and maintain scalable ETL pipelines using PySpark on CDP
Implement robust data ingestion from databases, APIs, and file systems
Perform large-scale data transformation and processing for analytics
Optimize PySpark jobs and Cloudera components for performance and efficiency
Ensure data quality, validation, and reliability across pipelines
Automate workflows using Airflow, Oozie, or similar orchestration tools
Monitor, troubleshoot, and maintain pipeline performance

✅ Mandatory Skills
Strong hands-on experience in PySpark and Data Engineering
Expertise in Cloudera Data Platform (CDP)
Experience with data ingestion, transformation, and data modeling
Knowledge of ETL pipeline design and optimization
Experience with orchestration tools like Airflow/Oozie
Strong understanding of big data ecosystems and data lakes

🎯 Looking for candidates with proven experience in building scalable data pipelines and working on enterprise-level data platforms.