Cloudera Data Engineer
Description
Location: Remote (UAE Time Zone)
Experience: 5+ Years
Availability: Immediate joiners or candidates who can start within 30 days preferred
Role Overview
We are seeking an experienced Cloudera Data Engineer to design, build, and maintain data pipelines and processing frameworks across Cloudera’s ecosystem. The ideal candidate will have strong expertise in Spark, Hive, Impala, NiFi, and Oozie, and proven experience optimizing large-scale data pipelines for performance and reliability.
Key Responsibilities
Design, build, and maintain ETL/ELT pipelines using Spark, Hive, Impala, NiFi, and Oozie.
Optimize data pipelines for performance, reliability, and cost efficiency across batch and streaming workloads.
Develop automation scripts and reusable frameworks for data ingestion, transformation, and cleansing.
Collaborate with Cloudera Administrators to tune performance and manage cluster resources efficiently.
Troubleshoot and resolve data pipeline failures, performance bottlenecks, and data quality issues.
Monitor cluster utilization, job performance, and data throughput using Cloudera Manager.
Work with DevOps/Infrastructure teams to define and maintain CI/CD pipelines, automation for deployments, and monitoring of data jobs.
Participate in incident management, root cause analysis, and preventive maintenance.
Ensure pipelines meet SLAs, data quality, and latency requirements for analytics workloads.
Requirements
5+ years of experience as a Data Engineer in a Cloudera-based environment.
Strong hands-on experience with Spark, Hive, Impala, NiFi, and Oozie.
Solid understanding of ETL design, performance optimization, and data governance principles.
Experience with Linux/Unix scripting and CI/CD integration.
Excellent troubleshooting and communication skills.
Posted: 21st December 2025 9.42 am
Application Deadline: N/A