Cloudera Data Engineer
تفاصيل الوظيفة
Job Title: Senior Cloudera Data Engineer Location: India Job Type: Full-time Experience Level: Mid-Senior (8-9 years) Department: Data Engineering Job Summary: We are seeking a highly skilled and motivated Senior Cloudera Data Engineer with 8 to 9 years of experience to design, develop, and maintain data pipelines and infrastructure on the Cloudera platform. The ideal candidate will have a strong background in data engineering, hands-on experience with Cloudera tools, and a deep understanding of big data architectures alongside expertise on informatica toolset such as Informatica DEI, Informatica Powercentre. Key Responsibilities:
- Design, develop, and maintain scalable and robust data pipelines and ETL processes using Cloudera tools and frameworks (e.g., HDFS, Hive, Impala, HBase, Spark, Flink).
- Design, development , and implementation of complex ETL processes using Informatica PowerCenter.
- Utilize Informatica DEI for integrating, transforming, and managing large volumes of data across various data platforms, including Hadoop, Spark, and other big data technologies.
- Collaborate with data architects, data scientists, and business stakeholders to understand requirements and deliver solutions that meet their needs.
- Optimize and tune performance of data processing jobs, ensuring efficient resource usage and fast execution times.
- Implement and enforce best practices for data integration, data quality, data security, and governance.
- Monitor, troubleshoot, and resolve data pipeline and infrastructure issues promptly, ensuring high availability and reliability.
- Develop and maintain documentation related to data pipelines, processes, and infrastructure.
- Stay up to date with the latest industry trends, technologies, and best practices in big data and cloud computing.
- Provide technical guidance and mentorship to junior data engineers within the team.
- Bachelor’s or master’s degree in computer science, Information Technology, Data Science, or a related field.
- 8-9 years of professional experience in data engineering, with a significant focus on Cloudera platforms.
- Strong hands-on experience with Cloudera tools and technologies such as Hadoop, Hive, Impala, HBase, Spark, Flink, and Kafka.
- Experience in Design, development , and implementation of complex ETL processes using Informatica PowerCenter.
- Experience in Utilizing Informatica DEI for integrating, transforming, and managing large volumes of data across various data platforms, including Hadoop, Spark, and other big data technologies.
- Proficiency in programming languages such as Python, Java, or Scala.
- Solid understanding of data modelling, data warehousing concepts, and big data architectures.
- Experience with cloud platforms (e.g., AWS, Azure, GCP) and containerization (e.g., Docker, Kubernetes) is a plus.
- Proven track record of successfully designing and implementing large-scale data processing systems.
- Excellent problem-solving, analytical, and critical thinking skills.
- Strong communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams.
- Experience with machine learning and data science workflows.
- Knowledge of SQL and NoSQL databases.
- Familiarity with DevOps practices and CI/CD pipelines.
- Industry certifications related to big data and Cloudera platforms.
- Competitive salary and performance-based bonuses.
- Comprehensive health, dental, and vision insurance.
- Retirement savings plan with company match.
- Professional development opportunities and educational reimbursement.
- Flexible work hours and remote work options.
- Inclusive and collaborative work environment.
Apply safely
To stay safe in your job search, information on common scams and to get free expert advice, we recommend that you visit SAFERjobs, a non-profit, joint industry and law enforcement organization working to combat job scams.