Senior Data Engineer
Job details
Overview: As a Senior Data Engineer, you will play a pivotal role in designing, implementing, and maintaining the data infrastructure of our organization. You will be responsible for developing robust data pipelines, optimizing data workflows, and ensuring the reliability and scalability of our data systems. Collaboration with cross-functional teams including data scientists, software engineers, and business analysts will be essential to drive data-driven decision-making and support various business initiatives. Responsibilities: 1. Data Pipeline Development: - Design, build, and maintain scalable data pipelines to ingest, process, and transform large volumes of structured and unstructured data from diverse sources. - Implement efficient ETL (Extract, Transform, Load) processes to ensure timely and accurate data delivery to downstream systems. - Optimize data workflows for performance, reliability, and cost-effectiveness, leveraging technologies such as Apache Spark, Kafka, or similar distributed computing frameworks. 2. Data Modeling and Architecture: - Develop and maintain data models, schemas, and metadata to support analytical and reporting requirements. - Design and optimize data storage solutions including relational databases, NoSQL databases, data lakes, and data warehouses. - Collaborate with data architects and infrastructure teams to ensure alignment with overall data architecture and best practices. 3. Data Quality and Governance: - Implement data quality checks and validation processes to ensure data accuracy, consistency, and completeness. - Establish and enforce data governance policies, standards, and procedures to maintain data integrity and compliance with regulatory requirements. - Monitor data pipelines and proactively identify and resolve data quality issues or anomalies. 4. Performance Tuning and Optimization: - Identify performance bottlenecks in data processing and storage systems and implement optimizations to improve throughput, latency, and resource utilization. - Conduct capacity planning and scalability assessments to support growing data volumes and user demands. - Collaborate with infrastructure teams to fine-tune hardware configurations and cloud resources for optimal performance. 5. Collaboration and Communication: - Work closely with cross-functional teams including data scientists, software engineers, business analysts, and product managers to understand data requirements and deliver data solutions that meet business needs. - Communicate effectively with stakeholders to gather requirements, provide project updates, and present insights derived from data analysis. - Mentor junior data engineers and provide technical guidance and support as needed. Qualifications: - Bachelor's or Master's degree in Computer Science, Engineering, or a related field. - 8+ years of experience in data engineering roles, with a proven track record of designing and implementing data solutions at scale. - Proficiency in programming languages such as Python, Java, or Scala, with experience in building data pipelines using frameworks like Apache Spark, Apache Beam, or similar. - Strong SQL skills and experience working with relational databases (e.g., PostgreSQL, MySQL) and NoSQL databases (e.g., MongoDB, Cassandra). - Hands-on experience with cloud platforms such as AWS, GCP, or Azure, including services like S3, EC2, EMR, Dataflow, BigQuery, or equivalent. - Solid understanding of data modeling concepts, data warehousing principles, and ETL best practices. - Familiarity with data governance frameworks, data quality tools, and regulatory requirements (e.g., GDPR, HIPAA). - Excellent problem-solving skills and the ability to work effectively in a fast-paced, dynamic environment. - Strong communication and collaboration skills, with the ability to interact with stakeholders at all levels of the organization. Additional Preferred Skills: - Experience with containerization and orchestration tools such as Docker, Kubernetes. - Knowledge of stream processing frameworks like Apache Kafka, Apache Flink, or similar. - Familiarity with machine learning and data science concepts. - Certification in relevant technologies or cloud platforms (e.g., AWS Certified Big Data - Specialty, Google Professional Data Engineer).
Apply safely
To stay safe in your job search, information on common scams and to get free expert advice, we recommend that you visit SAFERjobs, a non-profit, joint industry and law enforcement organization working to combat job scams.