Systems Engineer, Managed Operations
Job details
Job ID: 2850057 | Amazon Web Services Development Center Germany GmbH AWS is set to introduce the inaugural European Sovereign Cloud (ESC), marking a significant development in utility computing (UC). To spearhead this initiative, we are actively seeking experienced systems engineers with a strong background in automation and operations. As part of the AWS Managed Operations team, you will play a pivotal role in building and leading operations and development teams dedicated to delivering high-availability AWS services, including EC2, S3, Dynamo, Lambda, and Bedrock, exclusively for EU customers. For more information on ESC please check out our blog: AWS European Sovereign Cloud Blog . Your responsibilities will encompass overseeing the launch of the ESC in 2025, working closely with global AWS teams, and influencing the evolution of AWS services and technology. A typical day in this role involves collaborating with technology leaders, contributing to the enhancement of day-to-day operations, and ensuring improvements in availability, reliability, latency, performance, and efficiency of the ESC. You will be required to occasionally participate in “on-call” rotations to resolve incidents occurring out-of-hours. The overarching goal is to deliver scalable services and ensure a high-availability experience for EU customers. If you are an experienced professional ready for a challenging and impactful opportunity, we invite you to join our efforts in building a best-in-class development engineering and operations team that aligns with AWS' commitment to customer satisfaction and continual innovation.
A day in the life
Embark on a week filled with meaningful contributions to the operation and improvement of significant software systems. You dedicate a substantial portion of your time to carefully review the operational health of services within your team's responsibility. In the process, you diligently identify anomalies and craft actionable bug reports, aspiring to enhance the overall efficiency and performance of your systems. In addition to these responsibilities, you offer constructive feedback on change management documents and work earnestly to address your team's operational backlog. Through a collaborative effort, you strive to navigate challenges and ensure the seamless functionality of your systems. Additionally, you engage in the development and testing of scripts, hoping to provide practical solutions to enhance your workflows. Beyond the technical aspects, you assume a role as an educator, sharing insights on the complexities of the European Sovereign Cloud with service teams. It's a humbling experience for you to contribute to the collective knowledge of the team, fostering a culture of mutual understanding. This week encapsulates your commitment to continuous learning and improvement, acknowledging that every effort, no matter how small, contributes to the collective success of your team and the reliability of your software systems. In addition to these responsibilities, your position involves 24x7 on-call responsibility. You work as a team to root-cause issues and ensure your systems remain resilient and fault-tolerant, underscoring your commitment to maintaining operational excellence.Eligibility requirement
- Fluency in written and spoken English is required.
- Successful applicants must have the legal right to work in Germany.
- Amazon will provide relocation support for successful applicants relocating within the European Union.
BASIC QUALIFICATIONS
- 5+ years of systems engineering experience, working with hardware, software, networking, operating systems
- Proficient leading the creation, revision, and/or improvement of standard operational procedures (SOPs)
- Proficient driving operational best practices.
- Proficient scripting processes in a language such as Bash, Python, or Ruby
- Proficient troubleshooting and anticipating problems that affect the performance, reliability, or availability of software systems
PREFERRED QUALIFICATIONS
- Experience working cross-organizationally and leading strategic team efforts requiring work from multiple team members
- Experience actively mentoring junior engineers
- Experience performance tuning software applications and optimizing fleet utilization
- Experience with Infrastructure as Code, (such as CDK, CloudFormation, Puppet, Chef, Ansible, or similar)
Apply safely
To stay safe in your job search, information on common scams and to get free expert advice, we recommend that you visit SAFERjobs, a non-profit, joint industry and law enforcement organization working to combat job scams.