Home South Africa Site Reliability Engineering Manager

Home South Africa Site Reliability Engineering Manager

Site Reliability Engineering Manager

Full time at a Laimoon Verified Company in South Africa
Posted on April 24, 2024

Job details

Key Responsibilities:Build, lead and manage the SRE and IT Telescope Operations Team.Operations and Service management - Work with stakeholders within the organisation to develop and detail Computing and Software operations and service framework, processes and tools required to operate the telescope as intended.Service delivery and support - Continuously assess and recommend improvements to our platform and processes to enhance the effectiveness of our services.Infrastructure, network and platform management.Support telescope construction and deployment.Key Requirements:Qualification:BTech/ Degree/ Masters/ PHD in Computer Science, Information Technology, Information Systems, Computer Engineering or related fieldsExperience:BTech in Computer Science, Information Technology, Information Systems, Computer Engineering or related fields coupled with 13 years relevant working experience; or Degree in Computer Science, Information Technology, Information Systems, Computer Engineering or related fields coupled with 9 years relevant working experience; or Master's Degree in Computer Science, Information Technology, Information Systems, Computer Engineering or related fields coupled with 7 years relevant working experience; or PHD in Computer Science, Information Technology, Information Systems, Computer Engineering or related fields coupled with 5 years relevant working experience.Computer and network infrastructure implementationIT service, operations and management, including significant responsibility over Service Level AgreementsIT Infrastructure or software Team leadershipIT Architecture and GovernanceProject managementIT systems engineering, application support, and user managementIT governance and securityData governance and securityIT availability, resilience and redundancySystems analysis, design and engineeringExperience in supporting distributed software systems in a production environment such as Cloud and/or Data CentresProcurement and IT asset managementKnowledge:Track record of building and managing high-performance teams in a Software, IT or Technology related industry or organisation.Experience in asset lifecycle management and software asset management.Experience in managing resources and prioritisation.Knowledge and background with IT Service Management disciplines and Frameworks such as ITIL and Change Management.Experience of Lean Agile project management.Experience of working in a globally diverse team.Programming/scripting experience and capability across multiple platforms.Additional Notes:SKILLS/ABILITIES/COMPENTENCIES:Essential:Experience working with Linux and within the Open Source Software EcosystemExperience with DevOps tools, processes and culture.Experience and/or certification and knowledge in SRE, ITIL or related IT Management processes.Experience supporting and maintaining large-scale High-Performance Computing (HPC) and storage systems.Advanced experience with programming and/or scripting languages such as Python.Desirable:Certification in Project managementExperience in agile project management e.g. SAFe, Scrum.Demonstrate interest in astronomy and understanding of the challenges of controlling telescopes.Strong Leadership QualityStrategic thinkerProblem solving skillsPlanning and Time ManagementTeam building and collaborationResource ManagementPlanning and DesignCommunication and Interpersonal skillsSkills:Teamwork and Collaboration: Cooperates with others to achieve organisational objectives and may share team resources in order to do this. Collaborates with other teams as well as industry colleagues.Influence and Communication: Identifies critical stakeholders and influences them via an influential third party, for example through an established network, to gain support for sometimes contentious proposals/ideas.Resource Management/Leadership: Provides leadership that fosters an environment that encourages new ideas and provides support for the development of emerging skills. Creates trust by displaying consistency, understanding, integrity and patience. Plans, seeks, allocates and monitors resources to achieve outcomes.Judgement and Problem Solving: Anticipates and manages problems in ambiguous situations. Develops and selects an appropriate course of action and provides for contingencies. Evaluates, interprets and integrates complex bodies of information and draws logical conclusions, synthesises proposals and defends options with reasoned arguments.Independence: Assesses the risk and opportunity of identified strategies, options and actions. Overcomes problems and setbacks in achieving goals. Invariably includes consideration of value-added future impact on the bottom line when determining the optimal and efficient use of resources.Adaptability: Demonstrates flexibility in thinking and adapts to and manages the increasing rate of organisational change by adjusting strategies, goals and priorities.

Apply safely

To stay safe in your job search, information on common scams and to get free expert advice, we recommend that you visit SAFERjobs, a non-profit, joint industry and law enforcement organization working to combat job scams.

Share this job
See All Site Jobs
Feedback Feedback