Senior Infrastructure Engineer
Full time
at Boardroom Appointments
in
Online
Posted on January 14, 2025
Job details
About the job Senior Infrastructure Engineer
- Bachelor's degree in Computer Science/ Engineering or related discipline
- Must be VMware vSphere Certified
- Must be Veeam Certified
- HPE or Dell Certified
- Microsoft Certified - MCSE
- 5+ years of related experience
- Minimum of 2 years in a system administrative position
- Minimum of 2 years working experience with Linux/Unix or derivative
- Minimum of 2 years working experience with troubleshooting hardware and/or software stack
- Strong Experience with Linux-based operating systems required Red Hat Enterprise Linux certified (RHCE or better) or other appropriate Linux/Unix certification
- RHEL, CentOS, and Ubuntu
- Experience in cloud environments (Amazon Web Services / Azure)
- Backups Experience Commvault, or Veeam required
- Basic Mimecast administration
- Microsoft Windows Server Deployments - Active Directory, Exchange, Group Policies, and Office365 experience is required
- Proven experience in and with a large, Enterprise environment and infrastructure HPE Blade Server Technology
- SAN storage solutions Experience in FC, and FCOE storage technologies is required
- Ability to troubleshoot network issues - LAN Routing and Switching troubleshooting (VLANs, IP Subnetting, DHCP, DNS)
- Load Balancer technology experience F5 and HA Proxy etc
- Monitoring and alerting experience with Open-Source technologies such as Nagios, Zabbix, and/or Logstash
- SAN storage solutions i.e., HPE or Dell Experience in FC, and FCOE storage technologies are required
- Own transport and drivers license beneficial in the rare circumstance where emergency onsite support is required at the data centre
- Professional diagramming and completing of relevant documentation Network monitoring, troubleshooting, and systems management experience Datacentre disaster recovery planning and testing
- Knowledge of infrastructure and software environments, programs, methodologies, procedures, and policies
- Advanced troubleshooting skills, with deep experience in systems, network, and server infrastructure, with a proactive problem prevention-oriented problem- solving mindset
- Bash, python, ruby, and/or PHP scripting experience advantageous
- Network/OS clustering knowledge advantageous
- Containerisation (Docker/Kubernetes) knowledge advantageous
- Use of CD/CI tools (Ansible/Puppet/Terraform) advantageous
- Public Cloud vendor experience (Amazon Web Services/Azure)
- Basic networking, from Ethernet to IP HPE Comware, Aruba, ProCurve
- Virtualization technologies such as KVM, RHV-H or VMWare are advantageous
- Experience using client, console web-based management tools
- Experience in working with Application and Infrastructure monitoring solutions Data Centre experience
- Gauge the effectiveness and efficiency of existing systems; develop and implement strategies for improving or further leveraging these systems
- Propose and create system design models, specifications, diagrams, and charts to provide direction to development teams
- Perform server and security audits, system backup procedures, and other recovery processes in accordance with the company's disaster recovery and business continuity strategies
- Integrate servers, including database, e-mail, print, and backup servers and their associated software into enterprise systems
- Ensure system connectivity of all servers, shared software, groupware, and other applications
- Create and maintain documentation as it relates to system configuration, mapping, processes, and service records
- Ensure compatibility and interoperability of in-house computing systems
- Coordinate and perform in-depth tests, including end-user reviews, for modified and new systems
- Monitor and test system performance; prepare and deliver system performance statistics and reports
- Provide orientation and training to end users for all modified and new systems
- Maintain a service-orientated IT operations function that supports ongoing operations to drive efficiency, quality and customer service
- Ensure adherence to service level agreements and manage to deliver on agreed performance levels for application availability, response time, and network performance
- Ensure the protection of IT assets and the integrity, security, and privacy of information as per defined policies
- Drive the implementation of strategies and plans for the technical infrastructure, operational enhancements, and application architecture and execute the plan
- Identifying and evaluating inefficiencies and recommending optimal operations business practices, and system functionality and behavior
- Manage the existing solutions and IT operational applications
- Provide onsite or remote access diagnoses and resolution of computer hardware and software problems using a highly integrated set of diagnostic tools and techniques
- Serve as a technical expert providing support to the other IT Technicians and IT Operations team in general
- Provisioning of servers according to strict guidelines with attention to detail and documentation
- Installation and upgrading of complex IT equipment, networks and systems maintaining and documenting of all work performed as well as strict usage of the call logging systems to ensure open communication with the whole team Administer, implement, and provide technical support of applications and associated hardware utilizing a specialized set of diagnostic tools with elevated privileges
- Operation and administration of most of the IT systems used to manage, monitor and configure all the technical systems in the environment
- Manage access and security of systems in accordance with policies
- Collaborate with systems architecture and operations teams to ensure smooth and reliable operation of software and systems for fulfilling business objectives and processes
- Work with executive team members, decision makers, and stakeholders to define business requirements and systems goals, and to identify and resolve business systems issues
- Execute planned operations and deployment activities, as well as preventive maintenance
- Postproduction monitoring, and troubleshooting of product/services Proactive monitoring, evaluating, recommending, testing of new and existing platforms
- Thorough understanding of the application architecture, functional capabilities and technical infrastructure, both current and future state
- Build a culture of respect and understanding across the organization
- Recognize outcomes which resulted from effective collaboration between teams
- Build co-operation and overcome barriers to information sharing, communication and collaboration across the organization
- Facilitate opportunities to engage and collaborate with external stakeholders to develop joint solutions
- Through effective inspirational leadership, facilitate the creation of accountable, full service teams who understand and strive to meet the needs of all stakeholders
- Role model behavior and motivate team members in line with the core values
- Take full responsibility for performance of all direct reports, motivating and managing them in relation to quality standards and agreed benchmarks and objectives, focusing on all aspects of sound people management
- Provide support and guidance on career path planning, on-the-job training, coaching and mentoring to direct reports
- Develop, promote and direct the implementation of equal opportunities policies in all aspects of the company's work
- Communicate and maintain trust relationships with shareholders, business partners and authorities
- Leads change to creates a self-refreshing and learning organisation
- Continuous improvement of business processes
Apply safely
To stay safe in your job search, information on common scams and to get free expert advice, we recommend that you visit SAFERjobs, a non-profit, joint industry and law enforcement organization working to combat job scams.