IT Administrator - HPC Qatar Posted on 03/28/2024 Trending
تفاصيل الوظيفة
JOB SUMMARYThe IT Administrator - HPC within the Biomedical Informatics Division is responsible for the installation, configuration, tuning, and troubleshooting of the HPC infrastructure (Linux servers in more than 1 PB environment with over 1000 processing cores) and the associated systems software. He/ She is also responsible for maintaining the integrity and the stability of the HPC.
The administrator provides technical expertise for the use of the HPC systems and applications. He/ she works with users at SIDRA to solve specific computational problems and help researchers to efficiently utilize HPC resources.
KEY ROLE ACCOUNTABILITIESInstalls software on HPC systems, writes test scripts, troubleshoots application problems and runs benchmarks to evaluate the performance of algorithms on different configurations.
Assists in monitoring the performance and stability of the HPC resources.
Oversees the health, compliance and performance of various HPC systems.
Contributes to the design and configuration of HPC systems in response to the business requirements.
Leads projects related to the deployment of new systems (hardware and software), the upgrade of existing systems, and the integration among various systems.
Performs Disk Management and Data Backup.
Provides monthly reports on the performance and utilization of the HPC systems to their management.
Installs software & manages file systems, and troubleshoots alerts from monitoring tools.
Performs programming/scripting to automate some of the operational functions.
Helps researchers evaluate available software and hardware.
Assists researchers with debugging problems that arise when compiling or using HPC resources or linking to HPC-specific libraries (for example, C, C++, Matlab, Perl, R, openmp, mpi, cuda, pthreads, etc.).
Assists researchers in optimizing or parallelizing existing applications.
Adheres to Sidra's standards as they appear in the Code of Conduct and Conflict of Interest policies.
Adheres to and promotes Sidra's Values.
QUALIFICATIONS, EXPERIENCE AND SKILLS - SELECTION CRITERIAESSENTIALEducationBachelor's degree in Computer Science, Engineering or a relevant field.
Experience4+ Years technical systems administration experience; maintaining computer systems infrastructure and its operation, at least 3 of which is in high performance and cluster computing for scientific applications.
Experience in Unix operating systems (Level 2 support or higher).
Experience in HPC software development and architecture.
Experience with virtualization concept and technology.
A good understanding of computer network infrastructure.
Certification and LicensureUnix / Linux Certified.
VMware/Citrix.
PMP.
Job Specific Skills and AbilitiesHigh level of Technical skills related to systems administration and infrastructure of HPCs.
Extensive experience and skill managing Unix/Linux operating systems in a large-scale system environment.
Shell scripting experience and ability.
Solid understanding of networked computing environment concepts.
Demonstrated ability in managing file systems and storage in an HPC environment.
Experience with batch schedulers (particularly PBS or MOAB).
Ability to understand the business requirements of the users and translate it into technical requirements.
Ability to manage vendor relationships.
Ability to manage multiple projects simultaneously.
Ability to assess the criticality and urgency of users' requirements and prioritize them properly.
Ability to prepare reports on problems' root cause or technology evaluation.
Ability to develop and present technical information in a format that is understood by non-technical individuals.
Excellent written and interpersonal communications skills and self-motivation are essential.
Proven ability to work well individually and in a team environment and to produce high-quality work.
Proficiency with Microsoft Office suite.
Fluency in written and spoken English.
PREFERREDDatabase awareness.
Understanding of traffic classification and prioritization.
Understanding of network security.
Experience building, installing, and configuring a variety of open-source Linux software packages, especially with complex dependencies.
Experience setting up and maintaining a clustered file system such as GPFS or others.
Experience setting up and maintaining scientific computing clusters and their associated scheduling systems, such as SGE or PBS.
Previous experience working in Bioinformatics/NGS environment.#J-18808-Ljbffr
Apply safely
To stay safe in your job search, information on common scams and to get free expert advice, we recommend that you visit SAFERjobs, a non-profit, joint industry and law enforcement organization working to combat job scams.