Operations Engineer (Linux)

Full-time at Asterix Communications in Saudi Arabia
Posted on September 22, 2024

Job Details

Job Overview: We are looking for a skilled Operations Engineer (Linux) to join our client in Riyadh, KSA, in a full-time capacity. The ideal candidate will manage and optimize Linux-based infrastructure, utilize containerization technologies, and support various middleware systems. Proficiency in Linux commands and shell scripting, along with experience in Docker, Kubernetes, access control systems, and monitoring tools, is essential.

Key Responsibilities:

  1. Linux System Management: Perform routine maintenance, upgrades, and troubleshooting for Linux systems. Use Linux commands and write shell scripts to automate tasks and improve operational efficiency.
  2. Containerization: Manage and orchestrate containerized applications using Docker and Kubernetes, applying basic commands for effective container management and orchestration.
  3. Middleware Support: Maintain and support middleware systems such as MySQL and Elasticsearch, ensuring optimal performance and reliability.
  4. Access Control Systems: Configure and manage access control systems to ensure secure and controlled resource access.
  5. Cluster Management: Manage Slurm clusters, Lustre storage, and Ceph storage, where applicable. Understand multi-node, multi-GPU scheduling in Slurm clusters and effectively utilize Ceph storage.
  6. Monitoring and Troubleshooting: Use monitoring tools like Grafana and Zabbix to analyze logs, review monitoring data, and troubleshoot performance and reliability issues.
  7. Documentation: Maintain accurate documentation of system configurations, processes, and procedures, ensuring all documentation is up-to-date and accessible.
  8. Collaboration: Work closely with cross-functional teams to ensure smooth product delivery and integration. Collaborate with team members to resolve technical issues and enhance system performance.

Required Qualifications:

  1. Proficiency in common Linux commands and shell scripting.
  2. Familiarity with Docker and Kubernetes commands.
  3. Experience with access control systems.
  4. Experience with middleware such as MySQL and Elasticsearch.

Preferred Qualifications:

  1. Experience in managing Slurm clusters, Lustre storage, and Ceph storage.
  2. Understanding of multi-node, multi-GPU scheduling in Slurm clusters.
  3. Knowledge of Ceph storage utilization.
  4. Experience with monitoring tools like Grafana and Zabbix.
  5. Ability to analyze logs, review monitoring data, and troubleshoot issues.

Language Requirement:

Proficiency in Mandarin is mandatory. Any nationality is acceptable.

Application Instructions:

If you meet the qualifications and are interested in this role, please submit your CV for review. Be sure to highlight relevant experience and skills as outlined in the job description. Shortlisted candidates will be contacted directly for further discussion.

