Sr. AI Ops Engineer
Description
The Sr. AI Ops Engineer manages the deployment, monitoring, and maintenance of AI models and systems. This role involves ensuring the reliability, scalability, and performance of AI systems, collaborating with cross-functional teams to optimize AI operations, and troubleshooting issues as they arise. Responsibilities and Duties Deploy, monitor, and maintain AI models and systems to ensure optimal performance and reliability. Implement and manage CI/CD pipelines for the continuous integration and delivery of AI models. Collaborate with data scientists, AI engineers, and other stakeholders to understand model requirements and ensure successful deployment. Monitor the performance of AI models and systems, identifying and resolving issues promptly. Develop and maintain automated monitoring and alerting systems to ensure the health and performance of AI systems. Optimize AI models and infrastructure for scalability and efficiency. Ensure compliance with data governance, security, and regulatory standards in AI operations. Document deployment procedures, monitoring processes, and maintenance activities. Stay updated with the latest advancements in AI operations and infrastructure technologies. Provide technical support and guidance to AI Ops engineers and other team members. Participate in project planning and contribute to the development of project timelines and deliverables. Perform other duties relevant to the job as assigned by the Principal AI Ops Engineer or senior management. Requirements Bachelor’s degree in Computer Science, Information Technology, or a related field. Relevant certifications (e.g., AWS Certified Dev Ops Engineer, Google Cloud Professional Dev Ops Engineer) are preferred. Minimum of 5 years of experience in AI operations, Dev Ops, or related fields. Experience in managing the deployment and maintenance of AI models. Strong programming skills in languages such as Python and Java. Proficiency in AI and machine learning frameworks (e.g., Tensor Flow, Py Torch). Experience with CI/CD tools (e.g., Jenkins, Git Lab CI). Excellent problem-solving and troubleshooting skills. Strong communication and interpersonal skills. In-depth knowledge of AI operations and infrastructure management. Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud) and their AI services. Understanding of data governance, security, and regulatory standards. Ability to manage multiple tasks and prioritize effectively. Strong attention to detail and commitment to delivering high-quality work. Ability to work independently and as part of a team.#J-18808-Ljbffr
Posted: 7th July 2025 12 pm
Application Deadline: N/A
Similar Jobs
Explore more opportunities like this