Lead Generative AI Research Engineer (AI Labs)

Full time at Krutrim in India

Posted on February 7, 2025

Job details

Lead Generative AI Engineer / Scientist - Large-Scale AI Models Location: Bangalore (India), Singapore and Palo Alto (CA, US) Type of Job: Full-time About Krutrim: is building AI computing for the future. Our envisioned AI computing stack encompasses the AI computing infrastructure, AI Cloud, multilingual and multimodal foundational models, and AI-powered end applications. We are India’s first AI unicorn and built the first foundation model from the country. Our AI stack is empowering consumers, startups, enterprises and scientists across India and the world to build their end AI applications or AI models. While we are building foundational models across text, voice, and vision relevant to our focus markets, we are also developing AI training and inference platforms that enable AI research and development across industry domains. The platforms being built by Krutrim have the potential to impact millions of lives in India, across income and education strata, and across languages. The team at Krutrim represents a convergence of talent across AI research, Applied AI, Cloud Engineering, and semiconductor design. Our teams operate from three locations: Bangalore, Singapore & San Francisco. Job Description: We are looking for an experienced Lead Generative AI Engineer to train, optimize, scale, and deploy a variety of generative AI models such as large language models, voice/speech foundation models, vision and multi-modal foundation models using cutting-edge techniques and frameworks. In this hands-on role, you will architect and implement state of art neural architecture, robust training and inference infrastructure to efficiently take complex models with billions of parameters to production while optimizing for low latency, high throughput, and cost efficiency. Key Responsibilities:

Architect and refine foundation model infrastructure to support the deployment of optimized AI models with a focus on C/C++, CUDA, and kernel-level programming enhancements.
Implement state-of-the-art optimization techniques, including quantization, distillation, sparsity, streaming, and caching, for model performance enhancements.
Spearhead the development of Vision pipelines, ensuring scalable training and inference workflows of 10s and 100s of billions of parameter foundation models.
Should be able to innovate for the state-of-the-art architectures involving Panoptic Segmentation, Image Classification and Image Generation. It is expected that the candidate experiments with the internals of Vision Transformers and convolutional Models like ConvNext, CLIP, Visual Question Answering (VQA) and Diffusion Models. Practice around AI Arts, Image Prompts, Conditional Image Generation will be an additional advantage.
Design, develop, and innovate state-of-the-art in large multimodal models.
Make architectural choices across dense / Mixture-of-experts, early fusion / deep fusion, choice of modality encoders (VQ-GAN, ViT, CLIP/SigLIP), decoders (Stable diffusion, Stable cascade, AudioLDM).
Proven track record of developing and applying novel neural network architectures such as Mixture of Experts, Diffusion Models, and State Space Machines (MAMBA, SAMBA)
Execute training and inference processes with a key emphasis on minimizing latency and maximizing throughput, utilizing GPU clusters and custom hardware.
Innovate on current model deployment platforms, employing AWS, GCP, and GPU clusters, to enable high scalability and responsiveness.
Integrate and tailor frameworks such as PyTorch, TensorFlow, DeepSpeed, Lightening, FSDP, and Habana for the advancement of super-fast model training and inference.
Advance the deployment infrastructure with MLOps frameworks such as KubeFlow, MosaicML, Anyscale, Terraform, ensuring robust development and deployment cycles.
Enhance post-deployment mechanisms with exhaustive testing, real-time monitoring, and sophisticated explainability and robustness checks.
Drive continuous improvement initiatives for deployed models with automated pipelines for drift detection and performance degradation.
Lead the charge in model management, encompassing version control, reproducibility, and lineage tracking.
Cultivate a culture of high-performance computing and optimization within the AI/ML domain, propagating best practices and knowledge sharing.

Qualifications:

Ph.D. with 5+ years or MS with 8+ years of experience in ML Engineering, Data Science, or related fields.
Demonstrated expertise in high-performance computing with proficiency in Python, C/C++, CUDA, and kernel-level programming for AI applications.
Extensive experience in the optimization of training and inference for large-scale AI models, including practical knowledge of quantization, distillation, and Vision Pipelines.
It will be of additional benefit if the Candidate understands Diffusion Models (DDPM), Variational Autoencoders, Bayesian Modelling, Stochastic Variational Inference (SVI) and Reinforcement Learning.
Experience in building 10s and 100s of billions of parameters generative AI foundation models
AI training job scheduling, orchestration, and management via SLURM and Kubeflow.
Proven success in deploying optimized ML systems on a large scale, utilizing cloud infrastructures and GPU resources.
In-depth understanding and hands-on experience with advanced model optimization frameworks such as DeepSpeed, FSDP, PyTorch, TensorFlow, and corresponding MLOps tools.
Familiarity with contemporary MLOps frameworks like MosaicML, Anyscale, Terraform, and their application in production environments.
Strong grasp of state-of-the-art ML infrastructures, deployment strategies, and optimization methodologies.
An innovative problem-solver with strategic acumen and a collaborative mindset.
Exceptional communication and team collaboration skills, with an ability to lead and inspire.

Apply safely

To stay safe in your job search, information on common scams and to get free expert advice, we recommend that you visit SAFERjobs, a non-profit, joint industry and law enforcement organization working to combat job scams.

See All Lead Jobs

Lead Generative AI Research Engineer (AI Labs)

Job details

Apply safely

Hiring company

Krutrim

Jobs

Courses

Location

Follow us

Home India Lead Generative AI Research Engineer (AI Labs)

Home India Lead Generative AI Research Engineer (AI Labs)

Lead Generative AI Research Engineer (AI Labs)

Job details

Apply safely

Hiring company

Krutrim

Why are you reporting this job?

Laimoon Job Alert fresh jobs directly from websites*

Jobs

Courses

Location

Follow us