Gen AI Architect
Job details
1. About the Job - The candidate should require mandatory Gen AI Architect Experience. 2. Job Title – Gen AI Architect 3. Location – Noida, Mumbai, Pune, Bangalore, Hyderabad, Chennai 4. Educational Background - UG. – B. Tech /B. E in any specialization PG. – MCA in Computers 5. Key Responsibilities - Architectural Design:
- Collaborate with stakeholders to understand business requirements and translate them into architectural blueprints.
- Design scalable, secure, and high-performance architecture for the Autogen-based LLM-Integrated application.
- Define data models and schemas for integrating operational data from relational databases into the application.
- Lead the implementation efforts, ensuring adherence to architectural guidelines and best practices.
- Develop robust APIs and interfaces for seamless communication between the application and relational databases.
- Write efficient and maintainable code, following coding standards and version control processes.
- Integrate operational data from various relational databases into the application, ensuring data consistency and integrity.
- Conduct thorough testing, including unit testing, integration testing, and performance testing, to validate the functionality and scalability of the application.
- Troubleshoot and debug issues as they arise during the integration and testing phases.
- Identify performance bottlenecks and optimization opportunities within the application architecture.
- Implement performance tuning strategies to improve the speed, reliability, and efficiency of data retrieval and processing.
- Continuously monitor system performance and proactively address any degradation or inefficiencies.
- Create comprehensive technical documentation, including architecture diagrams, API specifications, and deployment procedures.
- Conduct knowledge sharing sessions to disseminate architectural knowledge and best practices among team members.
- Provide guidance and mentorship to junior team members, fostering their professional growth and development.
- 10-15 years of overall technology experience in core application development
- Experience leading development of AI application using Python backend frameworks and multiple inferencing pipelines
- Rapid PoC/Prototyping skills and expertise in building and demoing application without need a developer’s assistance.
- Hands-on expertise of SharePoint indexes and data/file structures (Azure SQL)
- Good knowledge of Azure Form Recognizer for OCR of complex images, forms and other data
- Handson with implementing Task Weaver, Autogen, Agentic Flows, Retrieval Augmented Generation (RAG) and RLHF (Reinforcement Learning from Human Feedback)
- Designing and implementing vector databases on Azure cloud using Ai Search and Cosmos DB vCore
- Sound project implementation level knowledge of Pinecone, FAISS, Weaviate or Chroma DB
- Knowledge of NLP techniques like transformer networks, embeddings, intent recognition etc.
- Hands-on skills on Embedding and finetuning Azure OpenAI using MLOPS/LLMOPS pipelines.
- Strong communication, architectural sketching, and collaboration skills
Apply safely
To stay safe in your job search, information on common scams and to get free expert advice, we recommend that you visit SAFERjobs, a non-profit, joint industry and law enforcement organization working to combat job scams.