You are viewing a preview of this job. Log in or register to view more details about this job.

Generative AI Engineer

Role Overview 

Build and deploy generative AI solutions using LLM fine-tuning, RAG architectures, and agentic AI systems.  

Responsibilities 

  • Fine-tune large language models for domain-specific applications and use cases 
  • Design and implement RAG (Retrieval Augmented Generation) pipelines with vector databases 
  • Build agentic AI systems with autonomous decision-making and tool-using capabilities 
  • Optimize LLM performance through prompt engineering, parameter tuning, and model selection 
  • Develop and deploy production-ready GenAI applications and APIs 
  • Evaluate model performance, conduct A/B testing, and iterate on solutions 
  • Collaborate with cross-functional teams to integrate GenAI into products 

Requirements 

  • Bachelor's degree in Computer Science, Engineering, AI/ML, or related field 
  • Strong experience with LLM fine-tuning (LoRA, QLoRA, full fine-tuning, PEFT methods) 
  • Proven expertise building RAG systems with vector databases and embeddings 
  • Hands-on experience developing agentic AI solutions and multi-agent systems 
  • Proficiency with LLM frameworks (LangChain, LlamaIndex, Haystack) 
  • Experience with vector databases (Pinecone, Weaviate, ChromaDB, FAISS) 
  • Strong Python programming skills 
  • Understanding of transformer architectures and LLM APIs 

Preferred 

  • Experience with distributed training and GPU optimization 
  • Knowledge of MLOps, model deployment, and monitoring 
  • Familiarity with Hugging Face ecosystem 
  • Experience with prompt optimization and evaluation frameworks 
  • Understanding of LLM safety, alignment, and guardrails 
  • Background in NLP or deep learning research