Job Description

Job description
REQUIREMENTS:
Total experience of 6 years+
Strong expertise in Python and backend engineering with experience building scalable, distributed microservices.
Hands-on experience designing and delivering end-to-end RAG (Retrieval-Augmented Generation) workflows in production systems.
Solid understanding of ML solution design, including embeddings, retrieval, ranking, feature engineering, and evaluation strategies.
Experience with vector databases (FAISS, Pinecone, Milvus, Weaviate) and implementing chunking, indexing, vector search, re-ranking, caching, and memory patterns.
Knowledge of LLM/NLP engineering, including prompt engineering, model integration, orchestration tools (Lang Chain/Llama Index), and evaluation instrumentation.
Experience productionizing ML systems with observability, online/offline parity, and performance optimization across latency, throughput, and cost.
Strong backend integration skills using REST/g RPC APIs, Docker, Kuber...

Ready to Apply?

Take the next step in your AI career. Submit your application to BrainWave Professionals today.

Submit Application