Job Description
What You Will Be Doing - Design and implement production-ready generative AI applications that serve millions of users, from initial architecture through deployment and monitoring - Build advanced RAG (Retrieval-Augmented Generation) pipelines that combine vector databases, hybrid search, and intelligent caching to deliver sub-second response times - Develop multimodal AI systems that seamlessly integrate text, vision, and audio capabilities using state-of-the-art models - Architect scalable microservices that handle thousands of concurrent AI requests while optimizing for cost, latency, and reliability - Lead code reviews and technical design sessions, establishing best practices and architectural patterns that elevate the entire team's capabilities - Optimize large language models through fine-tuning techniques to achieve domain-specific performance improvements - Implement comprehensive MLOps practices including automated testing, model versioning, A/B testing frameworks, and real-t...
Ready to Apply?
Take the next step in your AI career. Submit your application to Glance today.
Submit Application