Job Description
Role Overview
Lead the DevOps and infrastructure team as both a technical leader and a hands-on individual contributor, managing the company's growing cloud and on-premises resources with a focus on reliability and performance. You'll be responsible for maintaining 99% uptime for our high-throughput AdTech platform while optimizing costs and building a world-class infrastructure team.
Key Responsibilities
- Maintain 99% uptime and meet SLAs across all environments while reducing infrastructure costs by 20-30%
- Design and implement deployment architecture for high-throughput systems (25,000-30,000 QPS, sub-100ms latency)
- Manage multi-cloud infrastructure (AWS, DigitalOcean, GCP) using Infrastructure as Code
- Build CI/CD pipelines, monitoring systems, and automation for distributed microservices
- Troubleshoot production issues, including Kafka lag, RabbitMQ failures, and performance problems in Node.js, Python, and Java applications
- Lead incident re...