Job Description
Key Responsibilities
- Design, build, and operate a geo-distributed, highly available infrastructure to ensure maximum performance and reliability of our services.
- Participate in the full lifecycle of infrastructure and platform projects: from design and capacity planning to implementation and long-term operation.
- Improve reliability and operability practices: fault tolerance, scalability, incident response, postmortems, and bottleneck analysis.
- Automate build, test, and delivery pipelines (CI/CD) and infrastructure provisioning.
- Take part in on‑call rotation and contribute to continuous improvement of operational processes.
- Develop internal tools for service and infrastructure management.
- Maintain and improve network connectivity between multiple data centers and cloud environments (multi‑cloud and on‑prem).
Nice to have
- Experience with Terragrunt.
- Understanding of BGP principl...
Ready to Apply?
Take the next step in your AI career. Submit your application to inDriver today.
Submit Application