Job Description
We are looking for a Lead Site Reliability Engineer to drive the stability, performance, and security of our infrastructure—spanning internal developer tools like GitLab to high‑traffic, client‑facing platforms such as Client websites. In this role, you’ll lead reliability efforts, guide a growing SRE team, and help shape the systems that power our digital operations. If you're passionate about building scalable systems, mentoring engineers, and making critical infrastructure decisions, we’d love to connect with you.
RESPONSIBILITIES
- Help design, build, and maintain systems that stay fast, secure, and reliable at scale
- Monitor servers, tools, and infrastructure daily to make sure everything is running properly
- Respond to system issues or outages and help prevent them from happening again
- Install and configure new servers or rebuild existing ones when needed
- Review security risks and recommend improvements to keep sy...
Ready to Apply?
Take the next step in your AI career. Submit your application to Taocrowd today.
Submit Application