Job Description

Job Description
- Design, operate, and optimize AWS infrastructure in a hybrid cloud environment.
- Improve performance, reliability, and cost efficiency through proactive optimization and capacity planning.
- Perform lifecycle management, scalability improvements, and infrastructure modernization initiatives.
- Act as a senior escalation point for complex infrastructure issues.
Systems Reliability & Operations
- Participate in on-call rotation and lead incident response efforts.
- Own monitoring and alerting using tools such as Cloud Watch and related observability platforms.
- Drive root cause analysis for recurring issues and implement long-term reliability fixes.
- Reduce operational effort through automation and proactive improvements.
Ticket & Service Management
- Monitor, assign, prioritize, and resolve tickets using ITSM tools such as Service Now, Jira, or similar platforms.
- Adhere to SLA, ticket quality standards, documentation requirement...

Ready to Apply?

Take the next step in your AI career. Submit your application to BETSOL today.

Submit Application