Job Description

Job Description

  • Design, operate, and optimize AWS infrastructure in a hybrid cloud environment.
  • Improve performance, reliability, and cost efficiency through proactive optimization and capacity planning.
  • Perform lifecycle management, scalability improvements, and infrastructure modernization initiatives.
  • Act as a senior escalation point for complex infrastructure issues.


Systems Reliability & Operations

  • Participate in on-call rotation and lead incident response efforts.
  • Own monitoring and alerting using tools such as CloudWatch and related observability platforms.
  • Drive root cause analysis for recurring issues and implement long-term reliability fixes.
  • Reduce operational effort through automation and proactive improvements.


Ticket & Service Management

  • Monitor, assign, prioritize, and resolve tickets using ITSM tools such as ServiceNow, Jira, or similar ...

Ready to Apply?

Take the next step in your AI career. Submit your application to BETSOL today.

Submit Application