Job Description

Job Description

  • Design, operate, and optimize AWS infrastructure in a hybrid cloud environment.
  • Improve performance, reliability, and cost efficiency through proactive optimization and capacity planning.
  • Perform lifecycle management, scalability improvements, and infrastructure modernization initiatives.
  • Act as a senior escalation point for complex infrastructure issues.


Systems Reliability & Operations

  • Participate in on-call rotation and lead incident response efforts.
  • Own monitoring and alerting using tools such as CloudWatch and related observability platforms.
  • Drive root cause analysis for recurring issues and implement long-term reliability fixes.
  • Reduce operational effort through automation and proactive improvements.


Ticket & Service Management

  • Monitor, assign, prioritize, and resolve tickets using ITSM tools such as ServiceNow, Jira, or...

Ready to Apply?

Take the next step in your AI career. Submit your application to BETSOL today.

Submit Application