Job Description

At AIA we’ve started an exciting movement to create a healthier, more sustainable future for everyone.

If you believe in developing a better tomorrow, read on. 

About the Role

System Reliability Engineer (SRE) is responsible to ensure our cloud application systems are reliable and available to users. The SRE will supervise application systems and establish automated detections, root cause analysis, and formulate preventive actions. They will gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding. They will partner with development teams to improve services.

Functional Duties:

  • Set up and maintain monitoring of infrastructure and application

  • Build alerts and auto recovery for various operational issues

  • Capture and analyze metrics from operating systems as well as applications

  • Advise in performance tuning and fault find...

  • Ready to Apply?

    Take the next step in your AI career. Submit your application to AIA today.

    Submit Application