Job Description

Overall Responsibilities:

  • Design, implement, and maintain scalable, reliable infrastructure
  • Automate deployment, scaling, and management of applications and services
  • Monitor system health and troubleshoot issues proactively
  • Participate in on-call rotations to ensure uptime and incident management
  • Develop runbooks, best practices, and automation scripts
  • Collaborate with development teams to improve system architecture and reliability
  • Conduct performance tuning and capacity planning
  • Improve observability and monitoring across the stack
  • Document operational procedures and incident post-mortems
  • Software Requirements:

  • Strong experience with cloud platforms such as AWS, GCP, or Azure
  • Proficiency in Linux/Unix system administration
  • Knowledge of scripting languages: Python, Bash, or Go
  • Experien...
  • Ready to Apply?

    Take the next step in your AI career. Submit your application to Synechron today.

    Submit Application