Job Description

The Senior Site Reliability Engineer (Sr. SRE) is responsible for the reliability, scalability, and performance of systems supporting classified government projects in an air-gapped deployment. The role uses monitoring and DevOps tooling to maintain uptime and compliance in a disconnected environment.

Key Responsibilities

  • Design and maintain highly reliable systems using RKE2, Kubernetes, Ingress, Kong, Artifactory, and Sonar.
  • Implement observability solutions with Prometheus, Grafana, Splunk, and Elastic to monitor system health in an air-gapped setting.
  • Ensure compliance and performance optimization across multi-tenant deployments.
  • Conduct code quality analysis and security assessments using Sonar.
  • Collaborate with the Lead and Infra/Security Specialists to resolve incidents and improve system resilience.
  • Develop and maintain documentation for system configurations and recovery procedures in a classified environment. <...

Ready to Apply?

Take the next step in your AI career. Submit your application to Orion Innovation today.
