Job Description
The Sr. SRE will be responsible for the reliability, scalability, and performance of systems supporting classified government projects in an air-gapped deployment. This role leverages advanced monitoring and DevOps tools to ensure uptime and compliance in a disconnected environment.
Key Responsibilities
- Design and maintain highly reliable systems using RKE2, Kubernetes, Ingress, Kong, Artifactory, and Sonar.
- Implement observability solutions with Prometheus, Grafana, Splunk, and Elastic to monitor system health in an air-gapped setting.
- Ensure compliance and performance optimization across multi-tenant deployments.
- Conduct code quality analysis and security assessments using Sonar.
- Collaborate with the Lead and Infra/Security Specialists to resolve incidents and improve system resilience.
- Develop and maintain documentation for system configurations and recovery procedures in a classified environment. <...
Ready to Apply?
Take the next step in your AI career. Submit your application to Orion Innovation today.
Submit Application