Job Description

We are looking for a highly skilled Site Reliability Developer 3 (SRD3) to join our Observability team, who is responsible for designing, building, and operating large-scale infrastructure monitoring and observability platforms across the globe. Solve complex problems related to infrastructure on premise, cloud services and build automation to prevent problem recurrence. Facilitate service capacity planning, solution performance analysis, system tuning and remediation.

  • Design, develop, and operate large-scale observability and infrastructure monitoring platforms across network, compute, storage, and virtualization layers.
  • Build and maintain monitoring solutions using OpenNMS, OpenSearch/Elastic, Logstash, Kafka, Grafana, and related tools.
  • Develop and maintain automation frameworks using Python, Ansible, Chef, and scripting on Unix/Linux platforms.
  • Design and implement CI/CD pipelines using Jenkins, GitHub, and container-based workflows.
  • A...
  • Ready to Apply?

    Take the next step in your AI career. Submit your application to Oracle today.

    Submit Application