Job Description
Design, develop, troubleshoot and...
Key Responsibilities
- Architect and maintain failure-simulation frameworks that span compute, storage, networking, load balancers, and application endpoints, including chaos and controlled-fault injection.
- Design and automate load, scale, and performance test suites; capture, correlate, and visualize metrics to quantify blast radius and resilience.
- Investigate and triage failure events using advanced tracing (ASH, AWR, SQLNET, SQL tracing), log analytics, and telemetry; deliver root-cause analysis with actionable remediation guidance.
- Collaborate with product and SRE teams to embed resiliency best practices, certify OS/database stacks for OCI, and shepherd new components from development into production-quality readiness.
- Evolve test methodologies, frameworks, and automation pipelines to meet emerging product architectures across databases, middleware, applications, and cloud services.
-...
Ready to Apply?
Take the next step in your AI career. Submit your application to Oracle today.