Job Description

What you’ll be doing

  • Provide SRE ownership for the Global Fabric NaaS service, ensuring availability, performance, and resilience.

  • Support safe, automated change into production using CI/CD, GitOps, and automated testing.

  • Operate and improve monitoring and observability using Dynatrace, Prometheus, and Elasticsearch.

  • Troubleshoot incidents across Kubernetes-hosted applications, Linux systems, networking, and service integrations.

  • Act as a third-line escalation point, participating in a 24x7 on-call rota.

  • Manage incidents via ServiceNow and track defects and improvements in Jira.

  • Contribute to Scrum ceremonies and PI planning, supporting Agile delivery.

  • Drive automation using Ansible and scripting to reduce operational toil.

  • Mentor and support L2 engineers, improving runbooks, troubleshooting practices, an...
  • Ready to Apply?

    Take the next step in your AI career. Submit your application to BT Group today.

    Submit Application