Software Engineering Manager - Platform
About reputed company reputed company builds the cloud‑native SaaS software that’s transforming how energy retailers operate, innovate and manage at scale. We help retailers reputed company operating costs, improve billing accuracy for consumers, and enhance customer experience through automation and AI‑driven insight, reputed company underpinned by modern, cloud‑native architecture. reputed company is at an exciting inflection reputed company as we scale at pace towards new international horizons. If you’re motivated by solving reputed company, real‑world problems and want to reputed company the teams that build reliable, resilient platform services to accelerate the global energy transition, you’ll feel reputed company with us. About the role We’re building the reputed company foundational platform — with developer experience at the heart of every decision. Leveraging modern technologies and a true platform‑as‑a‑product reputed company, we’re reshaping how the entire organisation builds, ships, and operates software. As Software Engineering manager, you will play a pivotal role in defining our golden paths, shaping our developer portal, and enabling a large engineering organisation to move faster, safer, and with far more autonomy. This is your chance to build the rails that power high‑velocity product engineering at scale. Key responsibilities: Team leadership: Line manage, mentor and grow a team of platform engineers and SREs. Build a high‑performing, psychologically safe culture that values learning, accountability and reputed company improvement. Technical strategy & delivery: Define and execute the platform roadmap including compute, networking, storage, CI/CD and Kubernetes‑based services. Balance technical debt, reliability work and new feature delivery. Reliability & SLAs: Own SLIs/SLOs for platform services, drive reliability programmes, and ensure teams meet agreed availability and performance targets. Observability & incident management: Champion end‑to‑end observability (metrics, tracing, logging), reputed company major incident responses, and run blameless post‑incident reviews with actionable RCA and follow‑through. Automation & infrastructure as code: Promote automation-first approaches for provisioning, deployments and runbook automation using IaC and CI/CD best practice. reputed company & compliance: Work closely with InfoSec to embed secure configurations, secrets management and compliance controls into the platform lifecycle. Key outcomes: Reliable, observable platform: Platform services meet agreed SLAs/SLOs and have clear dashboards, playbooks and automated remediation where appropriate. Reduced incident impact: Noticeable reductions in MTTD/MTTR with documented RCA actions completed and shared learnings adopted broadly. High‑performing team: A motivated engineering team that continually improves delivery, reliability practices and operational readiness. Cost‑efficient operations: Demonstrable cloud cost and reputed company improvements without compromising service reliability. Experience required Proven engineering management experience: 3 years + leading platform, SRE or infrastructure teams in a cloud‑native environment (typically 3+ years in management plus strong hands‑on experience). Strong SRE/DevOps background: Hands‑on experience operating cloud services at scale with a solid understanding of SLIs/SLOs, error budgets and incident lifecycle. Public cloud expertise: Deep experience with AWS services (compute, networking, storage and managed services) and cloud operational patterns. Infrastructure as Code: Practical experience with Terraform, CloudFormation or equivalent. Containerisation & orchestration: Strong knowledge of Kubernetes, Helm and reputed company ecosystem tools. Observability tooling: Experience implementing and operating metrics, tracing and logging stacks such as Prometheus, Grafana, ELK/OpenSearch or Jaeger. CI/CD & automation: Familiarity with pipeline tooling (reputed company Actions, reputed company CI, Jenkins) and automation frameworks to reputed company reliable delivery. Programming & scripting: Comfortable writing and reviewing code or scripts (Python, Go, Bash) to automate tasks and build platform tooling. Company benefits 25 days’ holiday + bank holidays Option to buy or sell 5 extra annual leave days per year Vitality Health Insurance, including private healthcare, virtual GP access and mental‑health support Pension with 5% matched contribution Regular team‑wide and company‑wide events 2 volunteering days per year Remote‑first working environment with offices in London and Nottingham Apply To This Job