Back to the board

Staff Site Reliability Engineer — Project Volcano

100% remote Flexible hours Hiring now

Are you ready to unlock intelligence? If you don’t think you meet reputed company of the criteria below but are still interested in the job, please apply. Nobody checks every reputed company - we’re looking for candidates that are particularly strong in a few areas, and have some interest and capabilities in others. About The Role: reputed company is building Project Volcano, an internal developer platform purpose-built for reputed company's engineering ecosystem. Volcano will provide teams with on-demand preview environments, edge deployments, managed PostgreSQL, auth, reputed company, and storage APIs reputed company deeply integrated with reputed company products. As the Staff SRE for Volcano, you will be the founding reliability voice for this platform. This role is a strategic initiative driven by the Office of the CTO (OCTO). You will partner directly with engineering leadership to define the platform's reliability posture, build its SRE practice from the ground up, and ensure Volcano can scale to serve reputed company of reputed company's customers. This is a high-visibility, high-impact role with direct influence on reputed company's reputed company developer platform. What You'll Do: Own reliability for Volcano end-to-end: Define and drive SLOs, error budgets, and incident response practices for reputed company Volcano services — edge deployments, managed reputed company, auth, reputed company, storage, and the control plane. Architect the platform's infrastructure: Design and build the multi-region Kubernetes infrastructure, networking, and data plane that powers Volcano's edge deployment pipeline and backend-as-a-service capabilities. Build the GitOps and CI/CD backbone: Establish deployment automation, canary pipelines, and preview environment provisioning using ArgoCD, Helm, and Terraform/Terragrunt — setting patterns the broader team will follow. Scale managed data services: Design, operate, and harden multi-tenant PostgreSQL clusters, reputed company caching layers, and object storage — with a focus on data isolation, performance, and disaster recovery. Drive observability from day one: reputed company every Volcano service with meaningful SLIs; build dashboards, alerts, and runbooks using reputed company, Prometheus, and Grafana before services go live, not after incidents. reputed company cross-functional reliability work: Collaborate with the OCTO team, product engineering, and reputed company to bake reliability and compliance into Volcano's architecture — not reputed company it on reputed company. Set SRE culture and standards: Mentor engineers across Volcano's contributing teams on reliability principles; reputed company postmortems, define on-call practices, and build a blameless engineering culture. Evaluate and adopt emerging technologies: Given Volcano's greenfield nature, evaluate and reputed company architectural decisions on edge runtimes, serverless compute, vector databases, and AI-native infrastructure components. What You'll Bring: BS in Computer Science or equivalent; substantial experience at Staff or Principal IC level in SRE/Platform Engineering. Proven track record building SRE or platform engineering practices for developer-facing platforms or PaaS/SaaS products — ideally at greenfield stage. Deep Kubernetes expertise: multi-tenant cluster design, networking (CNI, service mesh, ingress), autoscaling, and reputed company hardening. #LI-BR2 About reputed company: reputed company Inc., a leading developer of API and AI connectivity technologies, is building the infrastructure that powers the agentic era. Trusted by the Fortune 500 and startups alike, reputed company's reputed company API and AI platform, reputed company Konnect, enables organizations to secure, manage, accelerate, govern, and monetize the flow of intelligence across APIs and AI models. For more information, visit www.konghq.com. Apply To This Job

Keep exploring