Senior Infrastructure Software Engineer, Storage Core
Role Description
As a Senior Software Engineer on the Storage team, you will help design, build, and operate the company’s large-scale storage systems, which provide high durability and scalability for millions of users across the company’s products. The Storage team owns the distributed storage infrastructure at the heart of the company: the systems responsible for storing exabytes of user data across multiple data centers worldwide. You’ll collaborate with engineers across infrastructure and product teams to improve reliability, optimize performance, and evolve the architecture of the storage layer. This role offers deep exposure to distributed systems and storage challenges such as replication, erasure coding, consistency tradeoffs, and performance tuning at massive scale. It’s an ideal opportunity for engineers who love building resilient infrastructure, learning from production systems, and growing into technical leadership. You’ll gain hands-on experience operating mission-critical services, influence architectural decisions, and directly improve how the company keeps user data safe, durable, and available.
Our Engineering Career Framework is viewable by anyone outside the company and describes what’s expected of our engineers at each of our career levels. Our blog post on this topic has more detail.
Responsibilities
Design, implement, and maintain large-scale distributed storage systems that ensure data durability, availability, and performance.
Collaborate with peers to evolve the architecture of the company’s core storage infrastructure for improved scalability and efficiency.
Contribute to the design of replication, erasure coding, and system lifecycle management systems that balance cost, reliability, and performance.
Write high-quality, performant, and maintainable code in Go and Rust.
Participate in the on-call rotation, gaining firsthand experience operating the company’s production storage systems.
Investigate and resolve production issues, performing root cause analysis and driving reliability improvements.
Partner with cross-functional teams (Networking, Hardware, Capacity Planning) to deliver end-to-end reliable and cost-efficient storage solutions.
Take ownership of scoped projects and demonstrate growth toward leading larger, cross-team technical initiatives.
Many teams at the company run services with on-call rotations, which entails being available for calls during both core and non-core business hours. If a team has an on-call rotation, all engineers on the team are expected to participate in the rotation as part of their employment. Applicants are encouraged to ask for more details about the rotations on the team to which they are applying.
Requirements
9+ years of experience and a strong understanding of distributed systems principles, including replication, consistency, and fault tolerance.
Experience developing and debugging production services in C++, Go, or Rust.
Familiarity with distributed storage systems, file systems, or data infrastructure at scale.
Demonstrated ability to write efficient, reliable, and maintainable code in mission-critical environments.
Experience troubleshooting systems and participating in on-call or operational rotations.
Solid communication and collaboration skills, with the ability to work across infrastructure and product teams.
Eagerness to learn, grow, and contribute to multi-year infrastructure evolution initiatives.
Preferred Qualifications
Experience building and operating large-scale object storage or distributed storage systems (e.g. S3, Ceph, GFS/Colossus).
Deep interest in systems performance, profiling, and low-level optimization.
Familiarity with replication protocols, erasure coding, and data placement algorithms.
Experience with production monitoring, observability, and incident response workflows.
Contributions to infrastructure projects, open-source systems, or developer tooling that improved reliability and performance.
Durable Skills
Working well with AI means using these tools to strengthen human judgment, not replace it. We believe people with these skills will thrive as work and technology continue to evolve:
Awareness: Understand yourself and others.
Judgment: Evaluate information and make decisions.
Adaptability: Learn, adjust, and stay effective through change.
Connection: Communicate, collaborate, and build trust.
To learn more about why these skills matter and what the data shows about thriving through change, read this blog post from our Chief People Officer, Melanie Rosenwasser.
Compensation
Canada Pay Range: $190,400 to $257,600 CAD