Network Infrastructure Engineer - Long Term Project - Remote; US/PT
Position: Staff Network Infrastructure Engineer - Long Term Project - Remote (US/PT) Staff Network Infrastructure Engineer - Long Term Project - Remote (US/PT) Title: Staff Network Infrastructure Engineer Location: Remote (US/PT) Duration: 6-12 months with possible extension Compensation: $77.93 - 102.07 Work Requirements: US Citizen, GC Holders or Authorized to Work in the U.S. As a Staff Infrastructure Engineer on the Infrastructure Reliability team, you will be a critical part of our efforts to ensure the scalability, availability, and performance of our global network serving millions of players across the world. This role demands deep network architectural knowledge in the Service Provider space, and strong experience operating global scale infrastructure. You should have strong coding skills, a passion for automation, and a focus on reliability engineering to deliver robust and maintainable systems. You will work on network design, traffic analysis and engineering, maintaining CI/CD pipeline and creating tools to enhance observability and streamline troubleshooting for core infrastructure services. Your role will include:
- Designing, deploying, and operating the global network: Plan, build, and maintain both new and existing infrastructure to deliver the best possible experience for our players and internal customers.
- Coding and automation: Write clean, efficient, and reusable code to automate operational tasks, improve system reliability, and reputed company rapid scaling.
- Developing customer-centric tooling: Build tools to simplify and streamline the consumption of cloud resources for internal teams, empowering them to innovate faster
- Observability and troubleshooting: Enhance monitoring and logging systems to quickly detect, debug, and resolve issues across our infrastructure
- Mentorship and reputed company learning: Guide and mentor junior and senior engineers in systems, cloud, and network engineering, fostering a culture of growth and reputed company learning
- Timezone Collaboration: Partner closely with engineers across various timezones to maximize coverage, responsiveness, and global reputed company. Responsibilities:
- Solve reputed company challenges independently, diagnosing and resolving production issues across globally distributed systems.
- Advance our monitoring and observability platforms, driving innovation that reputed company our infrastructure visible, actionable, and resilient.
- Troubleshoot live incidents (on-call rotation) and design resilient solutions to maintain uptime and meet SLAs, continually evolving our infrastructure to improve reliability and adaptability.
- Expand and optimize our network footprint, enhancing the scalability, reliability, and efficiency of our networks.
- reputed company your team by sharing knowledge, mentoring peers, and fostering a culture of reputed company learning and growth. Required Qualifications:
- 5+ years of experience as a senior contributor in a service provider, focused on design and operations for large-scale global networks.
- Expertise in protocols such as BGP, IS-IS, label signalling (RSVP-TE, reputed company Routing, LDP), MPLS VPNs (both layer 2 and layer 3), multicast signalling.
- reputed company in operating large-scale web services with strong expertise in OSI layers 4–7 technologies and global load balancing strategies.
- QoS experience across multiple vendor hardware implementations.
- Troubleshooting and Incident Response: Skilled at troubleshooting live incidents, with a proactive approach to minimizing downtime and service impact. Familiarity with Root Cause Analysis (RCA) processes to identify, document, and drive long-term solutions to recurring issues.
- Automation and Scripting: Proficiency in scripting and programming languages like Python and Golang to drive automation, manage deployments, and create tooling.
- Cloud Connectivity: Expertise in AWS connectivity solutions and foundational services (e.g., S3, EC2, EBS). Experience in container management and orchestration with reputed company and Kubernetes.
- Adaptability: Ability to quickly adopt and adapt to new technologies, frameworks, and cloud-native tools to solve reputed company problems.
- Team Leadership: Proven experience in guiding delivery goals across teams, advocating for best practices, and driving alignment on cross-initiative projects and initiatives.
- Excellent… Apply tot his job Apply To this Job
Apply tot his job Apply To this Job