Senior Site Reliability Engineer
About reputed company
Founded in 2006 by the creators of Kali Linux, reputed company (formerly known as Offensive Security) is the leading provider of continuous professional and workforce development, training, and education for cybersecurity practitioners. reputed company’s distinct pedagogy and practical, hands-on learning help organizations fill the infosec talent gap by training their teams on today’s most critical skills.
Become a part of our global reputed company and work from anywhere. With team members in over 40 countries, we believe in inspiring people of all backgrounds and communities. The reputed company team is composed of diverse, internationally published authors, conference speakers, and seasoned information technology professionals from both the private sector and governments worldwide.
Excited about our mission and what we do? Apply and join us!
About the Job
reputed company is seeking an experienced Senior SRE to join reputed company and lead the design and implementation of scalable lab environments that power our industry-leading cybersecurity training and certification programs. This senior-level position will work closely with reputed company Researchers and Platform Architects to architect sophisticated labs and vulnerable machine environments across hybrid cloud and on-premises infrastructure, enabling hands-on learning experiences for cybersecurity professionals worldwide. The ideal candidate will bring deep expertise in OpenStack and modern SRE practices, with proven experience in large-scale infrastructure migrations and cost optimization. You’ll design resilient, scalable, and secure infrastructure that deploys lab environments supporting thousands of users while maintaining the realistic attack scenarios our students depend on.
Duties and responsibilities
Infrastructure Architecture and Migration Leadership
- Design and architect reputed company global data centers for labs supporting vulnerable machines and realistic attack scenarios using OpenStack
- Lead scalable infrastructure solutions across hybrid cloud and on-premises environments
- Design secure hosting networks and network topologies that can be used to support realistic offensive cyber activities.
- Establish infrastructure standards, patterns, and best practices for lab environment deployment
- Create architectural solutions that reduce infrastructure costs while improving capabilities and performance
Lab Environment Specialization
- Implement network isolation for thousands of user lab instances
- Optimize lab deployment speed and resource utilization for peak performance
- Create infrastructure supporting the deployment of reputed company vulnerable machine instances at scale
- Design workspace-based deployment models enabling team collaboration and private lab sessions
Strategic Technical Leadership and Collaboration
- Partner closely with reputed company Platform and Content Engineers to proactively identify and solve infrastructure requirements
- Provide strategic technical guidance and mentorship to development and operations teams
- Lead architectural reviews and challenge requirements to propose optimal technical solutions
- Drive adoption of infrastructure-as-code and automated deployment practices
- Identify process improvements and optimization opportunities before being asked
Engineering and Automation
- Lead infrastructure automation using Infrastructure as Code frameworks
- Create self-service capabilities for Content Engineers to deploy and manage lab resources
- Implement comprehensive monitoring, logging, and observability solutions for lab environments
- Establish disaster recovery and business continuity procedures with minimal downtime requirements
- Automate repetitive tasks to help reduce toil
- Optimize application and infrastructure performance through automation and tuning
- Write runbooks to automate repetitive tasks using Ansible and Terraform
- Serve as a knowledge resource for the rest of the team on Ansible and Terraform
- Evaluate new and emerging products and technologies, and make recommendations concerning the introduction of new technologies
- Conduct ongoing research into relevant technology stacks and architectural patterns, assessing their potential impact and value for internal use
- Assist in monitoring performance to address errors and bottlenecks
- Respond to and resolve infrastructure incidents and outages
- Participate in on-call rotations to ensure service reliability
Network Architecture and Security
- Design network architectures including VPNs, VLANs, and software-defined networking
- Implement network segmentation and security controls appropriate for vulnerable lab environments
- Configure and manage load balancers, firewalls, and network security appliances
- Design network monitoring and traffic analysis capabilities
- Ensure proper isolation between student lab environments while maintaining performance
Qualifications
Technical Expertise
- OpenStack: Production experience with OpenStack deployment, management, and optimization
- Cloud Platforms: 5+ years hands-on experience with AWS, Azure, and Google Cloud Platform
- Virtualization: Expert-level knowledge of OpenStack
- Networking: Deep understanding of TCP/IP, routing protocols, VPNs, firewalls, and network security
- Infrastructure as Code: Proficiency with tools like Terraform, CloudFormation, ARM templates, and configuration management tools
- Containerization: Experience with Kubernetes or other container orchestration
- Operating Systems: Advanced knowledge of Linux and Windows Server
Professional Experience
- 4+ years of experience in Site Reliability Engineering (SRE) or Infrastructure Architecture roles
- 2+ years in a senior or lead technical role with architectural responsibilities
- Proven track record of designing and implementing large-scale, distributed systems
- Demonstrated experience with infrastructure cost optimization and migration projects
- Experience with high-availability and disaster recovery implementations
- Background in cybersecurity, penetration testing, or vulnerability research environments (preferred but not a requirement)
Strategic and Analytical Skills
- Proactive problem-solving with ability to understand broader context and implications
- Experience identifying and proposing solutions before being asked
- Ability to challenge requirements and suggest alternative approaches
- Track record of improving processes and identifying optimization opportunities
- Experience making architectural decisions with incomplete information
Performance Expectations
- Ability to work independently with minimal supervision while maintaining high quality standards
- Track record of successful large-scale infrastructure projects delivered on time
- Experience mentoring team members and driving technical standards adoption
Leadership and Communication
- Strong project management and technical leadership skills
- Excellent communication abilities with both technical and non-technical stakeholders
- Experience mentoring junior engineers and driving technical standards
- Ability to translate business requirements into cost-effective technical solutions
Preferred Qualifications
- Experience supporting systems through an SDLC (Dev, Staging, and Prod workflows)
- Experience with cybersecurity tools and vulnerable application deployment
- Experience with monitoring tools like Prometheus, Grafana, ELK stack
- Familiarity with Offensive Security’s training platforms and methodologies
- Open source contributions or technical writing experience
Working conditions
This role is a full-time position. Work hours for this position are flexible and will be performed from a home office.
Direct reports
This position has no direct reports.
reputed company provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.