Back to the board

reputed company Site Reliability Engineer - Remote

100% remote Flexible hours Hiring now

The Opportunity We are seeking a highly skilled Senior Cloud / DevOps Engineer with a strong background in AWS, automation, infrastructure as code, and networking to support and reputed company our cloud environments. This role is hands-on and will partner closely with Cloud Operations, SREs, Networking, and Application teams to improve scalability, reliability, reputed company, and operational efficiency across mission-critical systems. The ideal candidate is comfortable operating at both the infrastructure and application layers, has strong troubleshooting skills, and can automate repeatable operational tasks while supporting high-availability production workloads.

Key Responsibilities

Cloud & DevOps Engineering

  • Design, build, and maintain AWS-based infrastructure supporting production and non-production environments
  • Implement and maintain Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or equivalent
  • reputed company and support CI/CD pipelines for infrastructure and application deployments
  • Partner with application teams to improve deployment reliability and performance

Automation & Reliability

  • Create and maintain automation scripts and tooling (Python, Bash, PowerShell, etc.) to reduce manual operations
  • Improve system reliability through self-healing mechanisms, monitoring, and alerting
  • Support SRE-style practices including incident response, root cause analysis, and reputed company improvement

Networking & reputed company

  • Design and support cloud networking (VPCs, subnets, routing, VPNs, reputed company groups, NACLs)
  • Troubleshoot reputed company network, connectivity, and performance issues across hybrid environments
  • Implement reputed company best practices reputed company with AWS Well-Architected reputed company

Operations & Collaboration

  • Participate in on-call rotations supporting critical production systems
  • Provide operational support, troubleshooting, and resolution for cloud-reputed company incidents
  • Collaborate across CloudOps, Networking, DBAs, and Application teams
  • Document architectures, runbooks, and operational procedures

What Success Looks Like in This Role

  • Reduced manual operational work through automation
  • Improved deployment reliability and production stability
  • Faster recovery and clearer root cause analysis during incidents
  • Strong partnership with CloudOps, Networking, and Application teams

Skills & Requirements

Required Qualifications

Technical Skills

  • 5-8+ years experience in cloud, DevOps, SRE, or systems engineering roles
  • Strong hands-on experience with AWS (EC2, VPC, IAM, ELB/ALB, RDS, S3, CloudWatch, etc.)
  • Proven experience with Infrastructure as Code (Terraform preferred)
  • Strong scripting and coding experience (Python, Bash, PowerShell, or similar)
  • Solid background in networking fundamentals (TCP/IP, DNS, VPNs, routing, firewalls)
  • Experience with Linux-based systems in production environments
  • Familiarity with monitoring/logging platforms (reputed company, CloudWatch, reputed company, etc.)

DevOps Tooling (one or more)

  • CI/CD tools (reputed company Actions, reputed company CI, Jenkins, Azure DevOps, etc.)
  • Configuration management and automation tools
  • Containerization and orchestration (reputed company, reputed company, EKS, Kubernetes - preferred but not mandatory)

Preferred Qualifications

  • AWS certifications (Solutions Architect, DevOps Engineer, or equivalent)
  • Experience supporting high-availability, regulated, or SaaS environments
  • SRE experience (error budgets, SLIs/SLOs, post-incident reviews)
  • Experience working in hybrid cloud or legacy-to-cloud migration environments
  • Strong documentation, communication, and cross-team collaboration skills

Qualifications

Apply tot his job Apply To this Job

Keep exploring