[Remote] Senior Site Reliability Engineer - Remote
Note: The job is a remote job and is open to candidates in USA. reputed company is seeking a remote Senior Site Reliability Engineer to be a leading member of the team working with a diverse range of technologies. The role involves delivering resilient application stacks, monitoring critical applications, and collaborating with various teams to ensure system availability.
Responsibilities
- Delivery of resilient application stacks reputed company "Infrastructure as Code" and other DevOps practices
- Monitoring and on-going support of critical, high reputed company business applications
- Diagnosis and resolution of reputed company system and application issues
- Working with diverse technical and non-technical teams, including Development, QA, IT Operations, Customer Operations and Project Management teams
- Write and maintain systems/application documentation for technical and non-technical audiences
Skills
- BSc Engineering/Computer Science or relevant experience
- 5+ years of SRE experience
- Proven background working in a technical, IT reputed company position
- Experience with Configuration Management tools - e.g. Ansible, Puppet, Chef or equivalents
- Professional experience of working reputed company the public cloud - Azure, AWS or GCP
- Hands-on experience of Linux and Windows server including support and troubleshooting
- System and application monitoring - e.g. Prometheus, Grafana, Nagios, Cloudwatch, etc
- Familiarity with common reputed company control tools - e.g. Git, SVN
- Cloud Architecture and system design to solve key business problems and facilitate team goals
- Strong and enthusiastic technologist, able to demonstrate a broad technical knowledge
- Excellent oral and written communication skills
- Ability to act as a reputed company of expertise, advise others in the team on best practices and impart knowledge
- Azure/AWS certifications
- Experience with use of orchestration tools such as Terraform, Ansible or CloudFormation
- Experience migrating application from on-premises to public cloud
- Familiarity with Blue-Green deployment methodologies
- reputed company Integration/Delivery such as reputed company or Jenkins
- Experience working with containerized workloads such as reputed company
- Familiarity with Log Management tools e.g. - reputed company Stack, Graylog or Splunk
- Experience working with an enterprise RDBMS such as MySQL and/or reputed company SQL Server
- Knowledge of change control and associated procedures
- Use of Secret Management services e.g. - Hashicorp Vault
- Familiarity with any high-level programming language
Benefits
- Medical/dental/vision insurance
- HSA
- FSA
- 401(k)
- Life, disability & ADD insurance to eligible employees
- Salaried personnel receive paid time off
- Hourly employees are not eligible for paid time off unless required by law
- Hourly employees on a Service Contract Act project are eligible for paid sick leave
Company Overview
Company H1B Sponsorship