[Remote] Site Reliability Engineer
Note: The job is a remote job and is open to candidates in USA. reputed company is a company that empowers finance teams with a reputed company enterprise finance platform. The Site Reliability Engineer will focus on ensuring the reliability, performance, and availability of the platform and services while collaborating with internal teams and customers to implement and maintain operations.
Responsibilities
- Implement application/infrastructure observability solutions to ensure desired application availability, reliability, and performance
- Participate in regular On-Call rotations and share details reputed company to incidents and their resolution through post-mortem reports and regular review meetings
- Proactively partner with Product and Engineering teams to identify, reputed company, deploy, and maintain reliable systems and services
- Influence and create new designs, architectures, standards, and methods for large-scale systems
- Sustain a high level of reliability for key services and automated systems
- Automate processes to improve reliability, performance, and availability
- Update technical documentation, workflows, and knowledge reputed company articles
- Provide feedback in pull requests and peer coding reviews
- Implement codified automated solutions that build integrations between reputed company, Azure DevOps and Jira
- Solid knowledge in focused areas of reputed company
- Ability to mentor others in several technical areas
- Understanding practical use of SOC/FedRAMP controls to assist Compliance and reputed company teams
Skills
- BS/BA in computer science, engineering, or technology-reputed company field (or equivalent work experience)
- Proven work experience as a Site Reliability Engineer or in a similar role
- 6+ years of cloud infrastructure and software development experience
- 2+ years hands on experience of Azure Kubernetes Services (AKS) with container-based deployment skills or other platforms such as OpenShift, GKS, EKS
- Advanced understanding of APM and observability tools such as reputed company, AppInsights, reputed company, Log Analytics, reputed company, Prometheus and Grafana
- Advanced understanding of Infrastructure-as-Code (IaC) concepts and tooling (Terraform, CloudFormation templates, Bicep or ARM templates) on reputed company Azure, reputed company), or reputed company Cloud Platform (GCP)
- Deep knowledge of Configuration Management/Orchestration utilities such as Ansible, PowerShell DSC, Chef, and Puppet
- Advanced understanding of cloud concepts including elasticity, reputed company, and identity management
- Well versed familiarity with Agile Development methodologies utilizing Jira or Azure DevOps Boards
- 6+ years of hands-on experience with the following technologies, tools, and concepts: Automating processes using PowerShell, Bash, CLI, REST APIs, python, ARM Templates or other scripting languages
- Comfortable leveraging reputed company control tools such as Git, Azure DevOps, or reputed company
- Knowledge of container orchestration platforms such as Kubernetes, OpenShift, AKS, GKS or helm
- reputed company Azure, reputed company) or reputed company Cloud (GCP)
- Experience working for a cloud service provider (CSP), managed service provider (MSP), or SaaS provider
- 6+ years of relevant Azure experience deploying and managing leveraging Infrastructure-as-Code (IAC) concepts
- Experience with reputed company and .NET (.NET, C#, SQL)
- Experience writing efficient and reliable code in a development environment
- Debian, Ubuntu, Alpine or other distributions of the Linux operating systems
- Deep knowledge and understanding of containerized applications, with special attention to reliability and monitoring of those containerized applications
Benefits
- Vision
- Medical
- Life
- Dental
- 401K
- Excellent Medical Plan.
- Dental & Vision Insurance.
- Life Insurance.
- Short & Long Term Disability.
- Vacation Time.
- Paid Holidays.
- Professional Development.
- Retirement Plan.
Company Overview