Junior Site Reliability Engineer | Remote US

100% remote | Flexible hours | Hiring now

About the position

As a Junior Site Reliability Engineer in our Managed Services (CMS) group, you will be a self-starter who is passionate about cloud technology and thrives on problem solving. You will work across major public clouds, using automation and your technical abilities to operate the most cutting-edge offerings from Cloud Service Providers (CSPs). This role directly supports leading cloud software companies in delivering reliable, scalable SaaS products to the largest enterprises and government agencies around the world. This can be a remote position (must be located in the United States).

Responsibilities

Become a member of a highly collaborative engineering team offering a unique blend of Cloud Infrastructure Administration, Site Reliability Engineering, Security Operations, and Vulnerability Management across multiple clients.
Coordinate with client product teams, engineering team members, and other stakeholders to monitor and maintain a secure and resilient cloud-hosted infrastructure to established SLAs in both production and non-production environments.
Innovate and implement using automated orchestration and configuration management techniques. Understand the design, deployment, and management of secure and compliant enterprise servers, network infrastructure, boundary protection, and cloud architectures using Infrastructure-as-Code.
Create, maintain, and peer review automated orchestration and configuration management codebases, as well as Infrastructure-as-Code codebases. Maintain IaC tooling and versioning within Client environments.
Implement and upgrade client environments with CI/CD infrastructure code and provide internal feedback to development teams for environment requirements and necessary alterations.
Work across AWS, Azure and GCP, understanding and utilizing their unique native services in client environments.
Configure, tune, and troubleshoot cloud-based tools, and manage cost, security, and compliance for the Client's environments.
Monitor and resolve site stability and performance issues related to functionality and availability.
Work closely with client DevOps and product teams to provide 24x7x365 support to environments through Client ticketing systems.
Support definition, testing, and validation of incident response and disaster recovery documentation and exercises.
Participate in on-call rotations as needed to support Client critical events and operational needs that may fall outside of business hours.
Support testing and data reviews to collect and report on the effectiveness of security and operational measures, in addition to remediating deviations from those measures.
Maintain detailed diagrams representative of the Client's cloud architecture.
Maintain, optimize, and peer review standard operating procedures, operational runbooks, technical documents, and troubleshooting guidelines.

Requirements

BS or above in a related Information Technology field, or an equivalent combination of education and experience
2+ years experience in 24x7x365 production operations
Understanding of networking and network troubleshooting
2+ years experience installing, managing, and troubleshooting Linux and/or Windows Server operating systems in a production environment.
2+ years experience supporting cloud operations and automation in AWS, Azure or GCP (and related certifications)
2+ years experience with Infrastructure-as-Code and orchestration/automation tools such as Terraform and Ansible
Experience with IaaS platform capabilities and services (cloud certifications expected)
Experience with ticketing tool solutions such as Jira
Experience using environmental analytics tools such as Splunk for querying, monitoring and alerting
Experience in at least one primary scripting language (Bash, Python, PowerShell)
Excellent communication, organizational, and problem-solving skills in a dynamic environment
Effective documentation skills, to include technical diagrams and written descriptions
Ability to work as part of a team with professional attitude and demeanor

Nice-to-haves

Previous experience in a consulting role in dynamic, fast-paced environments
Previous experience supporting a 24x7x365 highly available environment for a SaaS vendor
Experience supporting security and/or infrastructure incident handling and investigation, and/or system scenario re-creation
Experience working with container orchestration solutions such as Kubernetes and EKS
Experience working with an automated CI/CD pipeline for release development, testing, remediation, and deployment
Cloud-based networking experience (Palo Alto, Cisco ASAv, etc.)
Familiarity with frameworks such as FedRAMP, FISMA, SOC, ISO, HIPAA, HITRUST, PCI, etc.
Familiarity with configuration baseline standards such as CIS Benchmarks
