Public Cloud Linux Engineer
About the position
The Cloud Linux Operations & Support role provides support for Linux-based systems across on-premises and cloud environments, including AWS, Azure, GCP, and OCI. The position supports customer self-provisioning of cloud instances with guardrails and backend deployment. Responsibilities include implementing and maintaining system monitoring, alerting, and logging solutions to ensure high availability and reliability, leading root cause analysis, executing patch management, and developing backup and disaster recovery strategies. The role also includes participating in an on-call rotation and providing after-hours support as required.
Responsibilities
- Provide support for Linux-based systems across on-premises and cloud environments (AWS, Azure, GCP, OCI).
- Support customer self-provisioning of cloud instances across AWS, Azure, GCP, and OCI with guardrails and backend deployment.
- Implement and maintain system monitoring, alerting, and logging solutions to ensure high availability and reliability.
- Lead root cause analysis and document post-incident reviews for major Linux-related issues.
- Execute patch management, OS and kernel upgrades, and regular system maintenance.
- Develop and maintain backup, disaster recovery, and failover strategies for Linux infrastructure.
- Participate in on-call rotation and after-hours support as required.
- Develop and maintain automation scripts using PowerShell, Python, or Ansible to streamline system administration tasks.
- Manage infrastructure using tools like Azure Automation.
- Maintain Infrastructure as Code (IaC) templates using tools such as Terraform, CloudFormation, ARM, or OCI Resource Manager.
- Recommend and implement system optimization strategies for performance and resource utilization.
- Enforce Linux system security best practices, including access control, encryption, and secure configurations.
- Manage user access, sudo privileges, and SSH key policies in line with IAM standards.
- Monitor, identify, and remediate security vulnerabilities reported by scanning tools or external advisories.
- Support compliance initiatives by maintaining secure and auditable Linux environments.
- Work closely with application, security, and network teams for solution delivery and support.
- Mentor junior engineers and provide technical guidance as needed.
- Create and update technical documentation, runbooks, and SOPs.
- Participate in client calls to provide technical input as required.
Requirements
- Bachelor's degree (or equivalent experience) in Computer Science, IT, Engineering, or a related field.
- Red Hat Certified System Administrator (RHCSA).
- Red Hat Certified Engineer (RHCE).
- Microsoft Certified: Azure Administrator Associate or Azure Solutions Architect Expert.
- 7+ years of hands-on experience in public cloud Linux engineering and operations in a 24x7 production support model.
- 3+ years of multi-cloud experience (hands-on experience with at least two of AWS, Azure, GCP, and OCI preferred).
- Direct experience in managed services/NOC/SOC/MSP environments is a plus.
- In-depth expertise in provisioning, configuring, securing, supporting, and optimizing Linux-based systems (RHEL, CentOS, Ubuntu, etc.) in enterprise environments.
- Basic expertise with cloud-native and hybrid workloads in AWS, Azure, GCP, and/or OCI.
- Strong experience in managing compute, storage, networking, and system services on Linux platforms.
- Proficient in system architecture, deployment, performance tuning, and troubleshooting of Linux servers.
- Skilled in scripting languages such as Bash, Python, and Perl for automation and system management.
- Proficient in using ITSM tools for incident, change, and problem management.
- Strong understanding of Linux-based backup strategies, disaster recovery planning, and high-availability configurations (e.g., Pacemaker, DRBD).
- Familiar with security tools and practices including SELinux, iptables, auditd, and fail2ban.
- Experience with vulnerability assessment and patch management using tools like OpenSCAP or Lynis.
- Proficient in log analysis and system monitoring using tools such as Syslog, Logrotate, Nagios, Prometheus, and Grafana.
- Familiar with endpoint protection and threat detection tools such as OSSEC.
- Strong knowledge of user access control, SSH key management, and secure file transfer protocols.
- Ability to troubleshoot Linux services such as Apache, Nginx, MySQL, PostgreSQL, and Samba.
Nice-to-haves
- Maintaining Infrastructure as Code (IaC) templates using tools such as Terraform, CloudFormation, ARM, or OCI Resource Manager.