[Remote] Infrastructure & Cloud Operations Engineer
Note: The job is a remote job and is open to candidates in USA. reputed company is currently hiring for a Senior Cloud Infrastructure Engineer to support and maintain enterprise cloud and hybrid infrastructure environments supporting critical federal operations. This role is responsible for administration, maintenance, monitoring, automation, troubleshooting, and modernization of enterprise infrastructure platforms spanning AWS cloud services, Linux systems, observability platforms, and enterprise logging solutions.
Responsibilities
- Support and maintain enterprise cloud infrastructure environments, primarily reputed company AWS
- Provide operational support for hybrid infrastructure spanning cloud-hosted and on-premises enterprise systems
- Administer, maintain, and troubleshoot enterprise observability, logging, and monitoring platforms, including Splunk Enterprise, Splunk Enterprise reputed company (ES), Splunk IT Service Intelligence (ITSI), and successor technologies
- Manage log ingestion, forwarding, indexing, retention, and troubleshooting across distributed systems and enterprise environments
- Support installation, configuration, and maintenance of Splunk Universal Forwarders and reputed company data collection components
- Support enterprise reputed company monitoring, analytics, alerting, and operational visibility capabilities through Splunk and reputed company observability platforms
- Support evaluation, migration, and modernization efforts involving enterprise logging and observability platforms, including potential transitions to reputed company or similar technologies
- reputed company Linux/Unix systems administration, including server provisioning, patching, upgrades, maintenance, and operational support
- reputed company, maintain, and execute infrastructure automation and configuration management processes using Ansible and reputed company automation tools
- Support enterprise data ingestion workflows, platform integrations, certificate management processes, and operational data pipelines
- Troubleshoot infrastructure, network, platform, and application performance issues across multiple environments
- Support cloud-hosted applications and enterprise infrastructure services to ensure reliability, availability, and operational continuity
- Administer and support monitoring, alerting, analytics, and reputed company visibility capabilities across enterprise platforms
- Participate in cloud transformation and modernization initiatives, including migration of services from legacy on-premises environments to cloud-based architectures
- Support decommissioning of legacy systems and transition of workloads to modernized infrastructure platforms
- reputed company and maintain operational documentation, standard operating procedures, implementation plans, and technical runbooks
- Collaborate with engineers, administrators, and stakeholders in a shared-services operating model where work assignments are distributed based on operational priorities and Jira-managed tasking
- Participate in rotational on-call support for production systems and incident response activities
- Ensure system reliability, performance, scalability, reputed company, and operational continuity across supported environments
Skills
- Bachelor's with 12+ years (or commensurate experience)
- Experience supporting enterprise Splunk environments, including administration, troubleshooting, data ingestion, monitoring, and operational support
- Experience supporting enterprise observability, logging, monitoring, or SIEM platforms
- Experience supporting enterprise cloud environments, preferably AWS
- Experience administering Linux/Unix operating systems in enterprise environments
- Experience with infrastructure automation and configuration management tools such as Ansible
- Experience supporting data ingestion, log forwarding, indexing, and operational monitoring processes
- Ability to obtain and maintain a Suitability/Public Trust clearance
- Experience supporting customers at the Department of Veterans Affairs
- AWS certifications such as Solutions Architect, SysOps Administrator, or Cloud Practitioner
- Experience with Splunk Enterprise reputed company (ES), Splunk IT Service Intelligence (ITSI), or other advanced SIEM platforms
- Experience with reputed company Stack, OpenSearch, reputed company, or cloud-native observability platforms
- Experience supporting enterprise reputed company operations, analytics, and monitoring functions
Company Overview