[Remote] Sr Platform Engineer

100% remote Flexible hours Hiring now

Note: This is a remote position open to candidates in the USA. The company builds and operates IT platforms and is seeking a Senior Platform Engineer to join its platform development team. The role carries hands-on engineering responsibility for developing and managing critical platform subsystems, ensuring high availability and operational resiliency while using native-AI capabilities.

Responsibilities

  • Design, build, and operationally manage automated, resilient, highly available, self-healing, secure platforms with native-AI capabilities for IT needs, serving both internal and customer business capabilities
  • Build and manage the Observability OpenTelemetry Central Backend Stack: Grafana Enterprise, Mimir, Loki, Tempo, and Alertmanager on Kubernetes/RKE2, deployed with Helm and CI/CD pipelines
  • Build and manage IaC and CI/CD for automated provisioning and deployment, including Terraform modules for infrastructure/VM/storage provisioning, Ansible AWX playbooks for OS/app bootstrap, and ArgoCD and Helm for Kubernetes configuration
  • Build and manage the OpenTelemetry/Prometheus scrape profile library, including SNMP exporters, REST API exporters, and cloud provider exporters (CloudWatch, Azure Monitor, GCP) for multiple device classes
  • Build AIOps capabilities on platforms, e.g. for observability use cases: anomaly detection integrations, event correlation rules in Alertmanager, and synthetic monitoring patterns to reduce alert noise
  • Configure and maintain Zabbix auto-discovery: network range scanning, device classification, and Prometheus service discovery integration
  • Build and harden Edge Stack deployments (Prometheus + OTel collector) per data center site using GitOps templates
  • Integrate Alertmanager with the ITSM platform: webhook routing, ticket enrichment, auto-resolve logic, and escalation policy configuration
  • Maintain platform security: Conjur/CyberArk secret injection at runtime, mTLS between stack components, RBAC in Grafana Enterprise
  • Author and maintain Grafana dashboards in JSON: facility overview, network health, RED metrics, application telemetry
  • Mentor mid-level engineers, conduct code reviews, and establish engineering standards for the team. Represent platform engineering in cross-functional architecture reviews and executive-level program updates
  • Perform other duties as required and assigned

Skills

  • DevOps / Automation - 5+ years in a production environment: Kubernetes (RKE2/k3s), Helm chart deployment, system services, containers
  • LGTM Stack Development and Configuration - 4+ years: Grafana, Mimir, Loki, and Tempo configuration, tuning, dashboarding, and production operations; Prometheus required
  • Senior-level Python / Scripting frameworks - 5+ years: automation scripts, exporter development, pipeline scripting, REST API integrations
  • GitOps / CI/CD - 5+ years: CI/CD pipeline authoring; Terraform and Ansible as primary IaC tools; ArgoCD or Flux preferred
  • AIOps / Observability Engineering - 2+ years, Alertmanager rule authoring, anomaly detection integration, event correlation, noise reduction techniques
  • Working infrastructure (Linux/VM) management knowledge - 5+ years: Linux administration, VMware vCenter/VCF experience, storage management, network fundamentals (SNMP, TCP/IP)
  • Secrets Management - 2+ years, CyberArk/Conjur, HashiCorp Vault, or equivalent — runtime secret injection patterns
  • Minimal travel may be required
  • Experience with or knowledge of ITSM processes and workflow automation, e.g. Incident & Response Management (IRM), release management, ITSM integration, alert routing, escalation policy design, SLA-driven on-call workflows
  • Hands-on experience or working knowledge of integration Platform as a Service (iPaaS) technologies
  • Experience working with BAS / BMS systems in a data center / OT environment
  • Hands-on experience with AWS products under the Well-Architected Framework and a multi-account model, covering compute, storage, and network IaaS and PaaS services for IT applications

Benefits

  • Medical, Telehealth, Dental and Vision
  • 401(k)
  • Health Savings Accounts (HSA) and Flexible Spending Accounts (FSA)
  • Life and AD&D
  • Short-Term and Long-Term Disability
  • Paid Time Off (PTO)
  • Leave of Absence
  • Employee Assistance Program
  • Wellness Program
  • Rewards and Recognition Program

Company Overview

  • The company provides IT solutions including integrated colocation, interconnection, cloud, data protection, and professional services. It was founded in 2000 and is headquartered in Charlotte, North Carolina, USA, with a workforce of 501-1000 employees.