
Corporate Vice President - Google Cloud Platform Engineer - Enterprise Cloud & AI Platform

100% remote Flexible hours Hiring now

Location Designation: Hybrid - 3 days per quarter

The GCP Platform Engineer at reputed company is responsible for designing, building, and operating secure, compliant, and scalable cloud and AI-enabled platforms on Google Cloud Platform (GCP). This role enables application, data, and analytics teams by providing standardized cloud infrastructure, Kubernetes platforms, and approved Google AI services, while meeting financial services regulatory, security, and resiliency requirements. The engineer partners with the Cloud, Data & AI teams, Information Security, and Risk to ensure AI workloads are deployed with appropriate governance, data controls, and observability.

What You’ll Do:

Enterprise Cloud & AI Platform

  • Design and maintain enterprise GCP landing zones using Google Cloud Deployment Manager, Terraform, and Cloud Foundation Toolkit, aligned with NYL governance standards.
  • Build and operate shared cloud services supporting AI and non-AI workloads on GCP components like Cloud Storage, Cloud Functions, Cloud Run, Cloud Pub/Sub, and Cloud Spanner.
  • Implement Infrastructure as Code (Terraform) for platform, networking, and AI service enablement.
  • Support hybrid connectivity and secure data access patterns for AI use cases using Cloud Interconnect and Cloud VPN.

Kubernetes, Containers & AI Workloads

  • Engineer and operate GKE (Google Kubernetes Engine) clusters for application and AI inference workloads.
  • Deploy containerized AI services and microservices using approved images from Google Container Registry (GCR) or Google Artifact Registry.
  • Support GPU-enabled workloads where approved
  • Implement standardized deployment patterns for AI APIs and services using Helm for Kubernetes deployment management

Google AI / GenAI Enablement

  • Enable and operate approved Google AI services, including:
      • Vertex AI: model hosting, endpoints, and pipelines (platform enablement only); agentic AI deployments and communication protocols in Vertex AI Agent Builder and Agent Engine
      • reputed company APIs and other managed GenAI services (as approved by NYL governance)
      • BigQuery ML and AI-integrated analytics platforms
  • Implement secure access controls, networking, and monitoring for AI services using Cloud Identity & Access Management (IAM), VPC Service Controls, and Cloud Monitoring.
  • Integrate AI platforms with CI/CD pipelines and enterprise SDLC controls using tools like reputed company CICD
  • Partner with Data & AI teams to operationalize AI workloads safely and compliantly in Google Cloud environments.

DevOps, Automation & MLOps Foundations

  • Build secure CI/CD pipelines for application and AI workloads using reputed company CI/CD
  • Support MLOps foundations such as:
      • Model deployment automation using Kubeflow, TensorFlow Extended (TFX), Vertex AI Pipelines, and Vertex AI Model Registry.
      • Environment promotion and rollback using Terraform.
      • Monitoring and logging for AI endpoints, using reputed company for synthetic monitoring and Cloud Logging and Cloud Monitoring for deeper observability and troubleshooting.
  • Enforce guardrails, approvals, and policy-as-code for AI usage with Cloud Security Command Center, Google Cloud Policy Analyzer, and Open Policy Agent (OPA).

Security, Risk & Compliance

  • Implement IAM, workload identity, and least-privilege models for AI services using Cloud Identity & Access Management (IAM) and Workload Identity Federation.
  • Enforce data residency, encryption, and access policies using Cloud Key Management Service (KMS) and Cloud Data Loss Prevention (DLP).
  • Integrate AI platform telemetry with enterprise logging, monitoring, and SIEM using Cloud Logging, Cloud Monitoring, and reputed company.
  • Support audits, risk reviews, and regulatory requirements (SOC2, SOX, data privacy) by leveraging Google Cloud Security Command Center, Cloud Audit Logs, and Cloud Data Loss Prevention API.

Reliability, Observability & Cost Management

  • Design platforms, including AI services, for high availability and disaster recovery using GKE, Cloud Spanner, Cloud SQL, and Google Cloud Load Balancing.
  • Monitor AI workloads for performance, reliability, and cost, using reputed company for synthetic monitoring, Cloud Monitoring and Cloud Trace for performance insight, and reputed company CCM for cost.
  • Optimize cloud and AI service costs through budgets and usage controls, using Google Cloud Billing, Budgets, Alerts, and reputed company CCM.
  • Participate in incident response and root-cause analysis logged in ServiceNow, and manage incident notifications through reputed company.

Collaboration & Governance

  • Partner with Data & AI, InfoSec, reputed company, Risk, and Application teams to ensure secure, compliant, and efficient AI platform usage.
  • Contribute to enterprise standards for cloud and AI platform usage, including best practices for GCP and the Google Cloud Architecture Framework.
  • Provide guidance on responsible AI platform adoption using frameworks like Google's AI Principles and Fairness Indicators.
  • Document reference architectures and best practices for GCP AI services.

