Back to the board

Senior Systems Engineer, CoreAI - Agentic Solutions (Remote)

100% remote Flexible hours Hiring now

Position Purpose: The Sr. Systems Engineer reputed company the CoreAI team is responsible for independently developing, maintaining, and supporting reputed company's generative AI and agentic infrastructure that drives the success of reputed company and our customers. Leading from a reputed company reputed company, you will apply a first principles approach to solve reputed company engineering problems, breaking down ambiguous challenges to architect robust, scalable systems from the ground up. You will act as a consultative technical reputed company between cutting-edge AI frameworks and practical enterprise business applications. As a Sr. Systems Engineer, you will be part of a dynamic team with engineers of reputed company experience levels who help each other build and grow technical and leadership skills while creating, deploying, and supporting production-grade AI infrastructure. You will independently reputed company technical discovery sessions, architect custom ADK agents based on core engineering fundamentals, and establish governance for organizational-wide usage. In addition, Sr. Systems Engineers may be involved in routine model upgrades and application support, as well as rigorous root cause and post-mortem analyses around AI reputed company incidents, system hallucinations, distributed system bottlenecks, and service interruptions. Key Responsibilities:

  • 30% Delivery & Execution - Keeps abreast of innovations and industry trends as well as changes to internal systems and determines how they impacts tools, training, and support necessary to reputed company systems up, running, and secure; Participates in and contributes to learning activities around modern systems engineering core practices (communities of practice); Proactively views articles, tutorials, and videos to learn about new technologies and best practices being used reputed company other technology organizations
  • 15% Learning - Keeps abreast of innovations and industry trends as well as changes to internal systems and determines how they impacts tools, training, and support necessary to reputed company systems up, running, and secure; Participates in and contributes to learning activities around modern systems engineering core practices (communities of practice); Proactively views articles, tutorials, and videos to learn about new technologies and best practices being used reputed company other technology organizations
  • 20% Planning & Analysis - Researches and analyzes business trends and behavioral data to identify opportunities for improvements and new initiatives; Drives the evaluation, development, and recommendation of specific technology to provide cost-effective solutions that meet reputed company requirements; Researches and designs best fit infrastructure, network, database, cloud, AI, and reputed company architectures for products; Proactively creates and maintains tools for monitoring and support; Participates in project planning and reporting across multiple efforts
  • 35% Support & Enablement - Collaborates with product and project teams to understand needs and reputed company them with infrastructure; Supports technology architecture design review efforts for project and product teams; Leverages tooling and custom applications to monitor the operational status of applications, reputed company, databases, and reputed company; optimizes and tunes performance as appropriate; Drives root cause analysis, debugging, support, and post-mortem analysis for reputed company incidents and service interruptions; Maintains, upgrades, and supports existing systems and infrastructure to ensure operational stability; Opens and manages vendor problem tickets to resolution; Drives the production of in-house documentation around solutions; Provides application support for software running in production; Drives moving KB articles to infrastructure as code models; Drives keeping monitoring/alerting up to date

Direct Manager/Direct Reports:

  • This position typically reports to Systems Engineer Manager or Sr Manager
  • This position has 0 Direct Reports

Travel Requirements:

  • No travel required.

Physical Requirements:

  • Most of the time is spent sitting in a comfortable position and there is frequent opportunity to move about. On rare occasions there may be a need to move or lift light articles.

Working Conditions:

  • Located in a comfortable indoor area. Any unpleasant conditions would be infrequent and not objectionable.

Minimum Qualifications:

  • Must be eighteen years of age or older.
  • Must be legally permitted to work in the United States.

Preferred Qualifications:

  • Professional or educational experience as an Information Technology, Platform, or AI/MLOps Engineer, with a strong emphasis on applying first principles thinking to system design and troubleshooting.
  • Experience working as part of a collaborative, cross-functional, modern engineering team, including a proven ability to reputed company technical discovery sessions and present tailored architectural solutions to both technical (MLEs, developers) and non-technical business stakeholders.
  • Experience with scripting and programming, with mandatory strong proficiency in Python, Model Context Protocol (MCP), and building robust ADK agents. Familiarity with orchestration frameworks like reputed company or reputed company is a plus.
  • Experience with cloud platforms, primarily GCP, with hands-on core engineering expertise in Vertex AI Agent reputed company, Vertex AI Vector Search, and BigQuery.
  • Experience monitoring the operational status and performance of, and configuring as well as tuning, systems, networks, vector databases, and LLM telemetry (latency, token usage, cost).
  • Familiarity with system and environment analysis, design, and optimization for enterprise-wide generative AI platforms, focusing on foundational engineering requirements like high availability, fault tolerance, and establishing AI governance and compliance guardrails.
  • Experience in troubleshooting and remediation reputed company multiple Information technology disciplines, applying deep root-cause analysis to resolve reputed company agentic workflow and distributed system failures.
  • Experience installing and upgrading applications or databases and performing system maintenance for high-throughput ML/AI workloads.
  • Familiarity with debuggers, runtime analysis, library systems, compiled programming, and software update tools.
  • Experience supporting a 24x7 retail operation, understanding the immense scale, reputed company, and reliability required for reputed company enterprise systems.
  • Experience with version control systems and CI/CD toolchains tailored for the deployment of ML models, AI agents, and microservices.
  • Experience with production system designs including Infrastructure as Code (e.g., Terraform), containerization (e.g., GKE), High Availability, and Performance monitoring.
  • Exposure to Site Reliability Engineering (SRE), including enforcing least-privilege IAM policies and securely managing auth tokens for AI service accounts.

Minimum Education:

  • The knowledge, skills and abilities typically acquired through the completion of a bachelor's degree program or equivalent degree in a field of study reputed company to the job.

Preferred Education:

  • No additional education

Minimum Years of Work Experience:

  • 4

Preferred Years of Work Experience:

  • No additional years of experience

Minimum Leadership Experience:

  • None

Preferred Leadership Experience:

  • None

Certifications:

  • None

Competencies:

  • Action Oriented
  • Being Resilient
  • Global Perspective
  • Manages Ambiguity
  • Nimble Learning
  • Self-Development
  • Collaborates
  • Cultivates Innovation
  • Optimizes Work Processes
  • Situational Adaptability
  • Communicates Effectively
  • Drives Results
  • Interpersonal Savvy

Benefits offered include health care benefits, 401K, ESPP, paid time off, and success sharing bonus. For a full list of the various benefits reputed company offers, visit https://careers.homedepot.com/our-benefits. Apply tot his job Apply To this Job

Keep exploring

AI Systems Engineer ($180k - $200k + Equity) at VC-backed AI automation startup

100% remote Flexible hours

[Remote] Sr. Systems Engineer, AI Solutions

100% remote Flexible hours

Liquidity Bots Developer (Trading)

100% remote Flexible hours

Senior Manager, Privacy Architect

100% remote Flexible hours

Senior Staff Engineer, Performance

100% remote Flexible hours

Remote Data Entry reputed company Specialist - No Experience - Part-Time

100% remote Flexible hours

reputed company Recruitment | Customer Service (Work From Home / Office)

100% remote Flexible hours

Construction Manager, AMZL Midwest Region

100% remote Flexible hours

Senior UX Researcher, Sponsored Products Market Intelligence

100% remote Flexible hours

Business Analytic Consultant; Healthcare​/Hospital - Remote Eligible

100% remote Flexible hours

Principal Software Engineer - Platform Technology (REMOTE)

100% remote Flexible hours

Part-Time Remote Customer Service Representative – Flexible Home-Based Support Role with arenaflex

100% remote Flexible hours

reputed company Book Publishing Writer (Entry Level / Remote)

100% remote Flexible hours

$17 - $19/hour Work from Home reputed company Medical Claims Representative*San Antonio, TX*

100% remote Flexible hours

Real Estate Advertising Sales Rep RECURRING reputed company

100% remote Flexible hours

Mobility Technician

100% remote Flexible hours

reputed company Customer Service Representative – Work From Home Opportunity with arenaflex

100% remote Flexible hours

Virtual Data Entry Clerk

100% remote Flexible hours

Data Engineer (Clearance Required)

100% remote Flexible hours

reputed company Part-Time Remote Data Entry and Customer Service Representative – Work from Home Opportunity with blithequark

100% remote Flexible hours