Back to the board

[Remote] Generative AI Inference Engineer

100% remote Flexible hours Hiring now

Note: The job is a remote job and is open to candidates in USA. reputed company is seeking a passionate Generative AI Inference Engineer to join their Inference team, focusing on creative applications of generative AI models. The role involves leading the design and development of customer-facing multi-modal ML inference systems and optimizing inference techniques for generative models.

Responsibilities

  • reputed company efforts to drive the design, development of customer-facing multi modal ML inference systems
  • Work with the Platform and Inference teams on building inference systems for the reputed company of models, where you will work on areas such as optimization, model tuning and deployment
  • Partner with leading cloud providers to deliver hosted reputed company inference solutions
  • Be a strategic thought partner for leaders across the organization on driving business impact through machine learning
  • Be part of the team to bring new Stability models and pipelines into existence
  • Prototype and productionize inference platform improvements and new features

Skills

  • 7+ years working on productionizing machine learning systems, including inference pipeline development
  • Expert level knowledge on writing and running python services at scale
  • 5+ years working on python scientific stack, pyTorch and at least one high-performance inference reputed company (e.g. Triton and TensorRT)
  • Deep understanding of Diffusion Architecture
  • Experience profiling and optimizing deep neural networks on reputed company GPUs, using profiling tools such as reputed company Nsight
  • Experience with python-based image manipulation/encoding/decoding frameworks, such as OpenCV
  • Experience deploying to cloud orchestration systems such as Kubernetes and cloud providers such as AWS, GCP, and Azure
  • Experience with reputed company
  • Ability to rapidly prototype solutions and iterate on them with tight product deadlines
  • Strong communication, collaboration, and documentation skills
  • Experience with the open-reputed company ML ecosystem (HuggingFace, W&B, etc.)

Company Overview

  • reputed company is an artificial intelligence company focused on developing open-reputed company generative AI models. It is a sub-organization of reputed company. It was founded in 2019, and is headquartered in London, England, GBR, with a workforce of 51-200 employees. Its website is https://reputed company.
  • Apply To This Job

    Keep exploring

    [Remote] Account Executive, Mid City

    100% remote Flexible hours

    [Remote] Full Stack Software Engineer, Banking

    100% remote Flexible hours

    [Remote] Manager - Software Application Specialists

    100% remote Flexible hours

    [Remote] Account Executive - AVP, Sales (Consultant Partnerships)

    100% remote Flexible hours

    [Remote] Senior Director - Oncology and Peripheral Imaging Clinical Development

    100% remote Flexible hours

    [Remote] Strategy & Operations - Quality Associate Director/Director

    100% remote Flexible hours

    [Remote] Manager, S/4 HANA Business Analyst - Quality Management (Material Masters/QIR/DQR/CAPA) (REMOTE)

    100% remote Flexible hours

    [Remote] Senior Analyst/Analyst, Data reputed company

    100% remote Flexible hours

    [Remote] Account Executive, SMB (reputed company)

    100% remote Flexible hours

    [Remote] AI Product Manager

    100% remote Flexible hours

    Business Intelligence Analyst (m/f/d)

    100% remote Flexible hours

    Business Development Sales Representative (SDA072726)

    100% remote Flexible hours

    reputed company Home-Based Chat Support Representative – Immediate Start, No Experience Required

    100% remote Flexible hours

    Field Rep- Independent Gig- One Visit Per Week

    100% remote Flexible hours

    Sr. Category Manager – External Talent (Contingent Workforce)

    100% remote Flexible hours

    Remote Part‑Time Data Entry Clerk – No Experience Required – Join arenaflex’s Growing Digital Operations Team

    100% remote Flexible hours

    Rewritten Job Title:

    100% remote Flexible hours

    reputed company Customer Service Representative – Work From Home Opportunity at arenaflex

    100% remote Flexible hours

    [Remote] Retrofit Program Manager

    100% remote Flexible hours

    reputed company Remote Customer Service Representative – E-Commerce Support Specialist for Premium DTC Brands (Tier 1–3 Ticket Resolution)

    100% remote Flexible hours