[Remote] reputed company

100% remote Flexible hours Hiring now

Note: This is a remote job open to candidates in the USA. The company builds technology to help families manage their routines and navigate transitions. It is seeking an engineer to maintain and optimize its AI infrastructure, run self-hosted inference stacks, and build user-facing features that help families coordinate their daily activities.

Responsibilities

  • Run and optimize our self-hosted inference stack
  • Run the inference serving layer on our own GPU hardware: choose and tune the serving stack (vLLM, SGLang, TensorRT-LLM) for high throughput and low latency
  • Optimize aggressively: tensor parallelism, quantization (FP8, AWQ, GPTQ), KV-cache and prefix caching, batching, speculative decoding, concurrency tuning
  • Serve multiple models and features off shared hardware: multi-LoRA, routing, and request scheduling that balances internal workloads against latency-sensitive product traffic
  • Keep our AI workloads efficient: improve latency, throughput, and GPU utilization so we get the most out of what we run
  • Build the visibility: track performance and usage across our AI surfaces so there's clear data on how everything is running
  • Surface the technical tradeoffs (performance, latency, efficiency) so the people making the calls have what they need to make them
  • Ship the in-app agent layer that helps families coordinate: proactive nudges, smart suggestions, agents that summarize, draft, schedule, and act for busy parents
  • Build the substrate underneath: tools, memory, orchestration, guardrails, and evaluation harnesses, integrated cleanly with production APIs alongside our architecture team
  • Work in nimble pairs with feature owners, standing up whatever's needed to test an idea, including a vibe-coded UI when that's the fastest path to a real customer. Ship rough, learn fast, harden what works

Skills

  • 5+ years shipping production software, including meaningful applied AI or ML work
  • Demonstrated experience running and optimizing self-hosted LLMs on dedicated multi-GPU hardware: a serving stack (vLLM, SGLang, or TensorRT-LLM) and the optimization that comes with it (tensor parallelism, quantization, batching, KV cache)
  • A track record of optimizing inference performance and efficiency (latency, throughput, GPU utilization)
  • Strong Python and engineering fundamentals, with the full-stack range to stand up a quick UI and a genuine desire to work on app-layer features, not only infra
  • Hands-on with agent frameworks (Claude Agent SDK, LangGraph, or similar), LLM APIs, embeddings, and RAG
  • Comfortable with AWS and the devops this role owns: CI/CD, monitoring, and observability
  • Experience building internal tooling or platforms others depend on. Bonus for MCP or agent orchestration at team scale

Benefits

  • Medical: the company pays 100% of the premium for employees and 99% for additional family members
  • 401k: Up to a 4% match with immediate vesting
  • Paid leave for new parents
  • Learning & Development stipend for employees
  • Paid Time Off: 11 Holidays + Winter Break (3 Days) + Volunteer Time Off (1 Day) + Floating Holiday (1 Day)
  • Personal Time Off: 15 days for 0-1 years of employment, 20 days for 1-3 years of employment
  • Supportive and flexible working environment with fully remote work

Company Overview

  • The company provides software tools for family organization, communication, and custody management through a technology platform. It is headquartered in Minneapolis, Minnesota, USA, with a workforce of 51-200 employees. Its website is https://www.intandemfamilies.com.