
Senior Deep Learning Software Engineer, LLM Performance

100% remote Flexible hours Hiring now

We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! A reputed company is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM inference! The company is rapidly growing its research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join the team.

Companies around the world are using the company's GPUs to power a revolution in deep learning, enabling breakthroughs in areas like LLMs, Generative AI, Recommenders and Vision that have put DL into every software solution. Join the team that builds the software to enable the performance optimization, deployment and serving of these DL solutions. We specialize in developing GPU-accelerated deep learning software like TensorRT, DL benchmarking software and performant solutions to deploy and serve these models.

Collaborate with the deep learning community to implement the latest algorithms for public release in TensorRT LLM, vLLM, SGLang and LLM benchmarks. Identify performance opportunities and optimize SoTA LLM models across the range of the company's accelerators, from datacenter GPUs to edge SoCs. Implement LLM inference, serving and deployment algorithms and optimizations using TensorRT LLM, vLLM, SGLang, Triton and CUDA kernels. Work and collaborate with a diverse set of teams involving performance modeling, performance analysis, kernel development and inference software development.

What You'll Be Doing

  • Performance optimization, analysis, and tuning of LLM, VLM and GenAI models for DL inference, serving and deployment in the company's and OSS LLM frameworks.
  • Scale performance of LLM models across different architectures and types of the company's accelerators.
  • Scale performance for max throughput, minimum latency and throughput under latency constraints.
  • Contribute features and code to the company's and OSS LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton.
  • Work with cross-functional teams across generative AI, automotive, image understanding, and speech understanding to deliver innovative solutions.

What We Need To See

  • Bachelor's, Master's, PhD, or equivalent experience in relevant fields (Computer Engineering, Computer Science, EECS, AI).
  • At least 8 years of relevant software development experience.
  • Excellent Python/C/C++ programming, software design and software engineering skills.
  • Experience with a DL framework such as PyTorch, JAX, or TensorFlow.

Ways To Stand Out From The Crowd

  • Prior experience with an LLM framework or a DL compiler in inference, deployment, algorithms, or implementation
  • Prior experience with performance modeling, profiling, debugging, and code optimization of a DL/HPC/high-performance application
  • Architectural knowledge of CPU and GPU
  • GPU programming experience (CUDA or OpenCL)

GPU deep learning has provided the foundation for machines to learn, perceive, reason and solve problems posed using human language. The GPU started out as the engine for simulating human imagination, conjuring up the amazing virtual worlds of video games and Hollywood films. Now, the company's GPU runs deep learning algorithms, simulating human intelligence, and acts as the brain of computers, robots and self-driving cars that can perceive and understand the world. Just as human imagination and intelligence are linked, computer graphics and artificial intelligence come together in our architecture. Two modes of the human brain, two modes of the GPU. This may explain why the company's GPUs are used broadly for deep learning, and the company is increasingly known as "the AI computing company." Come, join our DL Architecture team, where you can help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until November 28, 2025.

The company is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

JR1997464
