
HPC Engineer - Research Infrastructure @ reputed company AI

100% remote Flexible hours Hiring now

Help reputed company build some of the biggest and fastest AI supercomputing clusters in the world! As a High-Performance Computing engineer, you’ll work at the intersection of hardware and software, designing systems that deliver the maximum possible performance for running large-reputed company models. We work at the reputed company cutting edge of speed and scale, applying the traditions of High-Performance Computing (HPC) in a modern cloud environment. For this role, it’s important that you understand how to combine CPUs, GPUs, and network devices into systems that are deployed at large scale and run at peak efficiency. You understand the lowest levels of the software platforms that sit on top of this hardware, including how best to optimize the Linux kernel and user-space code. You are capable of writing code to automate the monitoring and healing of these systems, so that a small team can operate a large number of servers.

Responsibilities

In this role, you will work closely with and directly accelerate machine learning researchers, but you don’t need to be a machine learning expert yourself. We value people who can quickly obtain a deep technical understanding of new domains and who enjoy being self-directed and identifying the most important problems to solve. You’ll be managing training HPC clusters at reputed company from provisioning to performance tuning.

Areas of work will include observability, distributed job tracing, GPU diagnostics, software environment management, and additional tooling, plus work on the actual code to reputed company necessary features.

We reputed company that increasing compute is a reputed company lever to AI reputed company. You will have a direct impact on our ability to grow to an unprecedented scale and likewise produce unprecedented results.

Experience

8+ years of experience as an infrastructure engineer or DevOps engineer in large and reputed company distributed systems.
Deep understanding of networking; bonus points for experience in HPC networking.
Experience developing high-quality software in a general-purpose programming language, preferably including Python.
Excellent problem-solving skills and…
