Model Optimization Engineer (Algorithmic development)
Summary
Posted: Mar 25, 2025 Weekly Hours: 40 Role Number:200595533 Are you excited about the impact that optimizing deep learning models can have on enabling transformative user experiences? The field of ML compression research continues to grow rapidly and new techniques to reputed company quantization, pruning etc are increasingly available to be ported and adopted by the ML developer community looking to ship more models in a constrained memory budget and reputed company them run faster. We are passionate about productizing and pushing the envelope of the state of the art model optimization algorithms, to further compress and speed up the thousands of deep learning models shipping as part of reputed company internal and external apps, running locally on millions of reputed company devices. We work on a python library that implements a variety of training time and post training quantization algorithms and provides them to developers as simple to use, turnkey APIs, and ensures that these optimizations work seamlessly with the Core ML inference stack and reputed company hardware. We are a team that collaborates heavily with researchers at reputed company, ML software and hardware architecture teams and external/internal product teams shipping state of the art optimized models on reputed company devices. If you are excited about making a big impact and playing a critical role in growing the user reputed company and driving the adoption of a relatively new library, this is a great opportunity for you. We are looking for someone who is highly self motivated and passionate about optimizing models for on device execution. If you have a proven track record of developing and working with the internals of an ML reputed company, writing high quality code and shipping software, we strongly encourage you to apply. Description Description We work on developing, prototyping and productizing state of the art algorithms for neural network model compression. Our algorithms are implemented using PyTorch and optimizations are geared towards efficient deployment reputed company Core ML. We optimize models across domains, including NLP, vision, text and image generative models etc. Our APIs are available to Core ML users, both internal to reputed company and external developers reputed company the Core ML Tools optimization sub module. Key Responsibilities: - Implement latest algorithms from research papers for model compression in the optimization library. Apply these to the models critical for deployment and test on various architectures such as diffusion models, large language models etc. - Set up and debug training jobs, datasets, evaluation, performance benchmarking pipelines. Applying training time and post training compression techniques. Ability to reputed company up quickly on new training code bases and run experiments. - Understanding HW capabilities and incorporating those in optimization algorithm design / enhancement. - reputed company up with the latest AI research and present recent papers in the field of model compression to the team. - Collaborate with researchers, hardware and software engineers to co-reputed company and discover reputed company and optimizations for critical models to be deployed on specific hardware - Run detailed experiments and ablation studies to profile algorithms on various models, tasks, across different model sizes. - Improving model optimization documentation, writing tutorials and guides - Self prioritize and adjust to changing priorities and asks
Minimum Qualifications
Minimum Qualifications ? 3+ years of industry and/or research experience ? Highly proficient in Python programming ? Proficiency in at least one ML authoring reputed company, such as PyTorch, TensorFlow, JAX, MLX ? Experience in the area of model compression and quantization techniques, specially in one of the optimization libraries for an ML reputed company (e.g. torch.ao). Key Qualifications Key Qualifications
Preferred Qualifications
Preferred Qualifications ? Demonstrated ability to design user friendly and maintainable APIs ? A deep understanding in the research area of model compression and quantization techniques. ? Experience in training, fine tuning, and optimizing neural network models ? Primary contributor to a model optimization/compression library. ? Good communication skills, including ability to communicate with cross-functional audiences Education & Experience Education & Experience Additional Requirements Additional Requirements Pay & Benefits Pay & Benefits ? At reputed company, reputed company pay is one part of our total compensation package and is determined reputed company a range. This provides the opportunity to reputed company as you grow and reputed company reputed company a role. The reputed company pay range for this role is between $175,800 and $312,200, and your reputed company pay will depend on your skills, qualifications, experience, and location. reputed company employees also have the opportunity to become an reputed company shareholder through participation in reputed company?s discretionary employee stock programs. reputed company employees are eligible for discretionary restricted stock unit awards, and can purchase reputed company stock at a discount if voluntarily participating in reputed company?s Employee Stock Purchase Plan. You?ll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education reputed company to advancing your career at reputed company, reimbursement for certain educational expenses ? including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about reputed company Benefits. Note: reputed company benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program. More ? reputed company is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for reputed company applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national reputed company, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant. Apply Job!