AIML - Software Engineer (ML Efficiency), Machine Learning Platform & Infrastructure

Apple Inc

Cupertino, CA

Job posting number: #7285656 (Ref:apl-200571396)

Posted: October 3, 2024

Job Description

Summary
Do you want to shape the platform that enables the next generation of intelligent experiences on Apple products & services? In Apple’s Machine Learning Platform Technology & Infra team we have built the platform that Apple uses for developing machine learning, artificial intelligence, and computer vision applications. As a team, we have a variety of technical backgrounds, from machine learning PhDs to builders of large-scale production systems.

Specifically in this role you will be working on optimizing end-to-end system performance of distributed machine learning workloads. This is a highly collaborative role and you will be working with key partners across the company.
Description
We are seeking highly motivated and experienced engineers to join our team. The ideal candidate will have a deep understanding of machine learning systems and cloud computing infrastructure. Key responsibilities in this role are:

Engage with ML researchers to optimize end-to-end performance of large scale distributed ML workloads
Analyze workload metrics to identify sources of inefficiencies and work with users to understand and optimize ML workloads
Conduct workload analysis based on benchmarking key workloads on deployed systems
Improve large scale training resiliency by optimizing applications and frameworks for improved recovery from failures and preemptions
Influence architecture, design, development, and operations of next generation ML accelerator systems based on workload insights
Minimum Qualifications
  • Experience working with large scale parallel and distributed accelerator-based systems
  • Experience optimizing performance and AI workloads at scale
  • Experience developing code in one or more of training frameworks (such as PyTorch, TensorFlow or JAX)
  • Experience in performance analysis and optimization experience in Cloud accelerators
  • Deep understanding of computer systems and the interactions between HW and SW
  • Strong communicator with ability to analyze complex and ambiguous problems
  • Programming and software design skills (proficiency in C/C++ and/or Python)
  • Experience working in a high-level collaborative environment and promoting a teamwork mentality
Preferred Qualifications
  • BS or MS in Computer Science or related field
Pay & Benefits




Apply Now

Please mention to the employer that you saw this ad on AiCareers.com

More Info

Job posting number:#7285656 (Ref:apl-200571396)
Application Deadline:Open Until Filled
Employer Location:Apple Inc
Jacksonville,Florida
United States
More jobs from this employer