Compute Architect Intern - 2025
Job Description
We are now looking for a Compute Architect intern in the GPU / deep learning field.
Are you passionate about exploring computer architectures for deep learning? Do you like to work at the intersection of hardware and software? NVIDIA is seeking world-class programmers and performance architects who love to squeeze every cycle of performance out of deep learning code. In this role, you will write code that ships in our deep learning libraries and help guide the direction of our future GPU architectures. This position offers the opportunity to have a real impact in a fast-moving, technology-focused company.
What you'll be doing:
Analyze the performance of various machine learning and deep learning (DL) algorithms on existing and new architectures
Identify performance bottlenecks and propose creative solutions to address them
Prototype key deep learning and data analytics algorithms and applications
Understand and analyze how the interplay between hardware and software architectures affects future algorithms and applications
Add new capabilities to GPU architectures
What we need to see:
MS or PhD in a relevant discipline (CS, EE, Math) or equivalent experience
Strong programming skills in C, C++, or Python
Familiarity with GPU computing (CUDA, OpenCL, OpenACC) and HPC (MPI, OpenMP)
Strong background in computer architecture
Experience with matrix multiply and convolution algorithms
Ways to stand out from the crowd:
Experience with parallel programming and CUDA/OpenCL
Familiarity with DL frameworks and fundamentals
Familiarity with MLIR or compiler development and optimization
Good communication and organizational skills