Senior Software Test Development Engineer - NeMo LLM

Nvidia

Shanghai, China

Job posting number: #7291685 (Ref:JR1990256)

Posted: October 30, 2024

Job Description

We are looking for a Software Test Development Engineer for NVIDIA’s AI SWQA team. The NVIDIA AI Software Quality Assurance team defines, develops, and performs tests to validate the robustness and measure the performance of NVIDIA’s AI software and GPU infrastructure for autonomous driving, healthcare, speech recognition, natural language processing, and a wide variety of other AI scenarios. The team collaborates with multiple AI product teams to develop new products, derive and improve complex test plans, and improve workflow processes for a diverse range of GPU computing platforms. You should thrive on being in the critical path, supporting developers who work for billion-dollar business lines, and intimately understand the values of responsiveness, thoroughness, and teamwork. You should constantly foster and implement efficiency improvements across your domain. Join the team that is building software that will be used by the entire world!

What you’ll be doing:

  • Work closely with global cross-functional teams to understand the test requirements and take ownership of product quality.

  • Plan, design, execute, and automate test plans and test cases, and report the results.

  • Manage the bug lifecycle and work with other groups to drive solutions.

  • Automate test cases and assist in architecting, designing, and implementing test frameworks.

  • Reproduce customer issues in-house and verify fixes.

What we need to see:

  • BS or higher degree in CS/EE/CE or equivalent.

  • 5+ years of software quality assurance or test automation background with knowledge of test infrastructure and strong analysis skills.

  • Scripting language (Python, Bash) knowledge and UNIX/Linux experience.

  • Good Python software development or test development experience.

  • Good user or development experience with virtualization and container technologies such as VMs, Docker containers, and Kubernetes (k8s).

  • Excellent English written and oral communication skills.

  • Experience using, developing, training, or running inference with LLMs such as GPT or Llama is a must.

  • Good experience with multi-node deep learning training on a Slurm cluster.

  • Experience using or developing with popular deep learning frameworks (DLFW) such as PyTorch, JAX, DGL, and PyG.

  • Able to juggle conflicting/changing priorities and maintain a positive attitude while experiencing challenging and dynamic schedules.

Ways to stand out from the crowd:

  • Familiarity with NVIDIA GPU hardware products (Tesla, Tegra, DGX, etc.).

  • Understanding and working knowledge of any deep learning framework, especially in end-to-end customer scenarios.

  • Working knowledge of NVIDIA GPU computing (CUDA) and CUDA libraries for deep learning, such as cuDNN.

  • Experience in VectorCAST, Bullseye, Gcov, or Coverity tools.





More Info

Application Deadline: Open Until Filled
Employer Location: Nvidia, Santa Clara, California, United States