Senior DevOps Engineer

Nvidia

Santa Clara, CA

Job posting number: #7276733 (Ref:JR1986565)

Posted: August 31, 2024

Job Description

NVIDIA is looking for a world class engineer to join its multifaceted and fast-paced Infrastructure, Planning and Processes organization where you will be working as a Senior DevOps Engineer. The position will be part of a fast-paced crew that develops and maintains sophisticated build & test environments for a multitude of hardware platforms both NVIDIA GPUs and Tegra Processors along with various operating systems (Windows/Linux/Android). The team works with various other business units within NVIDIA Software such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their infrastructure & system’s needs.

As a DevOps Engineer, you’ll also be working in conjunction with various teams such as software engineering to deploy these new products and manage our infrastructure, associated processes and systems. Keen attention to detail, problem-solving abilities, and a solid knowledge base are essential.

What you’ll be doing:

  • Participate in the design, implementation and enhancement of automated SW testing infrastructures for the automotive & mobile Tegra platforms, bring up tasks for new Tegra platforms.

  • Enhance and develop new modules for harness in python and shell scripts.

  • Develop, Improve and Maintain our infrastructure codebase. Implement & support end-to-end CI/CD system

  • Drive automation of monitoring to gain more insight into applications and system health.

  • Working with software engineering teams as well as internal support groups world-wide to ensure that our software is produced, tested and delivered to the customer in a consistent and effective manner that meets our world-class standards

  • Develop and implement modules/pipelines to streamline onboarding/maintenance of Nvidia tegra/gpu devices in k8 based Nvidia validation systems/queues.

  • Debug & fix existing and new tests, system configurations and hardware setup of Tegra’s.

  • Maintain systems once they are live by measuring and monitoring availability, latency and overall system health.

What we need to see:

  • Strong object-oriented programming background, Java, Python preferred.

  • Experience of maintaining cloud infrastructure and highly-available production environment and excellent debugging, problem solving and analytical skills

  • Strong understanding of architectural requirements and development processes involved in building reliable, robust, scalable data products and pipelines

  • Background in Databases both SQL (MySQL) and NoSQL (Elastic Search /MongoDB/Cassandra) and proficient with configuration management tools like Ansible, Puppet & Chef and strong background with CI/CD systems.

  • Experience in Kubernetes, dockers & virtualization and data analytics/visualization tools like Kibana, Grafana, Splunk etc.

  • Background with source code management and binary repository systems like GitLab, GitHub, Artifactory etc.

  • Knowledge of monitoring systems such as Zabbix, Prometheus and/or similar systems and advanced knowledge of standard methodologies related to security.

  • Bachelor's or Master’s degree in Computer Science, Software Engineering (or equivalent experience) with 5+ years of proven industry experience.

Ways to stand out from the crowd:

  • Ability to analyze situations and utilize troubleshooting skills, systems and tools, and problem solving abilities

  • Prior experience on embedded & mobile systems, and large scale operations team.

  • Experience with using and improving data centers. Knowledge and experience in Linux, Windows, Android and embedded OS like QNX is a plus.

  • Background with computer algorithms and ability to choose the best possible algorithms to meet the scaling challenge.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!

The base salary range is 140,000 USD - 258,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.





Apply Now

Please mention to the employer that you saw this ad on AiCareers.com

More Info

Job posting number:#7276733 (Ref:JR1986565)
Application Deadline:Open Until Filled
Employer Location:Nvidia
Santa Clara,California
United States
More jobs from this employer