AI Infrastructure Software Lead Job at NVIDIA, Santa Clara, CA

  • NVIDIA
  • Santa Clara, CA

Job Description

Salary: $224,000 - $356,500 per year

Requirements:
  • A total of 10 years of experience in the industry focused on large distributed system software development.
  • A Bachelor's degree in Computer Science or a related field, or equivalent experience.
  • At least 5 years of experience in managing AI and software development teams.
  • Familiarity with modern software development stacks and tools, including containerization and deployments (cloud or on-premises), API integration for seamless model operation, and real-time processing frameworks.
  • Experience in developing and maintaining LLM or GenAI infrastructure.
  • Strong communication, collaboration, and problem-solving skills, with a commitment to fostering an inclusive and diverse workplace.
  • Practical experience in developing large-scale distributed systems.
Responsibilities:
  • Mentor, cultivate, and develop a top-tier team of AI infrastructure engineers.
  • Collaborate across various teams and organizations to create products utilizing LLMs and agent systems that meet the needs of NVIDIA engineering teams.
  • Work closely with research and infrastructure teams to serve a broad user base, including hardware and software teams throughout NVIDIA.
  • Align priorities among collaborators and establish metrics to gauge the success of the product/team.
  • Formulate and implement strategies for scalable, reliable, and secure AI infrastructure that supports both research and production workloads.
  • Ensure robust monitoring, logging, visualization, and alerting capabilities to guarantee promised uptime and operational excellence.
  • Architect, design, develop, and maintain infrastructure and large-scale applications for LLM-based solutions while optimizing these systems for performance, scalability, reliability, and secure data management.
  • Stay abreast of the latest trends in AI, ML, and infrastructure, actively seeking opportunities to integrate advancements into NVIDIA’s LLM and AI infrastructure solutions.
Technologies:
  • AI
  • API
  • Architect
  • Cloud
  • Hardware
  • LLM
  • DevOps

More:

At NVIDIA, we have a history of continuously reinventing ourselves over the past two decades. The invention of the GPU in 1999 spurred the growth of the PC gaming market, transformed modern computer graphics, and revolutionized parallel computing. Recently, GPU deep learning has propelled modern AI and launched a new era of computing. We envision NVIDIA as a "learning machine" that evolves by addressing challenging opportunities that matter to the world and that only we can tackle. Our mission is to amplify human imagination and intelligence while expanding the realms of possibility. We are on the lookout for strategic, bold, diligent, and innovative individuals who share our passion for addressing complex challenges. Become part of our journey today.

As one of the technology industry's most desirable employers, we offer competitive salaries and an extensive benefits package. Your base salary will be determined by your location, experience, and the compensation of employees in similar roles. The salary range for this position is $224,000 to $356,500 for Level 3, and $272,000 to $425,500 for Level 4. Equity and benefits will also be part of your compensation. Applications for this position will be accepted at least until July 29, 2025. NVIDIA is committed to fostering a diverse workplace and is proud to be an equal opportunity employer. We celebrate diversity and do not discriminate in our hiring or promotional practices based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Last updated: week 42 of 2025

Job Tags

Full time
