Talent.com
Director, Technical Program Management - AI and ML Platforms
Nvidia Corporation • Santa Clara, CA, United States
Full-time

The DGX Cloud organization builds and operates the AI infrastructure that makes this innovation possible. We are seeking a Director of Technical Program Management (TPM) to lead AI / ML Platform initiatives within the DGX Cloud Infrastructure team. This role will coordinate large-scale, cross-functional programs that shape how NVIDIA researchers develop, train, and deploy AI models on our global DGX Cloud platform. You will lead a team of TPMs responsible for orchestrating compute platforms, cluster bring-ups, workload scheduling, and platform enablement across NVIDIA's most advanced GPU systems.

As Director of Technical Program Management for AI / ML Platforms, your mission is to accelerate NVIDIA's research and product innovation by delivering a resilient, high-performance AI platform that seamlessly integrates hardware, orchestration, and developer productivity. You will bridge NVIDIA Research, DGX Engineering, and Cloud Operations, ensuring our infrastructure evolves to meet the rapidly expanding scale and complexity of AI workloads.

What You'll Be Doing:

  • Lead and scale the Technical Program Management organization responsible for the DGX Cloud AI / ML platform, enabling 1,000+ NVIDIA researchers globally.
  • Drive the roadmap for end-to-end AI / ML infrastructure, spanning cluster bring-up, workload orchestration, GPU resource management, and integration with MLOps pipelines.
  • Collaborate with technology and research leaders to define platform requirements, align compute strategy with AI model development, and deliver a seamless researcher experience.
  • Lead complex programs involving next-generation systems (e.g., GB200) and fleet-wide scaling initiatives across OCI, GCP, and other hyperscalers.
  • Own platform efficiency and capacity management, using a deep understanding of scheduling systems (e.g., Slurm, hybrid models) to optimize job placement, utilization, and turnaround time.
  • Establish data-driven operational metrics (availability, occupancy, wait times, throughput) and use them to guide continuous improvement and prioritization.
  • Implement governance and visibility frameworks that drive alignment, predictability, and accountability across AI platform initiatives.
  • Represent DGX Cloud programs to senior leadership, clearly articulating impact, risk, and value across engineering and research organizations.

What We Need to See:

  • 15+ years of overall technical program management experience, including 7+ years leading and developing TPM teams in infrastructure, AI / ML, or platform engineering domains.
  • Demonstrated success delivering large-scale AI / ML systems and platform initiatives, encompassing workload orchestration, data pipeline integration, model training environments, and GPU fleet management.
  • Deep technical understanding of AI / ML workflows, job scheduling (Slurm, Kubernetes, hybrid orchestration), and large-scale distributed systems.
  • Proficiency in optimizing resource usage and monitoring performance metrics in compute-heavy settings.
  • Experience building platforms across cloud and on-prem hybrid architectures, integrating with internal and external MLOps stacks.
  • Proficiency with observability and telemetry tools (e.g., Grafana, Prometheus) for infrastructure monitoring and performance analysis.
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field (or equivalent experience).

Ways to Stand Out from the Crowd:

  • Proficiency in AI / ML systems, model lifecycle management, and developer tooling for large-scale training workloads.
  • Track record of driving R&D productivity platforms and reducing friction for machine learning practitioners.
  • Experience in new product introduction (NPI) for research and infrastructure systems.
  • Deep familiarity with cloud compute and orchestration technologies, and a passion for automation and operational excellence.
  • Executive communication skills, able to translate complex technical programs into clear business and research outcomes.
NVIDIA is widely considered one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on our team. If you're driven, excited by tech and AI, creative and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 264,000 USD - 402,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until November 3, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
