Talent.com
Senior Manager, Professional Services HPC Deployment
Senior Manager, Professional Services HPC DeploymentNVIDIA • Remote, NC, US
Senior Manager, Professional Services HPC Deployment

Senior Manager, Professional Services HPC Deployment

NVIDIA • Remote, NC, US
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
  • [filters.remote]
[job_card.job_description]

NVIDIA is in search of an HPC Deployment Manager to bolster its Professional Services division. Across academia and industry, NVIDIA's products are driving ground-breaking advancements in deep learning, data analytics, and the optimization of data centers. Join our team, where we are at the forefront of constructing some of the globe's most expansive and rapid data centers! We seek an individual capable of supervising the deployment of cutting-edge InfiniBand and Ethernet technologies with a team comprising AI and HPC experts. This role demands dynamic interpersonal abilities and a customer-centric approach.

The chosen candidate will engage with clients, collaborators, and internal units to assess, delineate, and complete large-scale AI/HPC initiatives. They will orchestrate the day-to-day operations, guidance, and cultivation of a multi-layered team of HPC service professionals. This entails ensuring the timely delivery of a varied spectrum of AI HPC data center projects. Furthermore, this role offers an opportunity to thrive within a fast-paced, inventive, and technologically sophisticated atmosphere, emphasizing unparalleled performance and the exploration of an array of novel hardware and software technologies in AI supercomputing.

What you will be doing:

  • Directs and supervises the service HPC engineering functions in designing, developing, installing, and validating hardware and software for the Customer AI High-Performance Computing (HPC) systems.

  • Leads, handles, mentors, and builds a very hardworking HPC service engineering team to deliver innovative advances in high-performance computing AI systems.

  • Responsible for leading our HPC projects' planning, implementation, and performance. Improves the integrity of system services bring-up and related by applying groundbreaking technical and operational knowledge to configure and maintain HPC AI network and server platforms.

  • Drives HPC team hardware and software deployment, plans, develops, and deploys procedures for system validation.

  • Lead team activities and drive tests and plans for Customer's HPC AI systems implementations, custom scripts, and testing procedures to ensure operational reliability for the system.

  • Supports the HPC Engineering team, working with other internal collaborators to develop and run a well-rounded strategy for delivering service quality and continuous service improvement. Supports governance for software engineering through the implementation of standards and quality measures.

  • Leads team member development, helping them set and achieve goals for their career growth. Develop an inclusive environment that values team member differences, creating a sense of belonging and appreciation. Chips in to a culture of trust and clarity.

  • Build strong relationships with INVIDIA leaders, customers, partners, and collaborators. Works closely to identify, implement, and support leading NVIDIA's AI solutions engineering, maintaining currency with industry standards and innovations. Provides input around process optimization, department budgeting, and the monitoring and management of resources.

  • Be the domain authority with customers during planning calls through implementation.

What we need to see:

  • 8+ overall years' experience in IT, high-performance computing, or other related field; 3+ years of experience in a management or leadership role

  • Demonstrated expertise in HPC systems design configuration and planning.

  • Proficiency with low latency/high-bandwidth interconnect infrastructure (Infiniband and Ethernet).

  • Expertise with HPC system software cluster management/provisioning tools, including job schedulers (Slurm, salt, xCAT).

  • Proficiency with shared and distributed memory parallelism (OpenMP, MPI, NCCL and HPL) and accelerators (GPUs).

  • Strong scripting ability (Bash, Perl, Python, etc.) and experience with programming fundamentals.

  • Expertise with administration, supervising and maintaining secure Linux/Unix operating systems (CentOS, Solaris).

  • Experience establishing processes for maintaining system performance, managing best-in-class standards, and familiarity with cloud computing and container technologies.

  • Ability to understand and work with large, sophisticated systems, identify and resolve problems, handle performance, and troubleshoot network issues related to infrastructure.

  • Expertise with multi-vendor hardware/software management, security, and network/Internet protocols. Strong communication and social skills, with the ability to provide detailed information and high-level summaries to management-level individuals and groups, present the business side of technical topics to non-technical audiences, and develop positive working relationships and strong rapport with team members.

  • Bachelor's degree in computer science, information systems, or a related field or equivalent experience

  • Solid knowledge of HPC storage

  • Exemplary communication and interpersonal skills, with the ability to present the business side of technical topics to non-technical audiences and persuasively and optimally get along with relationships with various stakeholders and diverse individuals and groups

Ways to stand out from the crowd:

  • InfiniBand experience.

  • Experience with GPU-focused hardware/software.

  • Experience with MPI.

  • Automation tooling background (Ansible, Salt, Puppet, etc.).

  • Ethernet and Storage technologies such as Lustre or GPFS.

The base salary range is 208,000 USD - 327,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

[job_alerts.create_a_job]

Senior Manager, Professional Services HPC Deployment • Remote, NC, US

[internal_linking.similar_jobs]
Service Access Manager - DRH ED

Service Access Manager - DRH ED

Dukehealth.org • Durham, NC, United States
[job_card.full_time]
At Duke Health, we're driven by a commitment to compassionate care that changes the lives of patients, their loved ones, and the greater community.No matter where your talents lie, join us and disc...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Director, HR Services Delivery Center Leader (Americas)

Director, HR Services Delivery Center Leader (Americas)

Kyndryl • Durham, NC, United States
[job_card.full_time] +1
At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day.So why work at Kyndryl? We are always moving forward - always pushing ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Director, Advanced Informatics Lead

Senior Director, Advanced Informatics Lead

Regeneron Pharmaceuticals, Inc • NC, United States
[job_card.full_time]
Regeneron’s growing portfolio is accompanied by ever-increasing amounts of research and clinical data.We are seeking a leader in advanced analytics who can harness the power and insights within our...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Product Architect - CA 7 Distributed Components

Senior Product Architect - CA 7 Distributed Components

Broadcom Corporation • Durham, NC, United States
[job_card.full_time]
If you are a first time user, please create your candidate login account before you apply for a job.If you already have a Candidate Account, please Sign-In before you apply.Broadcom's Mainframe Sof...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Marketing Manager, Product and Service

Senior Marketing Manager, Product and Service

BD (Becton, Dickinson and Company) • Durham, NC, United States
[job_card.full_time]
Join us and shape the future of pharmacy: The Senior Marketing Manager, Core product line and Service is a strategic leader within the US Pharmacy Automation Marketing team.This role is responsible...[show_more]
[last_updated.last_updated_1_day] • [promoted]
SOLUTIONS & SERVICE MANAGER

SOLUTIONS & SERVICE MANAGER

Durham County • Durham, NC, United States
[job_card.full_time]
Durham County Government is home to over 2,000 dedicated professionals working together to deliver essential services that strengthen and support our vibrant, diverse community.As the heart of a fa...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Service Director

Service Director

Woodward Management Partners • Apex, NC, United States
[job_card.full_time]
SERVICE DIRECTOR JOB DESCRIPTION.Responsible for all phases of maintenance operations while operating within budgeted financials goals of property under direction of Community Director.Performs var...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Development Services Manager

Development Services Manager

Orange Water and Sewer Authority • Carrboro, NC, US
[job_card.full_time]
OWASA seeks an enthusiastic professional to lead our Development Services team.This position ensures that external projects affecting OWASA water, sewer, and reclaimed water systems are designed an...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Marketing Manager, Product and Service

Senior Marketing Manager, Product and Service

Becton, Dickinson and Company • Durham, NC, United States
[job_card.full_time]
Join us and shape the future of pharmacy: The Senior Marketing Manager, Core product line and Service is a strategic leader within the US Pharmacy Automation Marketing team.This role is responsible...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Engineer Manager - Upstream

Senior Engineer Manager - Upstream

Amgen • Holly Springs, NC, United States
[job_card.full_time]
Join Amgen's Mission of Serving Patients.At Amgen, if you feel like you're part of something bigger, it's because you are.Our shared mission-to serve patients living with serious illnesses-drives a...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Manager, IT Category Management

Manager, IT Category Management

Blue Cross and Blue Shield of North Carolina • Chapel Hill, North Carolina, United States
[job_card.full_time]
The Manager, IT Category Management, is responsible for directing all sourcing and vendor contracting activities within the assigned categories.This role supervises staff engaged in the sourcing pr...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Client Service Manager, Large Pharma

Client Service Manager, Large Pharma

IQVIA • Durham, NC, United States
[job_card.part_time]
The Client Service Manager plays an important role in supporting IQVIA's largest pharmaceutical clients by ensuring strong operational execution, commercial coordination, business development suppo...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Service Access Manager - DRH ED

Service Access Manager - DRH ED

Duke University • Durham, NC, United States
[job_card.full_time]
At Duke Health, we're driven by a commitment to compassionate care that changes the lives of patients, their loved ones, and the greater community.No matter where your talents lie, join us and disc...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Technical Services Lead

Technical Services Lead

Novartis Group Companies • Durham, NC, United States
[job_card.full_time]
The Technical Service Lead oversees the Asset Lifecycle organization, which is responsible to establish, maintain and improve the maintenance.The Asset Lifecycle Lead is also responsible to provide...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Sr. IT Manager Revenue Cycle Applications

Sr. IT Manager Revenue Cycle Applications

Duke Clinical Research Institute • Durham, North Carolina, United States
[job_card.full_time]
At Duke Health, we're driven by a commitment to compassionate care that changes the lives of patients, their loved ones, and the greater community.No matter where your talents lie, join us and disc...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Senior Manager, Project Management

Senior Manager, Project Management

Grifols Shared Services North America, Inc • Durham, NC, United States
[job_card.full_time]
Would you like to join an international team working to improve the future of healthcare? Do you want to enhance the lives of millions of people? Grifols is a global healthcare company that since 1...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Special Projects Manager

Special Projects Manager

Self-Help • Durham, NC, United States
[job_card.full_time]
The Center for Responsible Lending (CRL) is a nonprofit, non-partisan organization working to ensure a fair, inclusive financial marketplace that creates opportunities for all.Through research, leg...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
BCG Platinion | Manager, Tech Advisory & Delivery

BCG Platinion | Manager, Tech Advisory & Delivery

BCG Digital Ventures • Durham, NC, United States
[job_card.full_time]
Locations: Atlanta | Austin | Boston | Brooklyn | Chicago | Dallas | Denver | Detroit | Durham | Houston | Miami | Minneapolis | Nashville | New York | Philadelphia | Pittsburgh | Summit | Washingt...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]