Talent.com
Senior Datacenter System Software Architect - DGX Cloud
Senior Datacenter System Software Architect - DGX CloudNVIDIA • Santa Clara, CA, United States
[error_messages.no_longer_accepting]
Senior Datacenter System Software Architect - DGX Cloud

Senior Datacenter System Software Architect - DGX Cloud

NVIDIA • Santa Clara, CA, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, a deep understanding of distributed systems, familiarity with software testing and deployment, and excellent communication and planning abilities. We also welcome out-of-the-box thinkers who can provide new ideas with strong at execution bias. Expect to be constantly challenged, improving, and evolving for the better. You and other engineers in this team will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of AI-based applications that affect core data science. What are you waiting for if you're creative, passionate about what you do, and love having fun apply today!

We're looking for a highly motivated, creative engineer with strong experience in system software to join the DGX Cloud Software Team. You will lead the architecture, design and implementation of our next generation DGX cloud clusters using latest technologies. On this team, you will do full stack deployment including hardware architecture, workload orchestration and application performance tuning. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.

What you'll be doing :

  • Lead technical activities for data centers with focus on hybrid deployments between cloud and on-prem
  • Providing expertise in infrastructure workflows, including hardware, software release, workload orchestration and application tuning
  • Provide fast and creative solutions for complex problems and write effective, clear and reliable architecture specification
  • Translate requirements to vision, architecture and roadmap
  • Work with engineering teams across NVIDIA to ensure your software integrates seamlessly from the hardware all the way up to the AI training applications.

What we need to see :

  • Masters or PhD in Computer Science, Computer Engineering, Physics or equivalent experience.
  • 9+ years of experience in this field.
  • Data Sciences, Deep Learning, or Machine Learning coursework
  • Ability to seamlessly shift between Linux system environments to Python programming
  • Programming skills in 1 or more high-level languages (C, C++, Go, Rust, etc)
  • System-level experience with both hardware and software
  • Motivated self-starter with an equal balance of strong problem-solving skills and customer-facing communication skills
  • Strong design, coding, analytical, debugging and problem-solving skills
  • Passion for continuous learning and knowledge transfer. Ability to work concurrently with multiple groups locally and abroad in the organization
  • Ways to stand out from the crowd :

  • Experience with GPU deep learning and data sciences. Experience using TensorFlow, PyTorch or other DL framework. Experience working with Docker containers, Slurm, Terraform and Kubernetes
  • CUDA programming and NCCL experience. HPC programming experience including MPI, OpenACC, or other parallel programming tools. Hands-on experience with DGX Cloud, NVIDIA AI Enterprise AI Software, Base Command Manager, NEMO and NVIDIA Inference Microservices.
  • Interest in crafting, analyzing and fixing large-scale distributed systems.
  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
  • Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

    You will also be eligible for equity and benefits.

    Applications for this job will be accepted at least until December 3, 2025.

    NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

    [job_alerts.create_a_job]

    System Architect • Santa Clara, CA, United States

    [internal_linking.related_jobs]
    Senior AI Data Center Systems Architect — Equity Eligible

    Senior AI Data Center Systems Architect — Equity Eligible

    NVIDIA • Santa Clara, CA, United States
    [job_card.full_time]
    NVIDIA is seeking a Senior Software Architect for their Data Center Systems team in Santa Clara.This role involves leading software activities for deep learning server platforms and collaborating w...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Cloud Advisory Architect, Enterprise Cloud Services, Palo Alto

    Senior Cloud Advisory Architect, Enterprise Cloud Services, Palo Alto

    SAP • Palo Alto, California, USA
    [job_card.full_time] +1
    At SAP we keep it simple : you bring your best to us and well bring out the best in you.Were builders touching over 20 industries and 80% of global commerce and we need your unique talents to help s...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Strategic Cloud Architect for Semiconductors

    Strategic Cloud Architect for Semiconductors

    Google Inc. • Sunnyvale, CA, United States
    [job_card.full_time]
    A leading technology company is seeking a Principal Architect IV for Google Cloud in California.This role requires a blend of technical and business acumen, focusing on building strategic relations...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Specialist II TIS System Architect

    Specialist II TIS System Architect

    Enbridge • Danville, CA, United States
    [job_card.full_time]
    Employer Industry : Natural Gas Services.Why Consider this Job Opportunity : .Competitive benefits and pension plan.Opportunity for career advancement and growth within the organization.Flexibility to...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Cloud Architect

    Cloud Architect

    TEKsystems • San Jose, CA, United States
    [job_card.full_time]
    TEKsystems is seeking a Cloud Architect for an on-site role in San Jose, CA or Lehi, UT •.This is a long term contract position. Must be authorized to work in the US for any employer.Bachelor's degre...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Genomics Senior Systems Architect

    Genomics Senior Systems Architect

    University of California - Santa Cruz • Santa Cruz, CA, United States
    [job_card.full_time] +1
    NO VISA SPONSORSHIP AVAILABLE FOR THIS POSITION.Applicants must have current work authorization when accepting a Genomics Institute staff position. We are unable to sponsor or take over sponsorship ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Lead Wireless System Architect – Multi-Antenna SDR

    Lead Wireless System Architect – Multi-Antenna SDR

    Mulya Technologies • Milpitas, CA, United States
    [job_card.full_time]
    A leading provider of communication solutions is seeking a Principal / Lead Wireless Communications System Architect in California. The ideal candidate will define complex wireless platforms and lea...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior System Design Engineer

    Senior System Design Engineer

    Cepton • San Jose, CA, US
    [job_card.full_time]
    CEPTON, a leading intelligent lidar solution provider, is seeking a seasoned Senior System Design Engineer to support R&D development of high-performance 3D LiDAR products, and also support n...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Software Architect - Data Center Systems

    Senior Software Architect - Data Center Systems

    NVIDIA Corporation • Santa Clara, CA, United States
    [job_card.full_time]
    Senior Software Architect - Data Center Systems page is loaded.Senior Software Architect - Data Center Systems.Apply locations US, CA, Santa Clara US, TX, Austin US, TX, Remote US, OR, Hillsboro US...[show_more]
    [last_updated.last_updated_30] • [promoted]
    System Architect, Simulations & Models

    System Architect, Simulations & Models

    black.ai • Palo Alto, CA, US
    [job_card.full_time]
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    SoC Memory Subsystem Architect

    SoC Memory Subsystem Architect

    Baidu • Sunnyvale, CA, United States
    [job_card.full_time]
    We are looking for a world-class Memory Subsystem Architect to join our SoC team at Baidu’s Sunnyvale office.The successful candidate will be a motivated self-starter who will thrive in this highly...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Software Architect Engineer permanent position at San Jose, CA

    Senior Software Architect Engineer permanent position at San Jose, CA

    MIT RESOURCE • San Jose, CA, US
    [job_card.permanent]
    Senior Software Architect Engineer permanent position at San Jose, CA Title : Senior Software Architect Engineer Type : permanent Location : San Jose, CA A Medical Device Company Located in San J...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Staff System Architect, Power

    Senior Staff System Architect, Power

    Vishay • San Jose, California, USA
    [job_card.full_time]
    We are seeking great talent to help us build The DNA of tech.Vishay manufactures one of the worlds largest portfolios of discrete semiconductors and passive electronic components that are essential...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    System Solutions Architect -Large Clusters for AI & HPC workloads

    System Solutions Architect -Large Clusters for AI & HPC workloads

    AMD • San Jose, CA, US
    [job_card.full_time]
    System Solutions Architect - Large Clusters for AI & HPC workloads 2 days ago Be among the first 25 applicants.Your actual pay will be based on your skills and experience — talk with your recru...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Principal Data Center Solutions Architect

    Principal Data Center Solutions Architect

    Supermicro • San Jose, CA, United States
    [job_card.full_time]
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Solutions Architect - Enterprise, Bay Area

    Senior Solutions Architect - Enterprise, Bay Area

    Elastic • Mountain View, CA, United States
    [job_card.full_time]
    Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale - unleashing the potential of businesses and people.The Elastic Search AI...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Distinguished System Architect - Innovation

    Distinguished System Architect - Innovation

    Infineon Technologies AG • San Jose, CA, United States
    [job_card.full_time]
    As a Distinguished System Architect - Innovatio, you are a senior Technical Leader and Innovator who owns and drives thought leadership in important depth or breadth areas relevant for Connected Se...[show_more]
    [last_updated.last_updated_30] • [promoted]
    AI Memory Architect for Cloud Datacenters

    AI Memory Architect for Cloud Datacenters

    Micron Technology, Inc. • San Jose, CA, US
    [job_card.full_time]
    A leading technology firm in San Jose is seeking a Memory Innovation Engineer to drive research in advanced memory architectures for AI and Datacenter applications. The ideal candidate will have a P...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]