Talent.com
Senior Datacenter System Software Architect - DGX Cloud
Senior Datacenter System Software Architect - DGX CloudNVIDIA • Santa Clara, CA, United States
[error_messages.no_longer_accepting]
Senior Datacenter System Software Architect - DGX Cloud

Senior Datacenter System Software Architect - DGX Cloud

NVIDIA • Santa Clara, CA, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, a deep understanding of distributed systems, familiarity with software testing and deployment, and excellent communication and planning abilities. We also welcome out-of-the-box thinkers who can provide new ideas with strong at execution bias. Expect to be constantly challenged, improving, and evolving for the better. You and other engineers in this team will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of AI-based applications that affect core data science. What are you waiting for if you're creative, passionate about what you do, and love having fun apply today!

We're looking for a highly motivated, creative engineer with strong experience in system software to join the DGX Cloud Software Team. You will lead the architecture, design and implementation of our next generation DGX cloud clusters using latest technologies. On this team, you will do full stack deployment including hardware architecture, workload orchestration and application performance tuning. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.

What you'll be doing :

  • Lead technical activities for data centers with focus on hybrid deployments between cloud and on-prem
  • Providing expertise in infrastructure workflows, including hardware, software release, workload orchestration and application tuning
  • Provide fast and creative solutions for complex problems and write effective, clear and reliable architecture specification
  • Translate requirements to vision, architecture and roadmap
  • Work with engineering teams across NVIDIA to ensure your software integrates seamlessly from the hardware all the way up to the AI training applications.

What we need to see :

  • Masters or PhD in Computer Science, Computer Engineering, Physics or equivalent experience.
  • 9+ years of experience in this field.
  • Data Sciences, Deep Learning, or Machine Learning coursework
  • Ability to seamlessly shift between Linux system environments to Python programming
  • Programming skills in 1 or more high-level languages (C, C++, Go, Rust, etc)
  • System-level experience with both hardware and software
  • Motivated self-starter with an equal balance of strong problem-solving skills and customer-facing communication skills
  • Strong design, coding, analytical, debugging and problem-solving skills
  • Passion for continuous learning and knowledge transfer. Ability to work concurrently with multiple groups locally and abroad in the organization
  • Ways to stand out from the crowd :

  • Experience with GPU deep learning and data sciences. Experience using TensorFlow, PyTorch or other DL framework. Experience working with Docker containers, Slurm, Terraform and Kubernetes
  • CUDA programming and NCCL experience. HPC programming experience including MPI, OpenACC, or other parallel programming tools. Hands-on experience with DGX Cloud, NVIDIA AI Enterprise AI Software, Base Command Manager, NEMO and NVIDIA Inference Microservices.
  • Interest in crafting, analyzing and fixing large-scale distributed systems.
  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
  • Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

    You will also be eligible for equity and benefits.

    Applications for this job will be accepted at least until December 3, 2025.

    NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

    [job_alerts.create_a_job]

    System Architect • Santa Clara, CA, United States

    [internal_linking.similar_jobs]
    System Architect, Networking

    System Architect, Networking

    SiTime Corporation • Santa Clara, CA, US
    [job_card.full_time]
    SiTime Corporation is the precision timing company.Our semiconductor MEMS programmable solutions offer a rich feature set that enables customers to differentiate their products with higher performa...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior AI Data Center Systems Architect — Equity Eligible

    Senior AI Data Center Systems Architect — Equity Eligible

    NVIDIA • Santa Clara, CA, United States
    [job_card.full_time]
    NVIDIA is seeking a Senior Software Architect for their Data Center Systems team in Santa Clara.This role involves leading software activities for deep learning server platforms and collaborating w...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Cloud Advisory Architect, Enterprise Cloud Services, Palo Alto

    Senior Cloud Advisory Architect, Enterprise Cloud Services, Palo Alto

    SAP • Palo Alto, California, USA
    [job_card.full_time] +1
    At SAP we keep it simple : you bring your best to us and well bring out the best in you.Were builders touching over 20 industries and 80% of global commerce and we need your unique talents to help s...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Genomics Senior Systems Architect

    Genomics Senior Systems Architect

    University of California - Santa Cruz • Santa Cruz, CA, United States
    [job_card.full_time] +1
    NO VISA SPONSORSHIP AVAILABLE FOR THIS POSITION.Applicants must have current work authorization when accepting a Genomics Institute staff position. We are unable to sponsor or take over sponsorship ...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Senior Wireless System & Algorithm Design Engineer

    Senior Wireless System & Algorithm Design Engineer

    E-Space • Saratoga, CA, US
    [job_card.full_time]
    Ready to make connectivity from space universally accessible, secure and actionable? Then you’ve come to the right place!. E-Space is bridging Earth and space to enable hyper-scaled deployment...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Lead Wireless System Architect – Multi-Antenna SDR

    Lead Wireless System Architect – Multi-Antenna SDR

    Mulya Technologies • Milpitas, CA, United States
    [job_card.full_time]
    A leading provider of communication solutions is seeking a Principal / Lead Wireless Communications System Architect in California. The ideal candidate will define complex wireless platforms and lea...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior System Architect

    Senior System Architect

    VirtualVocations • Fremont, California, United States
    [job_card.full_time]
    A company is looking for a Senior System Architect.Key Responsibilities Identify, assess, and recommend solutions for functional and technical requirements Develop high-level system design diagr...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Architect - Data Center Systems

    Senior Software Architect - Data Center Systems

    NVIDIA Corporation • Santa Clara, CA, United States
    [job_card.full_time]
    Senior Software Architect - Data Center Systems page is loaded.Senior Software Architect - Data Center Systems.Apply locations US, CA, Santa Clara US, TX, Austin US, TX, Remote US, OR, Hillsboro US...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Storage System Engineer - Supercomputing

    Senior Storage System Engineer - Supercomputing

    Institute of Foundation Models • Sunnyvale, CA, US
    [job_card.full_time]
    About the Institute of Foundation Models.We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next...[show_more]
    [last_updated.last_updated_30] • [promoted]
    SDET Android System III

    SDET Android System III

    Cypress HCM • Mountain View, CA, US
    [job_card.full_time]
    Understand the business requirements and develop and execute comprehensive test strategies, including functional, integration, regression, and performance testing, with a focus on core Android inte...[show_more]
    [last_updated.last_updated_30] • [promoted]
    SoC Memory Subsystem Architect

    SoC Memory Subsystem Architect

    Baidu • Sunnyvale, CA, United States
    [job_card.full_time]
    We are looking for a world-class Memory Subsystem Architect to join our SoC team at Baidu’s Sunnyvale office.The successful candidate will be a motivated self-starter who will thrive in this highly...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Signals Intelligence Systems Architect

    Signals Intelligence Systems Architect

    Northwest Talent Solutions LLC • San Jose, CA, US
    [job_card.full_time] +1
    SIGINT Systems Architect Aerospace & Defense | U.San Jose, CA (On-site, Hybrid, or Remote Options Available).Aerospace / Defense / Intelligence. Northwest Talent Solutions (NWTS) is partnering w...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Business Systems Engineer

    Senior Business Systems Engineer

    CoreWeave • Sunnyvale, CA, US
    [job_card.permanent]
    CoreWeave is The Essential Cloud for AI™.Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confi...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Director, Cloud-Scale System Software

    Senior Director, Cloud-Scale System Software

    Arm • San Jose, California, United States
    [job_card.full_time]
    A leading tech company in California is seeking a Senior Director to lead the development of system software for datacenet solutions. Key responsibilities include guiding R&D efforts, managing budge...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Senior Staff System Architect, Power

    Senior Staff System Architect, Power

    Vishay • San Jose, California, USA
    [job_card.full_time]
    We are seeking great talent to help us build The DNA of tech.Vishay manufactures one of the worlds largest portfolios of discrete semiconductors and passive electronic components that are essential...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Data Center Power Architect for AI Infrastructure

    Senior Data Center Power Architect for AI Infrastructure

    Dell GmbH • Santa Clara, CA, United States
    [job_card.full_time]
    A global technology company is seeking a Technical Staff, Data Center Power Architect in Santa Clara, California.This role involves developing large scale AI infrastructure, engaging with high-prof...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Senior Solutions Architect - Enterprise, Bay Area

    Senior Solutions Architect - Enterprise, Bay Area

    Elastic • Mountain View, CA, United States
    [job_card.full_time]
    Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale - unleashing the potential of businesses and people.The Elastic Search AI...[show_more]
    [last_updated.last_updated_30] • [promoted]
    DSP System Architect

    DSP System Architect

    Credo Semiconductor, Inc. • San Jose, CA, US
    [job_card.full_time]
    We are seeking a DSP System Architect to optimize digital signal processing (DSP) architectures for high-speed SerDes.This role defines system-level architecture, algorithms, and performance needs....[show_more]
    [last_updated.last_updated_30] • [promoted]