Talent.com
NVIDIA
Senior System Software Engineer - Scientific Computing PaaSNVIDIA • Santa Clara, CA, US
Senior System Software Engineer - Scientific Computing PaaS

Senior System Software Engineer - Scientific Computing PaaS

NVIDIA • Santa Clara, CA, US
30+ days ago
Job type
  • Full-time
  • Remote
Job description

We are seeking a

Sr System Software Engineer to help us build out our scientific computing platform on Nvidia DGX Cloud. We are building a cloud based accelerated scientific computing platform as a service on the Nvidia DGX cloud. This DGX scientific computing cloud platform enables Physics based Numerical Simulation Solvers, AI based Training, Inference and Visualization workflow for physical science and engineering problems.

Those applications include Weather prediction, Climate modeling, Industrial design and Digital twins simulation in various domains e.g Aerospace, Automotive, Sports, Renewable energy, Bio-medical and many more.

Are you passionate about solving rewarding problems at scale? Do you enjoy crafting robust, critical services for compute and data intensive workload? If so, you may be a phenomenal fit for our team!

What you’ll be doing:

  • Design, Build, Deploy and Operate Cloud native microservices and APIs for scientific computing workload on DGX cloud.

  • Design services and take ownership of underlying cloud infrastructure for physics informed and data driven scientific workflows

  • Design novel algorithms and actively engaged with operations to increase overall system performance, it spans across the stack e.g. deep understanding of application code e.g DL Framework, Numerical Solvers, Microservices, APIs and Heterogeneous accelerated computing with CPUs and GPUs.

  • Design, Build, Deploy and Operate scalable I/O infrastructure for checkpointing, data loading, pre & post processing of data.

  • Optimize compute, storage and network architecture specific to physics & simulation driven applications.

What we need to see:

  • BS/MS degree in Computer Science or related areas or equivalent experience.

  • 10+ years experience working on building and operating distributed compute and data intensive platform as a service on cloud

  • Proven skill in a compiled language (Go, Rust, C++ or otherwise).

  • Strong foundational knowledge in Cloud Computing e.g “The Datacenter is a Computer” architecture, cloud security architecture, virtualization - CPU, Memory and IO, Resource pooling and elasticity.

  • Proven skills in Distributed Systems & Parallel Processing e.g System model of distributed computation e.g. topology abstraction, logical time. Synchronization and deadlock detection in distributed systems, Fault Tolerance and Failure Detection, Consensus and Agreement protocols, Parallel algorithms, shared memory and distributed memory architecture, message passing (MPI, NCCL), Cluster scalability and performance.

  • Hands on Debugging skills with Process, Threads , Deadlock and Synchronization, Scheduling, IPC, Memory management, File system and I/O structure.

  • Strong Evidence on Algorithmic Thinking & System Design skills e.g Recursion, Graph, Tree, Stack and Queue, Large scale loosely coupled distributed system design and operational experience.

  • Be self-motivated, have strong interpersonal skills, and be able to work independently with multiple teams with minimal direction.

Ways to stand out from the crowd:

  • Have built , deployed and operated AI platforms on HPC clusters. Have built, deployed and operated cloud native system including distributed storage, scheduling, and orchestration among compute, storage and network

  • Configuring and troubleshooting hardware, operating systems, kernel, compilers for maximum performance

  • Hands on debugging skills to optimize performance of compute, networking and I/O framework. Extensively worked on third party source code for debugging and customization

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#deeplearning
Create a job alert for this search

Senior System Software Engineer - Scientific Computing PaaS • Santa Clara, CA, US

Similar jobs

Senior DSP Software Engineer

Tarana Wireless, Inc.Milpitas, CA, US
$130,000.00 yearly
Full-time
Quick Apply

Join the Team That's Redefining Wireless Technology At  Tarana , we're more than just a fast-growing tech company—we’re a team of bold innovators on a mission to revolutionize broa... Show more

Bioinformatics Software Engineer

AccuraGenSan Jose, CA, US
Full-time
Quick Apply

As a Bioinformatics Software Engineer at AccuraGen, you will develop, optimize, and maintain bioinformatics pipelines utilizing AWS infrastructure and Nextflow workflow management systems.Your cont... Show more

Senior Software Engineer

DataVisorMountain View, CA, US
Full-time
Quick Apply

DataVisor is the world’s leading AI-powered Fraud and Risk Platform that delivers the best overall detection coverage in the industry.With an open SaaS platform that supports easy consolidation and... Show more

(1) Senior AWS DevOps and System Administrator

eTrigueSan Jose, CA, US
$120,000.00 yearly
Full-time +2
Quick Apply

We are seeking an extraordinary Senior AWS DevOp and System Administrator to join our team at eTrigue.Are you a Senior AWS DevOp who is energized by designing, deploying and maintaining infrastruct... Show more

Senior Sales Engineer

Contrast SecurityPleasanton, CA, United States
Full-time

At Contrast Security, we're redefining how organizations protect their software at the speed of modern development.With industry-leading Application Detection and Response (ADR), we give teams the ... Show more

 • Promoted

System Engineer

TradeJobsWorkForce95128 San Jose, CA, US
Full-time

System Engineer Job Duties: Manages and monitors all installed systems and infrastructure for ... Show more

 • Promoted

Senior Software Engineer - Circuit Simulation - US Remote*

Siemens EDA (Siemens Digital Industries Software)Fremont, CA, United States
Remote
Full-time

Senior Software Engineer - Circuit Simulation - US Remote at Siemens EDA (Siemens Digital Industries Software) Siemens EDA is a global technology leader in Electronic Design Automation software.Our... Show more

 • Promoted

System Architect

EVONAFremont, CA, United States
Full-time

EVONA Space is seeking an experienced.This role sits at the intersection of spacecraft systems engineering and next-generation AI-enabled mission integration.Ranked recommendations with clear engin... Show more

 • Promoted

Sr Advanced Systems Engineer [Remote, sign-on bonus]

Progeny Systems (Acquired by General Dynamics)San Jose, CA, United States
Remote
Full-time

Job TitleSystems EngineerBasic QualificationsRequires a Bachelor's degree in Systems Engineering, or a related Science, Engineering or Mathematics field.Also requires 8years of job-related experien... Show more

 • Promoted

Senior Sales Engineer

Eaton PlcPleasanton, CA, United States
Full-time

Eaton's ES AMER NAS division is currently seeking a Senior Sales Engineer! The primary function of the Senior Sales Engineer is to sell assigned product lines to targeted customers in the San Franc... Show more

 • Promoted

Physician (MD/DO) - Pediatrics - General/Other - $273,400 per year in Santa Cruz, CA

LocumJobsOnlineSanta Cruz, CA, US
Full-time +1

Doctor of Medicine | Pediatrics - General/Other.LocumJobsOnline is working with CompHealth to find a qualified Pediatrics MD in Santa Cruz, California, 95060!.Santa Cruz is a city on the central Ca... Show more

 • Promoted

Advanced Electronics / Computer Field Technician

US NavyBrookdale, CA, US
Full-time

Advanced Electronics / Computer Field (ET/FC).The Advanced Electronics and Computer Field trains Sailors to maintain, operate, and repair some of the Navy’s most sophisticated electronics and compu... Show more

 • Promoted

Senior Software Engineer - Circuit Simulation - US Remote o

SIEMENS AGFremont, CA, United States
Remote
Full-time

Job Family :SoftwareReq ID :484868Siemens EDA is a global technology leader in Electronic Design Automation software.Our software tools enable companies around the world to develop highly innovativ... Show more

 • Promoted

Senior Backend Engineer (High-Throughput Platforms)

Bright Vision TechnologiesFremont, CA, US
$100,000.00 yearly
Full-time
Quick Apply

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations.We leverage cutt... Show more

Gastroenterology Physician

CommonSpirit HealthSanta Cruz, CA, US
Full-time

Job Summary and Responsibilities.Coastal CA: Exceptional Opportunity for a Flexible GI Practice.Dignity Health Medical Group - Dominican is seeking a team oriented, part time General GI Physician i... Show more

 • Promoted

Senior Software Engineer - Cloud SaaS & FinTech

Blackline Systems IncPleasanton, CA, United States
Full-time

A leading SaaS company is seeking a Sr.Software Engineer to design, develop, and optimize cloud-based services.You will deliver high-quality product features while collaborating with cross-function... Show more

 • Promoted

Senior Software Engineer - Core Team - San Jose, CA

ZEDEDA IncSan Jose, California, United States, 95113
$150,000.00 yearly
Full-time
Quick Apply

ZEDEDA unlocks the value of AI where it matters most, enabling enterprises to create, secure and operate edge AI at scale.ZEDEDA’s Edge Intelligence products and solutions are used by global distri... Show more

Embedded Systems Engineer

Warmboard Inc.Scotts Valley, California, United States
$90,000.00 yearly
Full-time

Salary: $90,000 - 130,000 per year.We require a bachelors degree, or equivalent professional experience, in computer engineering, software engineering, or a closely related technical discipline.We ... Show more

 • Promoted

Remote Sales and Service Agent - Aptos Hills-Larkin Valley

HMG Careerslarkin valley, ca, us
Remote
Full-time

Are you ready to join an exceptional team that offers comprehensive training, benefits, and flexible working hours? Our ideal candidate embodies qualities such as adaptability, trainability, and a ... Show more

 • Promoted • New!

Senior Infra Linux Engineer

eTeam IncPleasanton, California, United States
$44.00–$48.00 hourly
Full-time
Quick Apply

Infra Linux Engineer s primary function will be to advance the infrastructure team from a traditional infrastructure methodology to an infrastructure as code approach.You will be responsible for ma... Show more