Talent.com
GPU Performance Engineer
GPU Performance EngineerGenmo • San Francisco, CA, US
GPU Performance Engineer

GPU Performance Engineer

Genmo • San Francisco, CA, US
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Job Description

Job Description

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation.

We're seeking a GPU Performance Engineer to squeeze every last FLOP from our H100 infrastructure and optimize our model serving stack to its absolute limits.

The Role

You'll be our performance optimization expert, using advanced profiling tools to identify bottlenecks and implementing solutions that achieve 5-10x speedups. From writing custom CUDA kernels to eliminating cold start latency, you'll ensure our infrastructure delivers world-class performance. This role is perfect for someone who gets excited about microsecond optimizations and pushing hardware to its theoretical limits.

Key Responsibilities

Profile and optimize GPU workloads using Nsight Systems, nvprof, and custom instrumentation

Write high-performance CUDA and Triton kernels for critical model operations

Optimize cold start latency from seconds to milliseconds for our serving infrastructure

Tune memory access patterns, kernel fusion, and GPU utilization

Collaborate with ML engineers to optimize model implementations

Debug performance issues across the full stack from application to hardware

Implement custom memory pooling and allocation strategies

Share optimization techniques and build performance culture across teams

Qualifications

Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field

5+ years systems programming experience with 3+ years focused on GPU optimization

Expert proficiency with GPU profiling tools (Nsight Systems, nvprof)

Strong CUDA programming skills with production kernel development

Deep understanding of GPU architecture (memory hierarchy, SMs, warps)

Track record of achieving significant performance improvements (5-10x)

Experience with Python and C++ in production environments

We Value

Experience with Triton kernel development

Knowledge of CUTLASS or similar high-performance libraries

Background in ML-specific optimizations (attention, transformers)

RDMA / InfiniBand optimization experience

Contributions to GPU libraries or frameworks

Low-level debugging skills (PTX / SASS reading)

Genmo is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. Genmo, Inc. is an E-Verify company and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish.

[job_alerts.create_a_job]

GPU Performance Engineer • San Francisco, CA, US

[internal_linking.similar_jobs]
Thermal Engineer

Thermal Engineer

Planet Labs PBC • San Francisco, CA, United States
[job_card.full_time]
We believe in using space to help life on Earth.Planet designs, builds, and operates the largest constellation of imaging satellites in history. This constellation delivers an unprecedented dataset ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
GPU Systems Engineer : High-Performance C++

GPU Systems Engineer : High-Performance C++

10X Recruiting Partners • San Francisco, CA, United States
[job_card.full_time]
A technology consulting firm is seeking a highly skilled Software Engineer (C++ Systems) to join their client's team in San Francisco. This role focuses on optimizing GPU virtualization performance ...[show_more]
[last_updated.last_updated_30] • [promoted]
Lead Engineer

Lead Engineer

University of California - San Francisco Campus and Health • San Francisco, CA, United States
[job_card.full_time]
As the technical leader on this project, you will own the technical design and development of web-based applications and backend services. You will provide technical guidance for large scale project...[show_more]
[last_updated.last_updated_30] • [promoted]
ML Performance Engineer - GPU Kernels & Production Speedups

ML Performance Engineer - GPU Kernels & Production Speedups

Bold Capital Partners • San Francisco, CA, United States
[job_card.full_time]
A tech innovation firm is seeking a skilled individual to design and implement high-performance GPU kernels for novel model architectures. This role involves profiling, optimizing workflows, and col...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Software Engineer - C++ GPU Performance

Software Engineer - C++ GPU Performance

Zoox • Foster City, CA, US
[job_card.full_time]
Zoox is building the world's most advanced self-driving hardware and software solution.The efficiency demands of such a system require an expert fine tuning of both the compute hardware archite...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Plastics Design EngineerMechanical Design Engineering • Berkeley, CA • Full time • On-site

Senior Plastics Design EngineerMechanical Design Engineering • Berkeley, CA • Full time • On-site

Form Energy • Berkeley, CA, United States
[job_card.full_time]
Are you ready to build America's energy future? Form Energy is an American manufacturing and energy technology company.We're revolutionizing energy storage with cost-effective, multi-day technology...[show_more]
[last_updated.last_updated_30] • [promoted]
Product Development Engineer, Reagents

Product Development Engineer, Reagents

Bruker • Emeryville, CA, United States
[job_card.full_time] +1
Product Development Engineer, Reagents.Bruker is enabling scientists to make breakthrough discoveries and develop new applications that improve the quality of human life. Bruker's high-performance s...[show_more]
[last_updated.last_updated_30] • [promoted]
Performance Engineer, GPU

Performance Engineer, GPU

Anthropic • San Francisco, CA, United States
[job_card.full_time]
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...[show_more]
[last_updated.last_updated_30] • [promoted]
GPU Performance Engineer

GPU Performance Engineer

Genmo Inc. • San Francisco, CA, United States
[job_card.full_time]
We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the bo...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff Engineer, Battery Engineering

Staff Engineer, Battery Engineering

Sila • Alameda, CA, US
[job_card.full_time]
We are Sila, a next-generation battery materials company.Our mission is to power the world's transition to clean energy.To create this future, our team is building a better lithium-ion battery ...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Performance Engineer

Performance Engineer

VirtualVocations • Oakland, California, United States
[job_card.full_time]
A company is looking for a Shipping Optimization Services Performance Engineer.Key Responsibilities Lead strategic engagements to guide shippers through complex optimization projects Execute hig...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Senior Scene Technician - Cal Performances

Senior Scene Technician - Cal Performances

University of California-Berkeley • Berkeley, CA, United States
[job_card.full_time] +1
At the University of California, Berkeley, we are dedicated to fostering a community where everyone feels welcome and can thrive. Our culture of openness, freedom and belonging make it a special pla...[show_more]
[last_updated.last_updated_30] • [promoted]
Performance ML Engineer : CUDA, GPU Systems

Performance ML Engineer : CUDA, GPU Systems

Relace • San Francisco, CA, United States
[job_card.full_time]
A tech company specializing in ML infrastructure is seeking a Machine Learning Engineer who excels at making models faster and more efficient through performance tuning and optimization.The ideal c...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff Network Engineer (Menlo Park, CA or Durham, NC) #4507

Staff Network Engineer (Menlo Park, CA or Durham, NC) #4507

GRAIL • Menlo Park, CA, US
[job_card.full_time]
Our mission is to detect cancer early, when it can be cured.We are working to change the trajectory of cancer mortality and bring stakeholders together to adopt innovative, safe, and effective tech...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
HPC / AI Data Performance Engineer

HPC / AI Data Performance Engineer

Lawrence Berkeley National Laboratory • Berkeley, CA, United States
[job_card.full_time] +1
In this exciting role, you will serve as a Data Performance Engineer in NERSC's Application Performance Group, architecting HPC and AI data services that advance fundamental science.You'll optimize...[show_more]
[last_updated.last_updated_30] • [promoted]
VAVE Engineer III - Plastics

VAVE Engineer III - Plastics

Bio-Rad Laboratories • Hercules, CA, United States
[job_card.full_time]
As a Plastics / Polymer Engineer in Value Engineering, you'll support Bio-Rad products across consumables and instrumentation in our Diagnostics and Lifescience by optimizing plastic components end-t...[show_more]
[last_updated.last_updated_30] • [promoted]
System Engineer, GPU Fleet

System Engineer, GPU Fleet

Fluidstack • San Francisco, CA, United States
[job_card.full_time]
About Fluidstack : At Fluidstack, we’re building the infrastructure for abundant intelligence.We partner with top AI labs, governments, and enterprises to unlock compute at the speed of light.We’re ...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Flight Dynamics / GNC Engineer (5221C), Space Sciences Laboratory - 83693

Flight Dynamics / GNC Engineer (5221C), Space Sciences Laboratory - 83693

InsideHigherEd • Berkeley, California, United States
[job_card.full_time]
Flight Dynamics / GNC Engineer (5221C), Space Sciences Laboratory - 83693.At the University of California, Berkeley, we are dedicated to fostering a community where everyone feels welcome and can thr...[show_more]
[last_updated.last_updated_variable_days] • [promoted]