Talent.com
GPU Performance Engineer
GPU Performance EngineerGenmo • San Francisco, CA, US
GPU Performance Engineer

GPU Performance Engineer

Genmo • San Francisco, CA, US
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Job Description

Job Description

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation.

We're seeking a GPU Performance Engineer to squeeze every last FLOP from our H100 infrastructure and optimize our model serving stack to its absolute limits.

The Role

You'll be our performance optimization expert, using advanced profiling tools to identify bottlenecks and implementing solutions that achieve 5-10x speedups. From writing custom CUDA kernels to eliminating cold start latency, you'll ensure our infrastructure delivers world-class performance. This role is perfect for someone who gets excited about microsecond optimizations and pushing hardware to its theoretical limits.

Key Responsibilities

Profile and optimize GPU workloads using Nsight Systems, nvprof, and custom instrumentation

Write high-performance CUDA and Triton kernels for critical model operations

Optimize cold start latency from seconds to milliseconds for our serving infrastructure

Tune memory access patterns, kernel fusion, and GPU utilization

Collaborate with ML engineers to optimize model implementations

Debug performance issues across the full stack from application to hardware

Implement custom memory pooling and allocation strategies

Share optimization techniques and build performance culture across teams

Qualifications

Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field

5+ years systems programming experience with 3+ years focused on GPU optimization

Expert proficiency with GPU profiling tools (Nsight Systems, nvprof)

Strong CUDA programming skills with production kernel development

Deep understanding of GPU architecture (memory hierarchy, SMs, warps)

Track record of achieving significant performance improvements (5-10x)

Experience with Python and C++ in production environments

We Value

Experience with Triton kernel development

Knowledge of CUTLASS or similar high-performance libraries

Background in ML-specific optimizations (attention, transformers)

RDMA / InfiniBand optimization experience

Contributions to GPU libraries or frameworks

Low-level debugging skills (PTX / SASS reading)

Genmo is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. Genmo, Inc. is an E-Verify company and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish.

[job_alerts.create_a_job]

GPU Performance Engineer • San Francisco, CA, US

[internal_linking.similar_jobs]
GPU Systems Engineer : High-Performance C++

GPU Systems Engineer : High-Performance C++

10X Recruiting Partners • San Francisco, CA, United States
[job_card.full_time]
A technology consulting firm is seeking a highly skilled Software Engineer (C++ Systems) to join their client's team in San Francisco. This role focuses on optimizing GPU virtualization performance ...[show_more]
[last_updated.last_updated_30] • [promoted]
Space Networking Growth Architect

Space Networking Growth Architect

Ephemeris Net • Berkeley, CA, United States
[job_card.full_time]
A space networking company in Berkeley, CA is searching for a Business Development Lead to define and execute revenue strategies across government and commercial markets. The role involves establish...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Frontend Engineer

Frontend Engineer

Tarro • Menlo Park, California, United States
[job_card.full_time]
Here at Tarro we build products that empower small brick and mortar restaurants by liberating them of the operational burden of running their business. We accomplish this by providing a frictionless...[show_more]
[last_updated.last_updated_30] • [promoted]
ML Performance Engineer - GPU Kernels & Production Speedups

ML Performance Engineer - GPU Kernels & Production Speedups

Bold Capital Partners • San Francisco, CA, United States
[job_card.full_time]
A tech innovation firm is seeking a skilled individual to design and implement high-performance GPU kernels for novel model architectures. This role involves profiling, optimizing workflows, and col...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Senior Performance Engineer : Optimize, Profile & Test

Senior Performance Engineer : Optimize, Profile & Test

Omega Solutions Inc. • San Francisco, CA, United States
[job_card.full_time]
A technology solutions firm based in California is seeking a Performance Engineer III to design and conduct performance tests. The ideal candidate will have over 5 years of experience in performance...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Performance Engineer, GPU

Performance Engineer, GPU

Anthropic • San Francisco, CA, United States
[job_card.full_time]
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...[show_more]
[last_updated.last_updated_30] • [promoted]
GPU Performance Engineer

GPU Performance Engineer

Genmo Inc. • San Francisco, CA, United States
[job_card.full_time]
We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the bo...[show_more]
[last_updated.last_updated_30] • [promoted]
Performance Engineer

Performance Engineer

VirtualVocations • Oakland, California, United States
[job_card.full_time]
A company is looking for a Shipping Optimization Services Performance Engineer.Key Responsibilities Lead strategic engagements to guide shippers through complex optimization projects Execute hig...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Advanced Electronics / Computer Field Technician (Electronics Technician & Fire Controlman) - Full Time

Advanced Electronics / Computer Field Technician (Electronics Technician & Fire Controlman) - Full Time

U.S. Navy • Sausalito, CA, US
[job_card.full_time]
The Navys Advanced Electronics / Computer Field (AECF) offers extensive training in electronics, computer systems, radar, communications, and weapons fire control systems,.Navys advanced missile sy...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
AI-Driven GTM Systems Engineer for Marketing Ops

AI-Driven GTM Systems Engineer for Marketing Ops

Snowflake • Menlo Park, CA, United States
[job_card.full_time]
A leading cloud data platform company located in Menlo Park is seeking a GTM Systems Engineer to optimize marketing operations. The ideal candidate will have over 6 years of experience in Marketing ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Radiology / Cardiology-X-Ray Tech

Radiology / Cardiology-X-Ray Tech

Zenex Partners • Berkeley, CA, United States
[job_card.full_time]
Job Opportunity : Radiology / Cardiology - X-Ray Tech.Facility : Sutter Health Alta Bates Summit Medical Center Ashby.Employment Type : Travel / Contract. Shift : Night (5x8 Hours) 23 : 00 7 : 00 / Evening (5...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior HPC & GPU Infrastructure Engineer

Senior HPC & GPU Infrastructure Engineer

Sciforium • San Francisco, CA, United States
[job_card.full_time]
Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary, high-efficiency serving platform. Backed by multi-million-dollar funding and direct spons...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior ML Infra Engineer — Distributed GPU Platforms

Senior ML Infra Engineer — Distributed GPU Platforms

Genesis Therapeutics Inc. • Burlingame, CA, United States
[job_card.full_time]
A biotechnology company in Burlingame is seeking experienced ML infrastructure engineers to lead engineering efforts on their AI platform focused on generative modeling. Responsibilities include opt...[show_more]
[last_updated.last_updated_30] • [promoted]
GPU Accelerated Bioinformatics Engineer

GPU Accelerated Bioinformatics Engineer

Prima Mente • San Francisco, CA, United States
[job_card.full_time]
Prima Mente is a frontier biology AI lab.We generate our own data, build general purpose biological foundation models, and translate discoveries into research and clinical outcomes.Our first goal i...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Wireless Power FEA & Thermal Simulation Engineer

Wireless Power FEA & Thermal Simulation Engineer

Apple Inc. • San Francisco, CA, United States
[job_card.full_time]
A leading technology company in California is seeking an experienced engineer to work on innovative wireless power products. The role involves design, simulation, and analysis using advanced techniq...[show_more]
[last_updated.last_updated_30] • [promoted]
Performance ML Engineer : CUDA, GPU Systems

Performance ML Engineer : CUDA, GPU Systems

Relace • San Francisco, CA, United States
[job_card.full_time]
A tech company specializing in ML infrastructure is seeking a Machine Learning Engineer who excels at making models faster and more efficient through performance tuning and optimization.The ideal c...[show_more]
[last_updated.last_updated_30] • [promoted]
System Engineer, GPU Fleet

System Engineer, GPU Fleet

Fluidstack • San Francisco, CA, United States
[job_card.full_time]
About Fluidstack : At Fluidstack, we’re building the infrastructure for abundant intelligence.We partner with top AI labs, governments, and enterprises to unlock compute at the speed of light.We’re ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Flight Dynamics / GNC Engineer (5221C), Space Sciences Laboratory - 83693

Flight Dynamics / GNC Engineer (5221C), Space Sciences Laboratory - 83693

InsideHigherEd • Berkeley, California, United States
[job_card.full_time]
Flight Dynamics / GNC Engineer (5221C), Space Sciences Laboratory - 83693.At the University of California, Berkeley, we are dedicated to fostering a community where everyone feels welcome and can thr...[show_more]
[last_updated.last_updated_variable_days] • [promoted]