Talent.com
GPU Performance Engineer
GPU Performance EngineerGenmo Inc. • San Francisco, CA, United States
GPU Performance Engineer

GPU Performance Engineer

Genmo Inc. • San Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what’s possible in video generation.

We’re seeking a GPU Performance Engineer to squeeze every last FLOP from our H100 infrastructure and optimize our model serving stack to its absolute limits.

The Role

You’ll be our performance optimization expert, using advanced profiling tools to identify bottlenecks and implementing solutions that achieve 5-10x speedups. From writing custom CUDA kernels to eliminating cold start latency, you’ll ensure our infrastructure delivers world-class performance. This role is perfect for someone who gets excited about microsecond optimizations and pushing hardware to its theoretical limits.

Key Responsibilities

  • Profile and optimize GPU workloads using Nsight Systems, nvprof, and custom instrumentation
  • Write high-performance CUDA and Triton kernels for critical model operations
  • Optimize cold start latency from seconds to milliseconds for our serving infrastructure
  • Tune memory access patterns, kernel fusion, and GPU utilization
  • Collaborate with ML engineers to optimize model implementations
  • Debug performance issues across the full stack from application to hardware
  • Implement custom memory pooling and allocation strategies
  • Share optimization techniques and build performance culture across teams

Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or related field
  • 5+ years systems programming experience with 3+ years focused on GPU optimization
  • Expert proficiency with GPU profiling tools (Nsight Systems, nvprof)
  • Strong CUDA programming skills with production kernel development
  • Deep understanding of GPU architecture (memory hierarchy, SMs, warps)
  • Track record of achieving significant performance improvements (5-10x)
  • Experience with Python and C++ in production environments
  • We Value

  • Experience with Triton kernel development
  • Knowledge of CUTLASS or similar high-performance libraries
  • Background in ML-specific optimizations (attention, transformers)
  • RDMA / InfiniBand optimization experience
  • Contributions to GPU libraries or frameworks
  • Low-level debugging skills (PTX / SASS reading)
  • Genmo is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. Genmo, Inc. is an E-Verify company and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish.

    #J-18808-Ljbffr

    [job_alerts.create_a_job]

    Performance Engineer • San Francisco, CA, United States

    [internal_linking.similar_jobs]
    Simulation Engineer

    Simulation Engineer

    Hammerhead AI • Redwood City, CA, US
    [job_card.full_time]
    We're unleashing AI with intelligent orchestration while addressing one of the most pressing bottlenecks for AI access to Power. Our cutting-edge platform optimizes data center power infrastruct...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Quality Assurance Engineer I

    Quality Assurance Engineer I

    EarLens Corporation • Menlo Park, CA, US
    [job_card.full_time]
    Quality Management System functions within our manufacturing operations.This role is ideal for an early career engineer looking to make an immediate impact in a fast-paced medical device environmen...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Quality Assurance Engineer

    Senior Quality Assurance Engineer

    Bellota Labs • Redwood City, CA, US
    [job_card.full_time]
    Driven by innovation, game integrity, and exceptional customer experiences, we are on a mission to set new standards in online gaming. If you are passionate about cutting-edge technology and buildin...[show_more]
    [last_updated.last_updated_30] • [promoted]
    GPU Systems Engineer - HPC / Parallel Computing

    GPU Systems Engineer - HPC / Parallel Computing

    Vast.ai • San Francisco, CA, US
    [job_card.full_time]
    AI projects and businesses all over the world.We are democratizing and decentralizing AI computing—reshaping our future for the benefit of humanity. We are a small, growing, and highly motivat...[show_more]
    [last_updated.last_updated_30] • [promoted]
    GPU Systems Engineer : High-Performance C++

    GPU Systems Engineer : High-Performance C++

    10X Recruiting Partners • San Francisco, CA, United States
    [job_card.full_time]
    A prominent recruiting firm is seeking a highly skilled Software Engineer (C++ Systems) to join a client’s team focused on GPU virtualization. The role requires optimizing performance at the systems...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Geotechnical Engineer

    Staff Geotechnical Engineer

    Certerra • Berkeley, CA, US
    [job_card.full_time]
    Certerra A3GEO is seeking a full-time Staff Geotechnical Engineer to join our Berkeley, California team.Certerra A3GEO is an established geotechnical consulting firm where people collaborate, innov...[show_more]
    [last_updated.last_updated_30] • [promoted]
    GPU Systems Engineer

    GPU Systems Engineer

    SLR Search • San Francisco, CA, US
    [job_card.full_time]
    Architect the foundation of the future's most performance-critical cloud infrastructure.Starting Salary targeting $200,000 - $280,000. Comprehensive medical, dental, vision.GPU Systems Engineer ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    GPU Performance Engineer

    GPU Performance Engineer

    Genmo Inc. • San Francisco, CA, United States
    [job_card.full_time]
    We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the bo...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Performance Engineer, GPU

    Performance Engineer, GPU

    Anthropic • San Francisco, CA, United States
    [job_card.full_time]
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Professional Engineer

    Professional Engineer

    DES • Redwood City, CA, US
    [job_card.full_time]
    Join DES as a Professional Engineer and contribute to the continued growth of our structural team.We're looking for a motivated, detail-oriented engineer who values technical excellence and thr...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Pavement Engineer (applied research focus)

    Senior Pavement Engineer (applied research focus)

    NICHOLS CONSULTING ENGINEERS CHTD • Emeryville, CA, US
    [job_card.full_time]
    NCE is actively recruiting a Senior Pavement Engineer to join out Pavements and Materials Group.If you enjoy a challenging career in applied pavement research and engineering, this opportunity may ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Performance ML Engineer : CUDA, GPU Systems

    Performance ML Engineer : CUDA, GPU Systems

    Relace • San Francisco, CA, United States
    [job_card.full_time]
    A tech company specializing in ML infrastructure is seeking a Machine Learning Engineer who excels at making models faster and more efficient through performance tuning and optimization.The ideal c...[show_more]
    [last_updated.last_updated_30] • [promoted]
    GPU Fleet Engineer – Hyperscale Infra, Kubernetes & AI

    GPU Fleet Engineer – Hyperscale Infra, Kubernetes & AI

    OpenAI • San Francisco, CA, United States
    [job_card.full_time]
    Join a forward-thinking company as an engineer in the fleet infrastructure team, where you'll design and operate systems for one of the largest GPU fleets globally. This role offers the chance to wo...[show_more]
    [last_updated.last_updated_30] • [promoted]
    City Engineer

    City Engineer

    WBCP, Inc. • Berkeley, CA, US
    [job_card.full_time]
    The City of Berkeley, California is seeking an experienced, forward-thinking, and collaborative.This is an exceptional opportunity to guide complex capital improvement programs; advance multimodal,...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Application Performance Engineer

    Application Performance Engineer

    Computer Task Group, Inc • Redwood City, CA, US
    [job_card.full_time]
    Application Performance Engineer.We're looking for strong application.Oracle database with LLVM compiler technology to achieve optimal performance. The job will require analyzing performance iss...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Principal Air Quality Engineer / Scientist

    Principal Air Quality Engineer / Scientist

    Yorke Engineering • Berkeley, CA, US
    [job_card.full_time]
    Join Yorke Engineering, LLC, an Environmental Consulting leader in California that implements Environmental Engineering and Compliance solutions for our clients throughout California.Our mission is...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Product Quality Assurance Engineer (Medical Device)

    Product Quality Assurance Engineer (Medical Device)

    Eko • Emeryville, CA, US
    [job_card.full_time]
    Eko builds AI and digital tools to enable every healthcare provider to more accurately detect heart and lung disease – the leading causes of death globally. Our FDA cleared, industry leading p...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Commercial Solar Project Developer

    Commercial Solar Project Developer

    Sun Light & Power • Berkeley, CA, US
    [job_card.full_time]
    Commission is also given for each successful sale without cap.Do you want to become an employee-owner for a mission-driven, growth focused company? Do you approach your work with passion and dedica...[show_more]
    [last_updated.last_updated_30] • [promoted]