Talent.com
Staff ML Infrastructure Engineer
Cubiq Recruitment • Hayward, CA, US
Full-time

Staff/Lead ML Infrastructure Engineer

San Francisco, CA — Onsite

Salary: above market average, plus equity

We are building one of the world's leading generative video and multimodal AI platforms, and we're looking for a senior infrastructure engineer to own the backbone that makes it possible. This role is ideal for an engineer from a top-tier tech company who has built cloud-scale systems, high-performance compute platforms, and battle-tested CI/CD pipelines that support complex ML workloads.

What You'll Own

  • Core ML Platform Architecture: Design and evolve the infrastructure that supports large-scale generative video and multimodal model training, evaluation, and deployment.
  • High-Throughput Compute Systems: Build and optimize GPU/TPU clusters, distributed training systems, and orchestration layers tailored for video-heavy pipelines.
  • Production Reliability for Generative Models: Create the tooling and services needed to safely push frequent model updates while handling massive compute loads and long-running jobs.
  • End-to-End CI/CD for ML: Lead the development of automated pipelines for model training, validation, artifact management, and production rollout.
  • Multimodal Data Infrastructure: Build systems to ingest, version, transform, and serve large-scale video, audio, and text datasets with high reliability.
  • Internal Developer Experience: Partner with research, product, and applied ML teams to build intuitive internal tooling for experiment tracking, model lineage, and resource scheduling.
  • Technical Leadership: Mentor engineers, set platform standards, and influence long-term architectural direction.

What You've Done

  • Architected and operated large-scale infrastructure at a cloud provider, hyperscaler, or leading AI company.
  • Built or owned mission-critical CI/CD systems, high-capacity compute platforms, or data infrastructure supporting ML teams.
  • Deep experience with distributed compute across GPUs/accelerators, Kubernetes, and cloud infrastructure (AWS/GCP/Azure).
  • Strong engineering fundamentals in Python, Go, or equivalent languages.
  • Previous exposure to ML training pipelines, especially systems that handle heavy video, multimodal, or high-dimensional data.
  • Demonstrated ability to lead complex cross-org initiatives and drive technical strategy.

Nice to Have

  • Experience with video processing systems, large-scale media pipelines, or streaming architectures.
  • Familiarity with modern multimodal or video-generation frameworks (PyTorch, JAX, diffusers, custom accelerators).
  • Experience with Ray, Triton, CUDA optimization, or specialized scheduling for ML workloads.
  • Background working in high-growth AI startups or research-focused environments.
  • Awareness of security and compliance considerations for models that generate or process user content.

Why Join

  • Shape the underlying platform powering one of the most advanced generative video systems in the world.
  • Influence the future of multimodal AI by building infrastructure that directly accelerates research and product breakthroughs.
  • Work closely with experienced founding engineers, researchers, and platform builders from leading tech companies.
  • Highly competitive compensation, meaningful equity, and a strong in-person engineering culture in San Francisco.