Talent.com
Tech Lead, AI Compute Infrastructure
HeyGen • Los Angeles, CA, United States
Job type: Full-time

About HeyGen

At HeyGen, our mission is to make visual storytelling accessible to all. Over the last decade, visual content has become the preferred method of information creation, consumption, and retention. But the ability to create such content, in particular videos, continues to be costly and challenging to scale. Our ambition is to build technology that equips more people with the power to reach, captivate, and inspire audiences.

Learn more at www.heygen.com. Visit our Mission and Culture doc here.

We are seeking a seasoned Technical Leader to build and scale the foundational compute infrastructure that powers our state‑of‑the‑art AI models—from multimodal training data pipelines to high‑throughput, low‑latency video generation.

Responsibilities

You will be the core engineer responsible for building the robust, efficient, and scalable platform that enables our research and production teams to rapidly iterate on HeyGen's generative video models. Your contributions will directly impact model performance, developer productivity, and the final quality of every AI‑generated video.

  • Optimize GPU Utilization: Design and implement mechanisms to aggressively optimize GPU and cluster utilization across thousands of devices for inference, training, data processing, and large‑scale deployment of our state‑of‑the‑art video generation models.
  • Develop Large‑Scale AI Job Framework: Build highly scalable, reliable frameworks for launching and managing massive, heterogeneous compute jobs, including high‑volume multimodal data ingestion and processing, distributed model training, and continuous evaluation and benchmarking.
  • Enhance Observability: Develop world‑class observability, tracing, and visualization tools for our compute cluster to ensure reliability and diagnose performance bottlenecks (e.g., memory, bandwidth, communication).
  • Accelerate Pipelines: Collaborate closely with AI researchers and AI engineers to integrate innovative acceleration techniques (e.g., custom CUDA kernels, distributed training libraries) into production‑ready, scalable training and inference pipelines.
  • Infrastructure Management: Champion the adoption and optimization of modern cloud and container technologies (Kubernetes, Ray) for elastic, cost‑efficient scaling of our distributed systems.

We are looking for a highly motivated engineer with deep experience operating and optimizing AI infrastructure at scale.

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
  • 5+ years of full‑time industry experience in large‑scale MLOps, AI infrastructure, or HPC systems.
  • Experience with data frameworks and standards such as Ray, Apache Spark, and LanceDB.
  • Strong proficiency in Python and a high‑performance language such as C++ for developing core infrastructure components.
  • Deep understanding of and hands‑on experience with modern orchestration and distributed computing frameworks such as Kubernetes and Ray.
  • Experience with core ML frameworks such as PyTorch, TensorFlow, or JAX.

Preferred Qualifications

  • Master's or PhD in Computer Science or a related technical field.
  • Demonstrated Tech Lead experience, driving projects from conceptual design through to production deployment across cross‑functional teams.
  • Prior experience building infrastructure specifically for Generative AI models (e.g., diffusion models, GANs, or large language models) where cost and latency are critical.
  • Proven background in building and operating large‑scale data infrastructure (e.g., Ray, Apache Spark) to manage petabytes of multimodal data (video, audio, text).
  • Expertise in GPU acceleration and deep familiarity with low‑level compute programming, including CUDA, NCCL, or similar technologies for efficient inter‑GPU communication.

What HeyGen Offers

  • Competitive salary and benefits package.
  • Dynamic and inclusive work environment.
  • Opportunities for professional growth and advancement.
  • Collaborative culture that values innovation and creativity.
  • Access to the latest technologies and tools.

HeyGen is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
