Talent.com
AI & HPC Infrastructure Engineer
AI & HPC Infrastructure EngineerLos Angeles Staffing • Los Angeles, CA, United States
AI & HPC Infrastructure Engineer

AI & HPC Infrastructure Engineer

Los Angeles Staffing • Los Angeles, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Global Infrastructure Engineering AI & HPC Team

The Global Infrastructure Engineering AI & HPC team is at the center of enabling infrastructure reinvention for the next era of digital solutions powered by AI and High-Performance Computing (HPC). We bring together deep technical expertise across cloud, on-prem, and hybrid environments to design, build, and operate accelerated infrastructure that powers high-performance workloads at scale. Our solutions enable some of our most strategic and mission-critical clients to unlock new levels of performance, efficiency, and innovation. Our remit spans the full lifecyclefrom strategy and architecture through implementation and operationsdriving modernization across the entire infrastructure stack. We collaborate across the ecosystem to harness emerging technologies, fuel growth, and transform industries. In this rapidly growing market, our team is leading the way in shaping how enterprises leverage AI and HPC to drive breakthrough innovation and reimagine what's possible in infrastructure.

Key Responsibilities

  • Design and implement HPC and AI infrastructure solutions, aligning system architecture and deployment roadmaps to industry-specific performance and scalability needs
  • Deploy, configure, and manage XPU-based clusters (CPU / GPU / accelerators) using schedulers, VM / K8s orchestration platforms, Slurm, and containerized platforms in scalable designs to provide Metal as a Service (MaaS), GPUaaS, AIaaS, and other offerings
  • Optimize cluster performance, scalability, energy, and cost efficiency across on-premises, cloud, and hybrid environments
  • Integrate AI and HPC platforms with existing IT systems, data pipelines, and security frameworks
  • Monitor, troubleshoot, and tune infrastructure to ensure high availability, low-latency networking, and workload resiliency
  • Develop and maintain documentation including architecture diagrams, configuration baselines, and operational runbooks
  • Provide technical guidance and support to users, enabling efficient execution of HPC / AI workloads, large-scale models, and simulations. Travel may be required for this role. The amount of travel will vary from 25% to 100% depending on business need and client requirements.

Required Skills and Qualifications

  • Minimum 4+ year of hands-on experience designing, deploying, and managing HPC and AI infrastructure across on-premises, cloud, and hybrid environments in 2 or more segments : hyperscaler, neocloud, large Enterprise, Telco / Mobile, supporting key industries such as Financial Services, Life Sciences, Manufacturing, and Retail
  • Minimum 4+ years' experience of accelerated computing architectures (GPUs, XPUs, DPUs), high-performance fabrics (InfiniBand, Ethernet), SONiC, networking, and modern storage / data platforms (e.g. NVMe-oF, Lustre, GPFS, BeeGFS, VAST, DDN, Weka) to build robust solutions
  • Minimum 4+ year experience with cluster management and orchestration (e.g. Slurm, Run : ai, Kubernetes, Docker), real-time performance monitoring, and observability frameworks
  • Minimum 4+ years' experience with cloud and virtualization platforms (e.g. AWS, Azure, GCP, VMware, Nutanix) and expertise in automation and optimization using scripting (Python, AI tools) with foundational Infrastructure-as-Code tools such as Terraform and Ansible.
  • Minimum 4+ year experience implementing MLOps and DevSecOps frameworks to enable secure, automated, and reproducible workflows
  • Bachelor's degree or equivalent (minimum 12 years) work experience. (If Associate's Degree, must have minimum 6 years work experience)
  • Preferred Skills and Qualifications

  • Experience managing the deployment of 1,000+ GPU clusters for HPC and AI workloads with various infrastructure services enabled
  • Experience with GPU computing libraries and accelerators (e.g., NVIDIA CUDA, Dynamo, AMD ROCm)
  • Experience with AI and HPC Networking (e.g., RoCE, InfiniBand, muti-planar / multi-rail designs, platform buffer architectures)
  • Knowledge of Machine Learning and AI frameworks (e.g., TensorFlow, PyTorch, JAX), Jupyter notebooks / Google Colab environments
  • Experience with HPC & AI workload management and optimization techniques
  • Familiarity with DevOps practices and tools (e.g., Ansible, Terraform) for infrastructure automation
  • Industry certifications in NVIDIA infrastructure, public cloud providers, Data Science, etc. are a plus
  • Compensation

    Compensation at Accenture varies depending on a wide array of factors, which may include but are not limited to the specific office location, role, skill set, and level of experience. As required by local law, Accenture provides a reasonable range of compensation for roles that may be hired as set forth below.

    Role Location Annual Salary Range

    California $73,800 to $218,800

    Cleveland $68,300 to $175,000

    Colorado $73,800 to $189,000

    District of Columbia $78,500 to $201,300

    Illinois $68,300 to $189,000

    Maryland $73,800 to $189,000

    Massachusetts $73,800 to $201,300

    Minnesota $73,800 to $189,000

    New York / New Jersey $68,300 to $218,800

    Washington $78,500 to $201,300

    [job_alerts.create_a_job]

    AI HPC Infrastructure Engineer • Los Angeles, CA, United States

    [internal_linking.similar_jobs]
    Sr. Software Engineer - DPDK, Docker, and IP Networking

    Sr. Software Engineer - DPDK, Docker, and IP Networking

    Apposite Technologies LLC • Los Angeles, California, US
    [job_card.full_time]
    Job Description Job Description Apposite Technologies is looking for a Sr.Software engineer with strong DPDK, Docker, IP Networking, and Linux experience. Apposite's network emulation solutions ha...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    HRIS Integration Specialist

    HRIS Integration Specialist

    Modern HR • Burbank, California, US
    [job_card.full_time]
    Job Description Job Description The HRIS Integration Specialist focus is to administer, develop, support the maintenance and compliance of the HRIS system. This position serves as a technical poin...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Remote Investment Analyst – AI Trainer ($50-$60 / hour)

    Remote Investment Analyst – AI Trainer ($50-$60 / hour)

    Data Annotation • Burbank, California
    [filters.remote]
    [job_card.full_time] +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Infrastructure Engineer

    Infrastructure Engineer

    Davis Wright Tremaine • Los Angeles, CA, United States
    [job_card.full_time]
    This is an exciting opportunity to work for one of the top law firms in the U.Davis Wright Tremaine LLP is looking for an. Seattle, Portland, Anchorage, Los Angeles, San Francisco, New York, or Wash...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Principal Director, Applied AI & Advanced Infrastructure (A3I)

    Principal Director, Applied AI & Advanced Infrastructure (A3I)

    The Aerospace Corporation • El Segundo, CA, United States
    [job_card.full_time]
    The Aerospace Corporation is the trusted partner to the nation’s space programs, solving the hardest problems and providing unmatched technical expertise. As the operator of a federally funded resea...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Forward Deployed Engineer

    Forward Deployed Engineer

    Titan Holdings • Los Angeles, California, US
    [job_card.full_time]
    Job Description Job Description Position Overview Were hiring a Forward Deployed Engineer (FDE) to deploy autonomous AI agents into real enterprise workflows. This is a ship-the-product-in-the-fie...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Need Data Engineer - Long Beach, CA

    Need Data Engineer - Long Beach, CA

    ICS Global Soft INC • Long Beach, California, US
    [job_card.full_time]
    Job Description Job Description Requirement : Role : DataEngineer Location : Long Beach, CA Duration : 3months C2H Interviewprocess : Phone and Video or In person Job Description : Top Priorities : 1....[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Controls Engineer

    Controls Engineer

    Automationtechies • Carson, California, US
    [job_card.full_time]
    Job Description Job Description Control System Engineer - REMOTE (Location specific - must reside in SoCal or near Orlando, FL) Thriving systems integration company in Southern California, cateri...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Network Design Engineer I / II Pipeline

    Senior Network Design Engineer I / II Pipeline

    Rocket Lab Corporation • Long Beach, California, US
    [job_card.permanent]
    Job Description Job Description ABOUT ROCKET LAB Rocket Lab is an end-to-end space company delivering responsive launch services, complete spacecraft design and manufacturing, payloads, satellite...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Infrastructure Architect

    Infrastructure Architect

    Impulse Space • Redondo Beach, California, US
    [job_card.full_time] +1
    Job Description Job Description Description Impulse is seeking a Infrastructure Architect to define, design, and scale a secure, high-performance, multi-site infrastructure supporting aerospace an...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Network Engineer I (IT Infrastructure)

    Network Engineer I (IT Infrastructure)

    Santa Monica Seafood Company • Compton, California, US
    [job_card.full_time]
    Job Description Job Description Position : Network Engineer I (IT Infrastructure) Location : Rancho Dominguez, CA (Hybrid) Reports To : Sr. Systems Engineer Role Overview : The Network Engineer I is ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Tech Lead, AI Compute Infrastructure

    Tech Lead, AI Compute Infrastructure

    HeyGen • Los Angeles, CA, United States
    [job_card.full_time]
    At HeyGen, our mission is to make visual storytelling accessible to all.Over the last decade, visual content has become the preferred method of information creation, consumption, and retention.But ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Engineer 2

    Engineer 2

    Apex Companies • Signal Hill, California, US
    [job_card.full_time]
    Job Description Job Description Are you highly motivated, hard-working, and seeking to join a growth-focused consulting & engineering firm? Are you looking for a company that will invest in your ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Sr. M365 AI Engineer

    Sr. M365 AI Engineer

    Konica Minolta Business Solutions • Los Angeles, CA, United States
    [job_card.full_time]
    All Covered, IT Managed Services Division of Konica Minolta Business Solutions, has an exciting opportunity available for a Sr. Developed and deliver Copilot and AI training sessions to end-users an...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Engineer Analytics Infrastructure (Foundational Hire)

    Data Engineer Analytics Infrastructure (Foundational Hire)

    Vast.ai • Los Angeles, California, US
    [job_card.full_time]
    Job Description Job Description About Us Vision : To make life substrate independent through Vast Artificial Intelligence Mission : To organize, optimize, and orient the world's computation Vast.AI...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Identity & Access Management (IAM) Engineer

    Identity & Access Management (IAM) Engineer

    InsideHigherEd • Los Angeles, California, United States
    [job_card.full_time]
    The UCLA Information Security team enables UCLA’s mission by providing leadership and expertise that assures the confidentiality, integrity, safeguarding, and availability of the university’s digit...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    DESIGN ENGINEER II

    DESIGN ENGINEER II

    AMPAM Parks Mechanical • Carson, California, US
    [job_card.full_time]
    Job Description Job Description Who We Are AMPAM is a leading Mechanical, Electrical, and Plumbing (MEP) contractor serving large-scale multifamily and commercial projects across California.With ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Algorithm Engineer

    AI Algorithm Engineer

    VINNOVATION INC • Los Angeles, California, US
    [job_card.full_time]
    Job Description Job Description Job Title : AI Algorithm Engineer Location : Hybrid / Remote Mostly Job Type : Full-time Salary : Negotiable Responsibilities • Core AI Module Development Deeply pa...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]