Talent.com
AI & HPC Infrastructure Engineer
AI & HPC Infrastructure EngineerLos Angeles Staffing • Los Angeles, CA, United States
AI & HPC Infrastructure Engineer

AI & HPC Infrastructure Engineer

Los Angeles Staffing • Los Angeles, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Global Infrastructure Engineering AI & HPC Team

The Global Infrastructure Engineering AI & HPC team is at the center of enabling infrastructure reinvention for the next era of digital solutions powered by AI and High-Performance Computing (HPC). We bring together deep technical expertise across cloud, on-prem, and hybrid environments to design, build, and operate accelerated infrastructure that powers high-performance workloads at scale. Our solutions enable some of our most strategic and mission-critical clients to unlock new levels of performance, efficiency, and innovation. Our remit spans the full lifecyclefrom strategy and architecture through implementation and operationsdriving modernization across the entire infrastructure stack. We collaborate across the ecosystem to harness emerging technologies, fuel growth, and transform industries. In this rapidly growing market, our team is leading the way in shaping how enterprises leverage AI and HPC to drive breakthrough innovation and reimagine what's possible in infrastructure.

Key Responsibilities

  • Design and implement HPC and AI infrastructure solutions, aligning system architecture and deployment roadmaps to industry-specific performance and scalability needs
  • Deploy, configure, and manage XPU-based clusters (CPU / GPU / accelerators) using schedulers, VM / K8s orchestration platforms, Slurm, and containerized platforms in scalable designs to provide Metal as a Service (MaaS), GPUaaS, AIaaS, and other offerings
  • Optimize cluster performance, scalability, energy, and cost efficiency across on-premises, cloud, and hybrid environments
  • Integrate AI and HPC platforms with existing IT systems, data pipelines, and security frameworks
  • Monitor, troubleshoot, and tune infrastructure to ensure high availability, low-latency networking, and workload resiliency
  • Develop and maintain documentation including architecture diagrams, configuration baselines, and operational runbooks
  • Provide technical guidance and support to users, enabling efficient execution of HPC / AI workloads, large-scale models, and simulations. Travel may be required for this role. The amount of travel will vary from 25% to 100% depending on business need and client requirements.

Required Skills and Qualifications

  • Minimum 4+ year of hands-on experience designing, deploying, and managing HPC and AI infrastructure across on-premises, cloud, and hybrid environments in 2 or more segments : hyperscaler, neocloud, large Enterprise, Telco / Mobile, supporting key industries such as Financial Services, Life Sciences, Manufacturing, and Retail
  • Minimum 4+ years' experience of accelerated computing architectures (GPUs, XPUs, DPUs), high-performance fabrics (InfiniBand, Ethernet), SONiC, networking, and modern storage / data platforms (e.g. NVMe-oF, Lustre, GPFS, BeeGFS, VAST, DDN, Weka) to build robust solutions
  • Minimum 4+ year experience with cluster management and orchestration (e.g. Slurm, Run : ai, Kubernetes, Docker), real-time performance monitoring, and observability frameworks
  • Minimum 4+ years' experience with cloud and virtualization platforms (e.g. AWS, Azure, GCP, VMware, Nutanix) and expertise in automation and optimization using scripting (Python, AI tools) with foundational Infrastructure-as-Code tools such as Terraform and Ansible.
  • Minimum 4+ year experience implementing MLOps and DevSecOps frameworks to enable secure, automated, and reproducible workflows
  • Bachelor's degree or equivalent (minimum 12 years) work experience. (If Associate's Degree, must have minimum 6 years work experience)
  • Preferred Skills and Qualifications

  • Experience managing the deployment of 1,000+ GPU clusters for HPC and AI workloads with various infrastructure services enabled
  • Experience with GPU computing libraries and accelerators (e.g., NVIDIA CUDA, Dynamo, AMD ROCm)
  • Experience with AI and HPC Networking (e.g., RoCE, InfiniBand, muti-planar / multi-rail designs, platform buffer architectures)
  • Knowledge of Machine Learning and AI frameworks (e.g., TensorFlow, PyTorch, JAX), Jupyter notebooks / Google Colab environments
  • Experience with HPC & AI workload management and optimization techniques
  • Familiarity with DevOps practices and tools (e.g., Ansible, Terraform) for infrastructure automation
  • Industry certifications in NVIDIA infrastructure, public cloud providers, Data Science, etc. are a plus
  • Compensation

    Compensation at Accenture varies depending on a wide array of factors, which may include but are not limited to the specific office location, role, skill set, and level of experience. As required by local law, Accenture provides a reasonable range of compensation for roles that may be hired as set forth below.

    Role Location Annual Salary Range

    California $73,800 to $218,800

    Cleveland $68,300 to $175,000

    Colorado $73,800 to $189,000

    District of Columbia $78,500 to $201,300

    Illinois $68,300 to $189,000

    Maryland $73,800 to $189,000

    Massachusetts $73,800 to $201,300

    Minnesota $73,800 to $189,000

    New York / New Jersey $68,300 to $218,800

    Washington $78,500 to $201,300

    [job_alerts.create_a_job]

    AI HPC Infrastructure Engineer • Los Angeles, CA, United States

    [internal_linking.similar_jobs]
    AI Security / Biosecurity Engineer, RAND CAST

    AI Security / Biosecurity Engineer, RAND CAST

    RAND Corporation • Santa Monica, CA, United States
    [job_card.temporary]
    The RAND Center on AI, Security, and Technology (RAND CAST).AI Security / Biosecurity Engineers to work across a number of our most critical and fast-paced AI security and biosecurity workstreams.R...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Principal FPGA Engineer I

    Principal FPGA Engineer I

    CesiumAstro • El Segundo, California, US
    [job_card.full_time] +1
    Job Description Job Description Please Note : To conform with the United States Government Space Technology Export Regulations, the applicant must be a U. At CesiumAstro , we are developers and pio...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Sr. Software Engineer - DPDK, Docker, and IP Networking

    Sr. Software Engineer - DPDK, Docker, and IP Networking

    Apposite Technologies LLC • Los Angeles, California, US
    [job_card.full_time]
    Job Description Job Description Apposite Technologies is looking for a Sr.Software engineer with strong DPDK, Docker, IP Networking, and Linux experience. Apposite's network emulation solutions ha...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Software Engineer - Agentic AI Analytics (Rust / Data Systems)

    Senior Software Engineer - Agentic AI Analytics (Rust / Data Systems)

    Noor Staffing Group • Los Angeles, California, US
    [job_card.full_time]
    Job Description Job Description Title : Senior Software Engineer - Agentic AI Analytics (Rust / Data Systems) Location : Glendale, CA - structured hybrid (in-office component required) Compensation...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Infrastructure Engineer

    Infrastructure Engineer

    Davis Wright Tremaine • Los Angeles, CA, United States
    [job_card.full_time]
    This is an exciting opportunity to work for one of the top law firms in the U.Davis Wright Tremaine LLP is looking for an. Seattle, Portland, Anchorage, Los Angeles, San Francisco, New York, or Wash...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior AI Compute Infrastructure Engineer

    Senior AI Compute Infrastructure Engineer

    HeyGen • Los Angeles, CA, United States
    [job_card.full_time]
    A tech company specializing in AI infrastructure is seeking a Software Engineer to build a scalable compute platform for its generative video models. The ideal candidate will have over 5 years of ex...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Tech Lead, AI Compute Infrastructure

    Tech Lead, AI Compute Infrastructure

    HeyGen • Los Angeles, California, United States
    [job_card.full_time]
    At HeyGen, our mission is to make visual storytelling accessible to all.Over the last decade, visual content has become the preferred method of information creation, consumption, and retention.But ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior AI Engineer

    Senior AI Engineer

    Sidecar Health • Los Angeles, CA, US
    [job_card.full_time]
    Sidecar Health is redefining health insurance.Our mission is to make excellent healthcare affordable and attainable for everyone. We know that to accomplish this lofty mission, we need driven people...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    AI Algorithm Engineer

    AI Algorithm Engineer

    VINNOVATION INC • Los Angeles, CA, US
    [job_card.full_time]
    Deeply participate in the architecture design, feature development, unit testing, and continuous maintenance of C++ core components within AI training and inference systems.Ensure high performance ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    AI Platform Architect

    AI Platform Architect

    VirtualVocations • North Hollywood, California, United States
    [job_card.full_time]
    A company is looking for an AI Platform Architect to lead AI assessments and proofs of concept for enterprise clients.Key Responsibilities Lead technical workshops to identify and prioritize AI a...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Principal Director, Applied AI & Advanced Infrastructure (A3I)

    Principal Director, Applied AI & Advanced Infrastructure (A3I)

    The Aerospace Corporation • El Segundo, CA, United States
    [job_card.full_time]
    The Aerospace Corporation is the trusted partner to the nation's space programs, solving the hardest problems and providing unmatched technical expertise. As the operator of a federally funded resea...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Network Design Engineer I / II Pipeline

    Senior Network Design Engineer I / II Pipeline

    Rocket Lab Corporation • Long Beach, CA, US
    [job_card.permanent]
    Rocket Lab is an end-to-end space company delivering responsive launch services, complete spacecraft design and manufacturing, payloads, satellite components, and more – all with the goal of ...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Sr. M365 AI Engineer

    Sr. M365 AI Engineer

    Konica Minolta Business Solutions • Los Angeles, CA, United States
    [job_card.full_time]
    All Covered, IT Managed Services Division of Konica Minolta Business Solutions, has an exciting opportunity available for a Sr. Developed and deliver Copilot and AI training sessions to end-users an...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Lead Software Engineer, AI Planning

    Lead Software Engineer, AI Planning

    Divergent • Los Angeles, California, US
    [job_card.full_time]
    Job Description Job Description Divergent is a technology company that has architected, invented, built, and commercialized an end-to-end factory system called the Divergent Adaptive Production S...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    HPC Software Engineer

    HPC Software Engineer

    Arete Associates • Los Angeles, California, US
    [job_card.full_time]
    Job Description Job Description At Areté, we are on the forefront of developing innovative solutions, with great minds from all backgrounds, to help solve the nation's most complex security chall...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior FPGA Engineer II - Network

    Senior FPGA Engineer II - Network

    CesiumAstro • El Segundo, California, US
    [job_card.full_time] +1
    Job Description Job Description Please Note : To conform with the United States Government Space Technology Export Regulations, the applicant must be a U. At CesiumAstro , we are developers and pio...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Distinguished Engineer, Cloud Architecture

    Distinguished Engineer, Cloud Architecture

    Saviynt • Los Angeles, CA, US
    [job_card.full_time]
    Saviynt is the most innovative cloud identity and access governance platform on the market.We secure hundreds of millions of identities at many of the world’s largest enterprises, helping the...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Generative AI Engineer

    Generative AI Engineer

    Regard • Los Angeles, California, US
    [job_card.full_time]
    Job Description Job Description As a Generative AI Engineer at Regard, you'll work across the full lifecycle of developing and deploying AI-driven features, from ideation and design to prototypin...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]