Talent.com
AI DevOps and Cloud Infrastructure Engineer
AI DevOps and Cloud Infrastructure EngineerCrowe • Los Angeles, CA, US
[error_messages.no_longer_accepting]
AI DevOps and Cloud Infrastructure Engineer

AI DevOps and Cloud Infrastructure Engineer

Crowe • Los Angeles, CA, US
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

divh2Your Journey at Crowe Starts Here / h2pAt Crowe, you can build a meaningful and rewarding career. With real flexibility to balance work with life moments, youre trusted to deliver results and make an impact. We embrace you for who you are, care for your well-being, and nurture your career. Everyone has equitable access to opportunities for career growth and leadership. Over our 80-year history, delivering excellent service through innovation has been a core part of our DNA across our audit, tax, and consulting groups. Thats why we continuously invest in innovative ideas, such as AI-enabled insights and technology-powered solutions, to enhance our services. Join us at Crowe and embark on a career where you can help shape the future of our industry. / ppJob Description : / ppAbout Crowe AI Transformation / ppEverything we do is about making the future of human work more purposeful. We do this by leveraging state-of-the-art technologies, modern architecture, and industry experts to create AI-powered solutions that transform the way our clients do business. / ppThe new AI Transformation team will build on Crowes established AI foundation, furthering the capabilities of our Applied AI / Machine Learning team. By combining Generative AI, Machine Learning and Software Engineering, this team empowers Crowe clients to transform their business models through AI, irrespective of their current AI adoption stage. / ppAs a member of AI Transformation, you will help distinguish Crowe in the market and drive the firms technology and innovation strategy. The future is powered by AI, come build it with us. / ppAbout the Team / pulliWe invest in expertise. Youll have the time, space, and support to go deep in your projects and build lasting technical and strategic mastery. Youll work with developers, product stakeholders, and project managers as a trusted leader and domain expert. / liliWe believe in continuous growth. Our team is committed to professional development and knowledge-sharing. / liliWe protect balance. Our distributed team culture is grounded in trust and flexibility. We offer unlimited PTO, a flexible remote work policy, and a supportive environment that prioritizes sustainable, long-term performance. / li / ulpAbout the Role / ppThe AI DevOps and Cloud Infrastructure Manager leads teams responsible for designing, operating, and scaling AI / ML infrastructure, cloud platforms, and DevOps automation that support enterprise model training, inference, and generative AI workloads. This role is the strategy and execution of cloud-native, Kubernetes-based platforms that enable reliable, secure, and cost-efficient AI systems. / ppAs a manager, this position combines hands-on technical leadership with people management, delivery ownership, and strategic decision-making. The role oversees distributed compute environments, GPU clusters, CI / CD pipelines, and vector-search infrastructure while ensuring high availability, resilience, and compliance with security and responsible AI standards. The manager partners closely with AI engineering, data engineering, product, and security teams, serves as the primary technical owner for assigned initiatives, and communicates system risks, tradeoffs, and progress to leadership. / ppKey responsibilities include : / pulliLeading engineering teams responsible for AI / ML infrastructure, cloud operations, and MLOps automation. / liliDefining cloud, Kubernetes, and infrastructure strategy to support scalable model training, inference, and generative AI platforms. / liliGuiding the design and operation of distributed compute environments, GPU clusters, and vector database infrastructure. / liliOverseeing CI / CD pipelines that automate model training, testing, deployment, monitoring, and lifecycle management. / liliManaging incident response, failure analysis, and reliability engineering across AI platforms. / liliDirecting performance testing, capacity planning, and cost optimization for AI infrastructure. / liliEnsuring compliance with cloud security, IAM practices, governance requirements, and responsible AI frameworks. / liliImplementing multi-cloud resilience patterns, high availability, and automated failover for critical AI workloads. / liliSupporting platform modernization initiatives, including adoption of optimized LLM runtimes and new orchestration technologies. / liliEvaluating third-party infrastructure tools, GPU scheduling solutions, and platform enhancements. / liliCommunicating system status, dependencies, risks, and technical decisions to senior leadership. / liliManaging 45 direct reports, including coaching, performance management, and career development. / liliOwning project delivery, including budget, timelines, and quality of outcomes. / liliCoordinating with sales and stakeholders on project sizing, feasibility, and strategic opportunities. / liliDriving continuous improvement initiatives to advance DevOps maturity and AI infrastructure operational readiness. / li / ulpQualifications / pulli7+ years of professional experience in DevOps, cloud engineering, MLOps, or platform engineering. / lili2+ years of experience in engineering leadership or senior technical leadership roles. / liliExpert proficiency with distributed cloud systems, Kubernetes, and infrastructure-as-code. / liliAdvanced ability to troubleshoot infrastructure, networking, container, and deployment issues. / liliProficiency in Python, Bash, or similar automation and scripting languages. / liliStrong understanding of monitoring, observability, and reliability engineering patterns. / liliHands-on experience supporting infrastructure for ML or generative AI workloads. / liliStrong leadership, communication, and cross-functional collaboration skills. / li / ulpPreferred Qualifications / pulliBachelors degree in computer science, engineering, cloud computing, or a related field. / liliMasters degree in technical discipline. / liliCloud and AI certifications, including Azure (AZ-900, AZ-104, AZ-305, AZ-700, AZ-800, AI-102) or equivalent AWS / GCP certifications. / liliExtensive experience with Kubernetes platforms (EKS, AKS, GKE) and cloud ML services (Azure ML, SageMaker). / liliExperience with GPU workload orchestration, optimization, and multi-tenant inference environments. / liliExpertise in observability and distributed tracing (Prometheus, Grafana, CloudWatch, OpenTelemetry). / liliStrong experience with Terraform and infrastructure governance at scale. / liliFamiliarity with service mesh architectures (Istio, Linkerd) and advanced deployment patterns (blue / green, canary). / liliAdvanced experience supporting generative AI platforms, including LLM inference runtimes (vLLM, TGI), RAG infrastructure, and vector databases (Pinecone, Weaviate, FAISS). / liliExperience operating fine-tuned LLMs (LoRA, QLoRA), managing GenAI CI / CD pipelines, and implementing hallucination, drift, and reliability monitoring. / liliDemonstrated ability to make strategic technical decisions within defined delivery and budget constraints. / li / ul / div

[job_alerts.create_a_job]

AI DevOps and Cloud Infrastructure Engineer • Los Angeles, CA, US

[internal_linking.similar_jobs]
Lead AI Infrastructure & Tooling Engineer

Lead AI Infrastructure & Tooling Engineer

The Walt Disney Company (France) • Santa Monica, CA, United States
[job_card.full_time]
A major multimedia and entertainment corporation is seeking a Lead Software Engineer to innovate and enhance digital products across platforms. You will lead a team in developing scalable software f...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Infrastructure & Cloud Engineering Manager

Infrastructure & Cloud Engineering Manager

WPS • Torrance, CA, United States
[job_card.full_time]
Job Title : Manager, Infrastructure & Cloud Engineering.Director of Technology Operations & Information Security Officer. Department : Technology Operations.The Manager, Infrastructure & Cloud Enginee...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Databricks Data Engineer with DevOps

Databricks Data Engineer with DevOps

Apptad Inc • Los Angeles, CA, United States
[job_card.full_time]
[filters_job_card.quick_apply]
Job Description : We are looking for an experienced Databricks Data Engineer with strong DevOps expertis...[show_more]
[last_updated.last_updated_variable_days]
Staff Software Engineer - Cloud Platform

Staff Software Engineer - Cloud Platform

Northwoodspace • Los Angeles, California, United States
[job_card.full_time] +1
Northwood is on a mission to transform connectivity between earth and space and bring the benefits of space to the masses through innovations in space communications technologies.If you are energiz...[show_more]
[last_updated.last_updated_30] • [promoted]
Lead AI Infrastructure & Tooling Engineer

Lead AI Infrastructure & Tooling Engineer

The Walt Disney Company • Santa Monica, CA, United States
[job_card.full_time]
A prominent entertainment firm is seeking a Lead Software Engineer to develop AI / ML frameworks and contribute to innovative tooling across products. This role involves mentorship of junior developer...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Azure Cloud Engineer

Senior Azure Cloud Engineer

Unisys Corporation • Long Beach, CA, United States
[job_card.full_time]
What success looks like in this role : .We are seeking a highly skilled Senior Azure Cloud Engineer with proven expertise in designing and deploying multitenant Microsoft Sentinel environments.The id...[show_more]
[last_updated.last_updated_30] • [promoted]
Linux Infrastructure-as-Code DevOps Engineer

Linux Infrastructure-as-Code DevOps Engineer

The Aerospace Corporation • El Segundo, CA, United States
[job_card.full_time]
The Aerospace Corporation is the trusted partner to the nation's space programs, solving the hardest problems and providing unmatched technical expertise. As the operator of a federally funded resea...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Solutions Engineer — Cloud & DevOps Architect

Senior Solutions Engineer — Cloud & DevOps Architect

IBM • Santa Monica, CA, United States
[job_card.full_time]
A leading technology company is seeking a Sr.Solutions Engineer to transform customer challenges using HashiCorp offerings. This role involves acting as a trusted advisor, guiding customers through ...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Fullstack Engineer Agentic Infrastructure Job ID 2025-9881

Senior Fullstack Engineer Agentic Infrastructure Job ID 2025-9881

Internet Brands • El Segundo, California, United States
[job_card.full_time]
Senior Fullstack Engineer : Agentic Infrastructure Team.Internet Brands is looking for a Senior Fullstack Engineer to join our Agentic AI team. You'll help build our AI capabilities from the ground u...[show_more]
[last_updated.last_updated_30] • [promoted]
Hybrid Cloud Solutions Architect - AWS, API & Data

Hybrid Cloud Solutions Architect - AWS, API & Data

Sharp Decisions • Torrance, CA, United States
[job_card.full_time]
A client company in technology solutions is seeking an Enterprise Solutions Architect for a hybrid role based in Torrance, CA. This position involves driving alignment between business and IT, desig...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Software Engineer, Infrastructure

Senior Software Engineer, Infrastructure

Sift Stack • El Segundo, California, United States
[job_card.full_time]
At Sift, we’re redefining how modern machines are built, tested, and operated.Our platform gives engineers real-time observability over high-frequency telemetry—eliminating bottlenecks and enabling...[show_more]
[last_updated.last_updated_30] • [promoted]
Information Security Engineer (CISSP, CISM)

Information Security Engineer (CISSP, CISM)

TechSource • San Fernando, CA, US
[job_card.full_time]
Please send me your updated resume at ashwini@tsourceinc.Position : Information Security Engineer (CISSP, CISM).Location : San Fernando Valley ,CA (Hybrid). Client : Hospital & Healthcare.Bachelor ...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Cloud Engineer

Cloud Engineer

GPL Technologies • Los Angeles, CA, US
[job_card.full_time]
[filters_job_card.quick_apply]
Cloud Engineer - Media & Entertainmen Workflows At GPL Technologies, our mission is to provide powerful, reliable, and innovative technology services and leadership to creative companies ...[show_more]
[last_updated.last_updated_30]
Senior Systems Engineer (Modeling & Simulation)

Senior Systems Engineer (Modeling & Simulation)

Raytheon • Burbank, California, United States of America
[job_card.full_time] +1
US-MA-TEWKSBURY-TB3 ~ 50 Apple Hill Dr ~ CONCORD BLDG, Tewksbury Tb3 300 Concord.Person, or Immigration Status Requirements : . At Raytheon, the foundation of everything we do is rooted in our values ...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Principal Director, Applied AI & Advanced Infrastructure (A3I)

Principal Director, Applied AI & Advanced Infrastructure (A3I)

The Aerospace Corporation • El Segundo, CA, United States
[job_card.full_time]
The Aerospace Corporation is the trusted partner to the nation's space programs, solving the hardest problems and providing unmatched technical expertise. As the operator of a federally funded resea...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Kubernetes Engineer

Kubernetes Engineer

The Aerospace Corporation • El Segundo, CA, United States
[job_card.full_time]
The Aerospace Corporation is the trusted partner to the nation's space programs, solving the hardest problems and providing unmatched technical expertise. As the operator of a federally funded resea...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Technology Development Operations Engineer

Senior Technology Development Operations Engineer

Cooley LLP • Santa Monica, CA, United States
[job_card.full_time]
Senior Technology Development Operations Engineer.Cooley is seeking a Senior DevOps Engineer to join the.Infrastructure & Development Operations. The Technology Development Operations (DevOps) Engin...[show_more]
[last_updated.last_updated_30] • [promoted]
Software Engineer, Platform - Burbank, USA

Software Engineer, Platform - Burbank, USA

Speechify • Burbank, California, United States
[job_card.full_time]
The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, G...[show_more]
[last_updated.last_updated_30] • [promoted]