Talent.com
AI DevOps and Cloud Infrastructure Engineer
AI DevOps and Cloud Infrastructure EngineerCrowe • Springfield, IL, US
AI DevOps and Cloud Infrastructure Engineer

AI DevOps and Cloud Infrastructure Engineer

Crowe • Springfield, IL, US
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Your Journey at Crowe Starts Here

At Crowe, you can build a meaningful and rewarding career. With real flexibility to balance work with life moments, you're trusted to deliver results and make an impact. We embrace you for who you are, care for your well-being, and nurture your career. Everyone has equitable access to opportunities for career growth and leadership. Over our 80-year history, delivering excellent service through innovation has been a core part of our DNA across our audit, tax, and consulting groups. That's why we continuously invest in innovative ideas, such as AI-enabled insights and technology-powered solutions, to enhance our services. Join us at Crowe and embark on a career where you can help shape the future of our industry.

Job Description :

About Crowe AI Transformation

Everything we do is about making the future of human work more purposeful. We do this by leveraging state-of-the-art technologies, modern architecture, and industry experts to create AI-powered solutions that transform the way our clients do business.

The new AI Transformation team will build on Crowe's established AI foundation, furthering the capabilities of our Applied AI / Machine Learning team. By combining Generative AI, Machine Learning and Software Engineering, this team empowers Crowe clients to transform their business models through AI, irrespective of their current AI adoption stage.

As a member of AI Transformation, you will help distinguish Crowe in the market and drive the firm's technology and innovation strategy. The future is powered by AI, come build it with us.

About the Team

  • We invest in expertise. You'll have the time, space, and support to go deep in your projects and build lasting technical and strategic mastery. You'll work with developers, product stakeholders, and project managers as a trusted leader and domain expert.
  • We believe in continuous growth. Our team is committed to professional development and knowledge-sharing.
  • We protect balance. Our distributed team culture is grounded in trust and flexibility. We offer unlimited PTO, a flexible remote work policy, and a supportive environment that prioritizes sustainable, long-term performance.

About the Role

The AI DevOps and Cloud Infrastructure Manager leads teams responsible for designing, operating, and scaling AI / ML infrastructure, cloud platforms, and DevOps automation that support enterprise model training, inference, and generative AI workloads. This role is the strategy and execution of cloud-native, Kubernetes-based platforms that enable reliable, secure, and cost-efficient AI systems.

As a manager, this position combines hands-on technical leadership with people management, delivery ownership, and strategic decision-making. The role oversees distributed compute environments, GPU clusters, CI / CD pipelines, and vector-search infrastructure while ensuring high availability, resilience, and compliance with security and responsible AI standards. The manager partners closely with AI engineering, data engineering, product, and security teams, serves as the primary technical owner for assigned initiatives, and communicates system risks, tradeoffs, and progress to leadership.

Key responsibilities include :

  • Leading engineering teams responsible for AI / ML infrastructure, cloud operations, and MLOps automation.
  • Defining cloud, Kubernetes, and infrastructure strategy to support scalable model training, inference, and generative AI platforms.
  • Guiding the design and operation of distributed compute environments, GPU clusters, and vector database infrastructure.
  • Overseeing CI / CD pipelines that automate model training, testing, deployment, monitoring, and lifecycle management.
  • Managing incident response, failure analysis, and reliability engineering across AI platforms.
  • Directing performance testing, capacity planning, and cost optimization for AI infrastructure.
  • Ensuring compliance with cloud security, IAM practices, governance requirements, and responsible AI frameworks.
  • Implementing multi-cloud resilience patterns, high availability, and automated failover for critical AI workloads.
  • Supporting platform modernization initiatives, including adoption of optimized LLM runtimes and new orchestration technologies.
  • Evaluating third-party infrastructure tools, GPU scheduling solutions, and platform enhancements.
  • Communicating system status, dependencies, risks, and technical decisions to senior leadership.
  • Managing 45 direct reports, including coaching, performance management, and career development.
  • Owning project delivery, including budget, timelines, and quality of outcomes.
  • Coordinating with sales and stakeholders on project sizing, feasibility, and strategic opportunities.
  • Driving continuous improvement initiatives to advance DevOps maturity and AI infrastructure operational readiness.
  • Qualifications

  • 7+ years of professional experience in DevOps, cloud engineering, MLOps, or platform engineering.
  • 2+ years of experience in engineering leadership or senior technical leadership roles.
  • Expert proficiency with distributed cloud systems, Kubernetes, and infrastructure-as-code.
  • Advanced ability to troubleshoot infrastructure, networking, container, and deployment issues.
  • Proficiency in Python, Bash, or similar automation and scripting languages.
  • Strong understanding of monitoring, observability, and reliability engineering patterns.
  • Hands-on experience supporting infrastructure for ML or generative AI workloads.
  • Strong leadership, communication, and cross-functional collaboration skills.
  • Preferred Qualifications

  • Bachelor's degree in computer science, engineering, cloud computing, or a related field.
  • Master's degree in technical discipline.
  • Cloud and AI certifications, including Azure (AZ-900, AZ-104, AZ-305, AZ-700, AZ-800, AI-102) or equivalent AWS / GCP certifications.
  • Extensive experience with Kubernetes platforms (EKS, AKS, GKE) and cloud ML services (Azure ML, SageMaker).
  • Experience with GPU workload orchestration, optimization, and multi-tenant inference environments.
  • Expertise in observability and distributed tracing (Prometheus, Grafana, CloudWatch, OpenTelemetry).
  • Strong experience with Terraform and infrastructure governance at scale.
  • Familiarity with service mesh architectures (Istio, Linkerd) and advanced deployment patterns (blue / green, canary).
  • Advanced experience supporting generative AI platforms, including LLM inference runtimes (vLLM, TGI), RAG infrastructure, and vector databases (Pinecone, Weaviate, FAISS).
  • Experience operating fine-tuned LLMs (LoRA, QLoRA), managing GenAI CI / CD pipelines, and implementing hallucination, drift, and reliability monitoring.
  • Demonstrated ability to make strategic technical decisions within defined delivery and budget constraints.
  • [job_alerts.create_a_job]

    Cloud Infrastructure Engineer • Springfield, IL, US

    [internal_linking.similar_jobs]
    Principal Architect - AI & Datacenter

    Principal Architect - AI & Datacenter

    SHI GmbH • Springfield, IL, United States
    [job_card.full_time]
    Since 1989, SHI International Corp.We've grown every year since, and today we're proud to be a $16 billion global provider of IT solutions and services. Over 17,000 organizations worldwide rely on S...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Sr Manager, AI Engineer

    Sr Manager, AI Engineer

    CVS Health • Springfield, IL, United States
    [job_card.full_time]
    At CVS Health, we're building a world of health around every consumer and surrounding ourselves with dedicated colleagues who are passionate about transforming health care.As the nation's leading h...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Travel Echo Tech - $2,586 per week in Springfield, IL

    Travel Echo Tech - $2,586 per week in Springfield, IL

    AlliedTravelCareers • Springfield, IL, US
    [job_card.full_time]
    Coast Medical Service is a nationwide travel nursing & allied healthcare staffing agency dedicated to providing an elite traveler experience for the experienced or first-time traveler.Coast is ...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    GenAI PhD Applied Scientist - Oracle Cloud Infrastructure (OCI)

    GenAI PhD Applied Scientist - Oracle Cloud Infrastructure (OCI)

    Oracle • Springfield, IL, United States
    [job_card.full_time]
    Intended for students graduating with their Doctorate degree by December 2025, or have graduated within 12 months of start date. Austin, Nashville, Santa Clara, or Seattle Hub.Our future success dep...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Remote C# Software Engineer - AI Trainer

    Remote C# Software Engineer - AI Trainer

    SuperAnnotate • Springfield, Illinois, US
    [filters.remote]
    [job_card.full_time]
    This is an hourly-paid, fully remote contractor role where you will review AI-generated responses and / or generate C# / . NET engineering content, evaluating reasoning quality and step-by-step problem-...[show_more]
    [last_updated.last_updated_variable_days]
    Travel Echo Tech - $2,129 to $2,360 per week in Springfield, IL

    Travel Echo Tech - $2,129 to $2,360 per week in Springfield, IL

    AlliedTravelCareers • Springfield, IL, US
    [job_card.full_time]
    Ready to start your next travel adventure? LRS Healthcare offers a full benefits package, 24 / 7 support, and a responsive, traveler-first culture. What are you waiting for? Apply today!.Valid license...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    GenAI PhD Applied Scientist Intern - Oracle Cloud Infrastructure (OCI)

    GenAI PhD Applied Scientist Intern - Oracle Cloud Infrastructure (OCI)

    Oracle • Springfield, IL, United States
    [job_card.full_time]
    Must be enrolled in a university prior to and post internship.Target Internship Duration : May-Aug or June-Sept 2026.This position is located in office in Redwood Shores, CA or Seattle, WA.Our futur...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Principal Architect, Autonomous and Knowledge Platform

    Principal Architect, Autonomous and Knowledge Platform

    Teradata • Springfield, IL, United States
    [job_card.permanent]
    At Teradata, we believe that people thrive when empowered with better information.That's why we built the most complete cloud analytics and data platform for AI. By delivering harmonized data, trust...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Information Systems Specialist I - ServiceNow

    Information Systems Specialist I - ServiceNow

    Illinois Secretary of State • Springfield, IL, US
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Office of the Illinois Secretary of State Alexi Giannoulias Job Title : .Information Systems Specialist I – ServiceNow Division : Governance & Business Management...[show_more]
    [last_updated.last_updated_30]
    Data Science Manager, Analytics

    Data Science Manager, Analytics

    META • Springfield, IL, United States
    [job_card.full_time]
    As a Data Science Manager at Meta, you will play a key role in shaping the future of experiences for billions of people and hundreds of millions of businesses worldwide. You will apply your leadersh...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Diesel Fleet Techs III - Earn $35-$42.21 / Hr - $7.5k Sign-On + Benefits

    Diesel Fleet Techs III - Earn $35-$42.21 / Hr - $7.5k Sign-On + Benefits

    Sysco • Litchfield, IL, US
    [job_card.full_time]
    Sysco is Now Hiring Diesel Fleet Technicians in St.Hour • - Up to $7,500 Sign-On Bonus for New Hires.Annual Tool Allowance - Comprehensive Benefits. We offer our colleagues the opportunity to grow pe...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Software Engineer, Platform - Springfield, USA

    Software Engineer, Platform - Springfield, USA

    Speechify • Springfield, Illinois, United States
    [job_card.full_time]
    The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, G...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer, Core Experiences - Springfield, USA

    Senior Software Engineer, Core Experiences - Springfield, USA

    Speechify • Springfield, Illinois, United States
    [job_card.full_time]
    Speechify is the easiest way to listen to the world’s information.Articles on the web, documents in the cloud, books on your phone. We absorb it all and let you listen to it at your desk, on the go,...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Science Manager

    Data Science Manager

    Coinbase • Springfield, IL, United States
    [job_card.full_time]
    Ready to be pushed beyond what you think you’re capable of?.At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opportunity that demands the best of us, ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Travel Physical Therapist (PT) - $2,400 per week in Litchfield, IL

    Travel Physical Therapist (PT) - $2,400 per week in Litchfield, IL

    AlliedTravelCareers • Litchfield, IL, US
    [job_card.full_time]
    Client in IL seeking Physical Therapist : Home Health.We are looking for a healthcare professional who is ready to provide exceptional patient care in this contract / travel role.Contract / travel assig...[show_more]
    [last_updated.last_updated_30] • [promoted]
    MuleSoft Architect - Information Systems Advisor I

    MuleSoft Architect - Information Systems Advisor I

    Illinois Secretary of State • Springfield, IL, US
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Office of the Illinois Secretary of State Alexi Giannoulias Job Title : .MuleSoft Architect – Information Systems Advisor I Division : Systems & Programmin...[show_more]
    [last_updated.last_updated_30]
    Cust Svc Operations Analyst 2

    Cust Svc Operations Analyst 2

    Public Consulting Group • Springfield, IL, United States
    [job_card.full_time]
    Public Consulting Group LLC (PCG) is a leading public sector solutions implementation and operations improvement firm that partners with health, education, and human services agencies to improve li...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Analyst

    Analyst

    TradeJobsWorkforce • 62563 Rochester, IL, US
    [job_card.full_time]
    ESSENTIAL JOB FUNCTIONS Analyzes global markets for IT Services, servers, storage, backup, IT security, productivity software, remote monitoring services, hyperconvergence and IoT.Studies SMB and m...[show_more]
    [last_updated.last_updated_30] • [promoted]