Talent.com

Reliability Jobs in Cary, NC

Create a job alert for this search

Reliability • cary nc

Last updated: 3 hours ago

Senior Manager AI Reliability Engineering

LenovoMorrisville, North Carolina, United States of America
Full-time

AI platform that spans Windows, Android, and cloud.As part of this vision, we are expanding the reliability engineering organization powering .Lenovo’s cross‑device Personal AI that operates seamle... Show more

Deployment Site Reliability Engineer - Associate

Deutsche BankCary, 3000 CentreGreen Way
Full-time

Deployment Site Reliability Engineer.As a Deployment Site Reliability Engineer, you will be responsible for planning and executing complex network deployments that support high availability and per... Show more

Utilities Energy Reliability Consultant - Manager

PwCRaleigh,NC
Full-time

SummaryAt PwC, our people in operations consulting specialise in providing consulting services on optimising operational efficiency and effectiveness.These individuals analyse client needs, develop... Show more

Maintenance Technician II

Reddy Ice Manufacturing & DistributionRaleigh, NC, US
Full-time
Quick Apply

POSITION PURPOSE: (2nd shift 1p - 9p & 1st Shift 9a - 5p).This role accelerates business value by responsibilities for the reliable operation and maintenance of the plant’s manufacturing system... Show more

Morning Babysitter for Elementary Age Children

WyndyRaleigh
Part-time
Quick Apply

We are seeking a reliable and punctual caregiver to assist with the morning routine for two boys, ages 6 and 9, in Durham, NC.The position is part-time with morning shifts required from Monday to F... Show more

Service Technician

Accentuate Staffing Morrisville, NC, US
Full-time

Perform service visits at customer locations across the U.Support equipment installations and commissioning activities at customer sites worldwide.Receive hands-on training with machines and system... Show more

Network SRE

Piper CompaniesRaleigh, NC
Full-time

Piper Companies is seeking an experienced.Responsibilities of the Network SRE include:.Design and implement scalable, secure, and highly available network architectures in AWS to support enterprise... Show more

Maintenance Planner

KellanovaCary, NC, US
Part-time +1

Do you have a passion for work planning and the maintenance trade? We are looking for our next thought leader who has a vision for excellent results at our.Muncy, PA facility as a Maintenance Plann... Show more

 • New!

Reliability Engineer

CatalentMorrisville, NC
Full-time

Monday - Friday, 8:00am to 5:00pm.The Morrisville (MSV) facility is Catalent’s center of excellence for nasal product development and manufacturing, providing end‑to‑end services from early formula... Show more

Remote Image Annotation Expert - AI Trainer ($30-$30 per hour)

MercorApex, North Carolina, US
Remote
Full-time

We are looking for detail-oriented individuals with strong visual reasoning skills to annotate images and diagrams for AI training.In this role, you will answer precise visual reasoning questions g... Show more

QUALITY ASSURANCE SPECIALIST (TITLE 32)

Department of the ArmyMorrisville, NC, United States
Full-time

As a Quality Assurance Specialist, GS-1910-09, you will develop AASF Quality Management Plan; develop quality assessment plans; develop local regulations and/or operating instructions for implement... Show more

Lead Reliability Engineer

TruistRaleigh, NC
Full-time +1

ESSENTIAL DUTIES AND RESPONSIBILITIES.Following is a summary of the essential functions for this job.Other duties may be performed, both major and minor, which are not mentioned below.Specific acti... Show more

Site Reliability Engineer

Alltech Consulting ServicesRaleigh
Full-time

SRE with strong Unix background who can code in Python and Ansible.Candidate coming from past Unix SA background is a best fit. Show more

Twin-Ax Cable Product Engineer

Amphenol TCSRaleigh, NC, US
Full-time

Position Summary: Twin-Ax Cable Product Engineer.The Twin-Ax Cable Product Engineer is the technical owner for high-speed copper cable assemblies, responsible for ensuring mechanical reliabili... Show more

Twin-Ax Cable Product Engineer

Amphenol ICCNorth Carolina, Raleigh
Full-time

Product Ownership & Technical Leadership.Serve as the technical expert for Twin-Ax cable and cable assemblies, ensuring product quality, reliability, and mechanical integrity.Maintain and manage pr... Show more

CDL Class A Driver OTR

CORNERSTONE TRUCKING LLCCary, NC, US
Full-time

Cornerstone Trucking, LLC is a leading logistics and transportation company committed to delivering exceptional service.We pride ourselves on our reliability, professionalism, and dedication to our... Show more

AI User Experience Reliability Lead

LenovoMorrisville, North Carolina, United States of America
Full-time

AI platform spanning Windows, Android, and cloud.As part of this vision, we are expanding the engineering organization behind.Lenovo’s cross‑device Personal AI that delivers intelligent, safe, and ... Show more

CDL-A Owner-Operators (Growing Freight Demand)

Optimal Dispatch Service LLCRaleigh, North Carolina, United States
Full-time

Growing freight demand in southeastern corridors has created opportunities for CDL-A owner-operators to maintain consistent operations.This role emphasizes reliability and coordination.Independent ... Show more

Sr. DevOps Engineer – TS/SCI

Zachary PiperRALEIGH, North Carolina
Full-time

Zachary Piper Solutions is seeking a Sr.DevOps Engineer – TS/SCI for a mission-driven organization operating within the national security and advanced technology industry in the Raleigh Durham, NC ... Show more

People also ask
The cities near Cary, NC that boast the highest number of reliability jobs are:
Senior Manager AI Reliability Engineering

Senior Manager AI Reliability Engineering

LenovoMorrisville, North Carolina, United States of America
30+ days ago
Job type
  • Full-time
Job description

Description and Requirements

About Our Team

Lenovo is building Quantum, a next‑generation hybrid AI platform that spans Windows, Android, and cloud. As part of this vision, we are expanding the reliability engineering organization powering Qira, Lenovo’s cross‑device Personal AI that operates seamlessly across Lenovo and Motorola products.

We are hiring a Senior Manager, AI Reliability Engineering to lead the engineering teams responsible for Qira’s foundational reliability capabilities — including system‑level observability, telemetry, performance engineering, resiliency architecture, and the reliability of Qira’s hybrid edge/cloud AI service.

This is a high‑impact leadership role shaping how we measure, operate, and improve reliability across one of Lenovo’s most ambitious AI initiatives.


Location: Open to remote work in the US. The preferred work location is Chicago, IL.

What You’ll Do

Engineering Leadership

  • Lead and grow multiple engineering teams focused on reliability, observability, and system performance across Qira’s hybrid AI ecosystem.

  • Define strategy, roadmaps, and priorities to improve reliability, insight, and operational readiness across device, edge, and cloud systems.

  • Champion reliability as an engineering discipline through design patterns, best practices, and a culture of continuous improvement.

Observability & Telemetry

  • Own the systems that deliver metrics, logs, traces, distributed tracing, AI‑specific signals, dashboards, and alerting.

  • Drive the adoption of unified telemetry standards and instrumentation across all Qira components.

  • Ensure engineers have actionable insight into performance, reliability, cost, and AI behavior.

Service Reliability & Performance Engineering

  • Lead engineering efforts to improve the reliability, performance, and scalability of Qira’s service architecture — including inference, retrieval, data pipelines, and hybrid edge/cloud workflows.

  • Drive the design and adoption of resilience patterns such as graceful degradation, fallback paths, bulkheads, and rate‑limiting strategies.

  • Oversee capacity planning, cost optimization, and performance tuning for high‑throughput AI systems.

System Design & Architectural Influence

  • Work with cross‑functional engineering teams to embed reliability early in the design process (“shift left”).

  • Guide architectural decisions to ensure Qira’s engineering foundations remain stable, observable, and predictable at scale.

  • Set service readiness standards for new components entering production.

Cross‑Functional Collaboration

  • Partner with Applied AI/ML Engineering, Platform Engineering, Firmware, Product, and Security to align reliability goals with Qira’s broader roadmap.

  • Collaborate closely with the incident management and operations teams to ensure strong signal quality, runbook depth, and operational tooling.

  • Act as a reliability engineering representative in executive and engineering leadership forums.

Team & Talent Development

  • Hire and develop world‑class engineers across observability, reliability, and performance domains.

  • Provide coaching, mentorship, and clear technical and leadership career paths.

  • Foster a culture of ownership, operational craftsmanship, and data‑driven engineering.

Basic Qualifications

  • 12+ years of experience in Site Reliability Engineering, Observability Engineering, Platform Engineering, or large‑scale distributed systems, including 5+ years leading engineering teams.

  • Bachelor’s Degree in Computer Science, Engineering, or a related technical field.

  • Engineering experience in several of the following:

  • Observability systems (OpenTelemetry, metrics/logs/traces)

  • Distributed systems reliability and performance

  • Cloud infrastructure (Azure preferred)

  • Kubernetes and containerized environments

  • CI/CD pipelines and deployment workflows

  • Infrastructure-as-Code (Terraform, Bicep, etc.)

  • Deep understanding of Linux systems, networking, scalability, and system performance fundamentals.

  • Proven ability to lead engineering teams and drive cross‑organizational initiatives.

Preferred Qualifications

  • Experience building or operating large‑scale telemetry and observability platforms.

  • Hands‑on experience with Grafana, Prometheus, Loki, Tempo, or similar tooling.

  • Experience supporting AI/ML inference systems, vector databases, or GPU‑accelerated compute.

  • Background in hybrid systems spanning device, edge, and cloud.

  • Experience implementing resilience patterns and reliability frameworks.

  • Experience with SLOs, SLIs, error budgets, and reliability governance.

  • Passion for building scalable reliability engineering teams and systems.

Why This Role Matters

Qira’s reliability is mission‑critical to delivering a safe, fast, and trustworthy AI experience to millions of users.
In this role, you will:

  • Build the telemetry and reliability insights that power Qira

  • Architect the service‑level reliability patterns that keep Qira stable at scale

  • Lead the engineering teams that ensure Qira performs predictably across devices, edge, and cloud

  • Shape how reliability engineering is practiced across Lenovo’s AI ecosystem

This is a rare opportunity to define the engineering foundation of a next‑generation global AI platform.


The base salary budgeted range for this position is $190K - $230K. Individuals may also be considered for bonus and/or commission.

Lenovo’s various benefits can be found on