Talent.com
Lead Site Reliability Engineer
Lead Site Reliability EngineerCox Automotive • Redan, GA, US
[error_messages.no_longer_accepting]
Lead Site Reliability Engineer

Lead Site Reliability Engineer

Cox Automotive • Redan, GA, US
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

The Lead Site Reliability Engineer will be part of the Site Reliability Engineering (SRE) team. The SRE team drives reliability, observability, and engineering practice maturity across over 150 teams made up of over a thousand engineers in our part of Cox Automotive. We build processes, documentation, and tools that scale : deep observability to detect and diagnose issues faster, engineering maturity assessments that drive measurable improvement, reusable golden paths that accelerate delivery, and trusted advisory relationships that align reliability with business priorities. Much of our work focuses on eliminating toil through automation and establishing self-service capabilities that multiply our impact. If you love building monitoring systems that reveal truth, evaluating engineering practices to raise the bar organization-wide, and acting as a trusted advisor to engineers and leadership, we want to talk to you. As a Lead Software Engineer, Site Reliability Engineering at Cox Automotive you will : Define and drive adoption of SLIs, SLOs, error budgets, and high-quality alerting standards across the organization Architect end-to-end observability strategies (metrics, logs, traces, business signals) with consistent taxonomy and discoverability Build centralized dashboards, reliability scorecards, and runbooks used by engineering teams and leadership Establish engineering practice maturity baselines and partner with teams on measurable improvement plans Create golden paths-standardized pipelines, infrastructure modules, and service templates-that enable rapid, consistent delivery Lead internal workshops, game days, and learning programs to spread operational excellence Act as a trusted advisor to product and engineering leadership, providing data-driven insights on reliability risk and trade-offs Guide post-incident reviews toward systemic remediation (guardrails, automation, design changes) rather than superficial fixes Design and extend self-service platforms for deployment, progressive delivery, and automated recovery Reduce MTTR through better telemetry, automation, and resilience patterns Mentor engineers across teams to become local reliability champions, scaling SRE impact without adding headcount Qualifications : Experience programming in at least one of the following languages : Python, Typescript, or Java. Bachelor's degree in a related discipline and 6 years' experience in a related field. The right candidate could also have a different combination, such as a master's degree and 4 years' experience; a Ph.D. and 1 year of experience; or 18 years' experience in a related field. Applicants must currently be authorized to work in the United States for any employer without current or future sponsorship. No OPT, CPT, STEM / OPT or visa sponsorship now or in future. Expertise in designing, analyzing, and troubleshooting large-scale distributed systems. Deep hands-on experience with modern observability tools (CloudWatch and NewRelic) Proven ability to assess engineering practices and drive measurable improvements across multiple teams. Experience establishing SLIs / SLOs, managing error budgets, and improving alert signal-to-noise ratios. Strong background in release engineering, CI / CD, and progressive deployment strategies. Deep expertise in AWS, Terraform, AWS CDK, and GitHub / GitHub Actions. Track record reducing MTTR and improving availability through automation and architectural improvements. Excellent written and verbal communication skills tailored to both engineers and executives. Systematic problem-solving approach with a sense of drive and ownership. Understanding of Linux operating systems, networking, and performance fundamentals. Ability to build trust and influence decisions through data-driven insights. Experience facilitating effective post-incident analysis and driving systemic remediation. Desire to work in a fast-paced, evolving, growing, dynamic environment. USD 119,600.00 - 199,400.00 per year Compensation : Compensation includes a base salary of $119,600.00 - $199,400.00. The base salary may vary within the anticipated base pay range based on factors such as the ultimate location of the position and the selected candidate's knowledge, skills, and abilities. Position may be eligible for additional compensation that may include an incentive program. Benefits : The Company offers eligible employees the flexibility to take as much vacation with pay as they deem consistent with their duties, the company's needs, and its obligations; seven paid holidays throughout the calendar year; and up to 160 hours of paid wellness annually for their own wellness or that of family members. Employees are also eligible for additional paid time off in the form of bereavement leave, time off to vote, jury duty leave, volunteer time off, military leave, and parental leave.

[job_alerts.create_a_job]

Site Reliability Engineer • Redan, GA, US

[internal_linking.similar_jobs]
Entry Level Packaging Systems Design Engineer - May 2026 Start

Entry Level Packaging Systems Design Engineer - May 2026 Start

Dennis Group for New Grads, Co-Ops & Internships • Duluth, GA, US
[job_card.full_time]
Packaging Engineers work closely with our process, controls, and building system engineers to scope, layout, install, and commission packaging equipment lines for our food and beverage industry cli...[show_more]
[last_updated.last_updated_30] • [promoted]
Plant Engineering & Reliability Manager

Plant Engineering & Reliability Manager

Ascend Elements • Covington, GA, US
[job_card.full_time]
Ascend Element is revolutionizing the production of lithium-ion battery materials by establishing a clean and sustainable supply chain using recycled feedstock. Its patented Hydro-to-Cathode™ ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Entry Level Construction

Entry Level Construction

Brothers that just do Gutters- Atlanta • Suwanee, GA, US
[job_card.full_time]
Would you like to start a career in the construction industry?.Would you like to learn a skill that is in demand and will allow you to be paid well?. The Brothers that Just do Gutters is a National ...[show_more]
[last_updated.last_updated_less] • [promoted] • [new]
Site Safety Manager

Site Safety Manager

Ace Electric • Covington, GA, US
[job_card.full_time]
Our Mission is to Identify, Hire, Train and Retain the very best people! Could that be you?.Join the Ace Electric team for opportunities to work with the best team and build your career with Ace Un...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Group Leader - Land Development / Civil Engineering

Group Leader - Land Development / Civil Engineering

Ardurra Group, Inc. • Buford, GA, US
[job_card.full_time]
Ardurra is looking to hire an experienced Civil Engineering leader for our Land Development practice in Atlanta, GA.Our civil engineers, urban planners, and staff scientists work together daily to ...[show_more]
[last_updated.last_updated_30] • [promoted]
Field Engineer - Industrial - Atlanta, GA

Field Engineer - Industrial - Atlanta, GA

Reeves Young LLC • Buford, GA, US
[job_card.full_time]
At Reeves Young, everything we do – from 30 feet below the ground to 30 floors above – is about people.The culture we cultivate spreads throughout our employees and flows into the relat...[show_more]
[last_updated.last_updated_30] • [promoted]
Lead Cloud Data Engineer - Remote Opportunity

Lead Cloud Data Engineer - Remote Opportunity

The Mutual Group • Duluth, GA, US
[filters.remote]
[job_card.full_time]
We are building a next-generation Cloud Data Platform to unify data from Policy, Claims, Billing, and Administration systems into a single source of truth. We are seeking a Lead Cloud Data Engineer ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Travel CT Tech - $2132.68 / Week

Travel CT Tech - $2132.68 / Week

Atlas MedStaff • Covington, GA, US
[job_card.full_time]
Atlas MedStaff is seeking an experienced CT Tech for an exciting Travel Allied job in Covington, GA.Shift : 3x12 hr nights Start Date : 01 / 12 / 2026 Duration : 12 weeks Pay : $2132.Atlas Medstaff is curr...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Reliability Engineer

Senior Reliability Engineer

Viasat • Duluth, GA, United States
[job_card.full_time]
At Viasat, we're on a mission to deliver connections with the capacity to change the world.For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries arou...[show_more]
[last_updated.last_updated_30] • [promoted]
Lead Engineer - Hotel

Lead Engineer - Hotel

Hilton Garden Inn Atlanta East / Stonecrest • Lithonia, GA, US
[job_card.full_time]
The hotel lead engineer is responsible for overseeing the maintenance and repair of the hotel's physical plant, including its structure, plumbing, electrical, and HVAC systems.This leaders...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Analog Design Engineer

Senior Analog Design Engineer

Cypress HCM • Suwanee, GA, US
[job_card.full_time]
Senior Design Engineer – Analog.Semiconductor, integrated circuits, memory |.Contributing to design tasks in a small team environment, with a focus on power components for integrated circuits...[show_more]
[last_updated.last_updated_30] • [promoted]
NW Deployment Build Lead I, GND - DCC Communities

NW Deployment Build Lead I, GND - DCC Communities

Georgia Staffing • Covington, GA, US
[job_card.full_time]
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure.We're the people who keep the cloud running. We support all AWS data centers and all o...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
CRNA Needed in West GA!

CRNA Needed in West GA!

HealthEcareers - Client • Lithonia, Georgia, United States
[job_card.full_time]
Board Certified or Board Eligible.Credentialing Timeframe : Approx 90 Days.There might be exceptions made on the radius rule. The operational team will make the final decision.Must be willing to cove...[show_more]
[last_updated.last_updated_30] • [promoted]
Travel CT Tech - $2416 / Week

Travel CT Tech - $2416 / Week

Cynet Health • Covington, GA, US
[job_card.full_time]
Cynet Health is seeking an experienced CT Tech for an exciting Travel Allied job in Covington, GA.Shift : 3x12 hr nights Start Date : 01 / 12 / 2026 Duration : 12 weeks Pay : $2416 / Week.Ranked #5 Best Tr...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Systems Engineer

Systems Engineer

Verinext • Duluth, GA, US
[job_card.full_time]
Join Verinext, a technology company that's not just keeping up with the future, but actively shaping it.At Verinext, we firmly believe that work should be as enjoyable as it is rewarding.You...[show_more]
[last_updated.last_updated_30] • [promoted]
Process / Project Engineer (Mid-Level)

Process / Project Engineer (Mid-Level)

Dennis Group Atlanta • Duluth, GA, US
[job_card.full_time]
Dennis Group’s process engineers are key in our projects of designing and building food and beverage processing facilities. Process Engineers work in every aspect of a project - controls, pack...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Director, Solutions Development

Director, Solutions Development

Mitsubishi Electric US, Inc. • Suwanee, GA, US
[job_card.full_time]
The Director of Solutions Development is a visionary and strategic leader responsible for driving the end-to-end development of advanced HVAC control and software solutions.This role plays a critic...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Systems Engineer

Senior Systems Engineer

Mujin • Suwanee, GA, US
[job_card.full_time]
Mujin is the future of industrial robotic systems in production and distribution environments.Our technology gives robots perception and awareness, enabling them to take on more advanced tasks.Our ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]