Talent.com
Site Reliability Engineering (SRE) Architect
Site Reliability Engineering (SRE) ArchitectQTech • Atlanta, Georgia, USA
Site Reliability Engineering (SRE) Architect

Site Reliability Engineering (SRE) Architect

QTech • Atlanta, Georgia, USA
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Job Title : Site Reliability Engineering (SRE) Architect

Location : Atlanta Georgia (Hybrid)

Long Term Contract

Looking for W2 Candidates. No C2C

Job Discription :

As an SRE Architect you will be a pivotal technical leader responsible for designing building and evolving the foundational systems and practices that ensure the reliability scalability performance and efficiency of our critical services. Moving beyond day-to-day operations you will focus on the strategic architectural direction of SRE function defining standards blueprints and frameworks that enable development teams and fellow SRE operations team to build and operate highly resilient systems. Leverage deep expertise in software engineering distributed systems cloud infrastructure and SRE principles to influence technology choices establish best practices and foster a proactive culture of reliability across the organization and much beyond observability pillar.

Key Responsibilities :

1. Reliability Strategy & Design :

o Architect and design highly available scalable secure and cost-effective infrastructure and application patterns on AWS

o Define and evangelize SRE best practices standards and blueprints for service design deployment monitoring and operational readiness across the engineering organization

o Review current observability implementation to identify gaps and define steps to reach next level maturity of observability setup to provide deep insights into system health and behaviour

o With overall maturity lead the definition and implementation strategy for Service Level Indicators (SLIs) Service Level Objectives (SLOs) and Error Budgets for critical services

2. Platform Architecture & Automation :

o Design solutions to systematically reduce operational toil through automation and improved system design

o Evaluate current SRE tools and automation frameworks (e.g. CI / CD pipelines Infrastructure as Code modules automated incident remediation chaos engineering platforms) and suggest enhancement that will help overall enhancement of capability

o Evaluate prototype and recommend new technologies tools and methodologies to enhance system reliability developer productivity and operational efficiency

3. Technical Leadership & Consultation :

o Act as a senior technical advisor and subject matter expert on reliability scalability and performance for development and platform teams

o Provide architectural guidance during the design phase of new services and features to ensure reliability principles are embedded early (shift-left)

o Mentor and coach other SREs and engineers fostering technical excellence and adherence to SRE principles

o Lead architectural reviews and production readiness assessments for critical systems

4. Resilience :

o Lead blameless postmortems for significant incidents ensuring root causes are identified and systemic architectural improvements are prioritized and implemented

o Architect and advocate for resilience patterns (e.g. circuit breaking rate limiting graceful degradation chaos engineering) within applications and infrastructure

Required Qualifications :

Proven experience in an architectural role designing solutions for reliability scalability and performance

Deep understanding and practical application of SRE principles (SLIs / SLOs error budgets toil reduction automation incident management postmortems)

Expertise in cloud computing platforms (e.g. AWS) including infrastructure networking and security services

Strong experience with containerization and orchestration technologies (Kubernetes Docker serverless computing)

Solid experience designing and implementing observability solutions (e.g. Dynatrace Prometheus Grafana ELK / EFK Stack Jaeger OpenTelemetry)

Strong programming / scripting skills (e.g. Python Go Bash) for automation and tool development

Excellent analytical problem-solving and strategic thinking skills.

Strong communication collaboration and leadership skills with the ability to influence technical direction across teams

Preferred Qualifications :

Experience designing and implementing chaos engineering practices and platforms

Best Regards

Tarun K

Phone : 1-

Email : Key Skills

Fashion Retail,Highway Design,Apache Web Server,Atl,CAD CAM,ABAP

Employment Type : Full Time

Experience : years

Vacancy : 1

[job_alerts.create_a_job]

Site Reliability Sre • Atlanta, Georgia, USA

[internal_linking.related_jobs]
SRE Architect

SRE Architect

Blue Ribbon Global Technologies • Atlanta, Georgia, USA
[job_card.full_time] +1
Client : Xebia / Delta Airlines.As an SRE Architect you will be a pivotal technical leader responsible for designing building and evolving the foundational systems and practices that ensure the reli...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Cloudious LLC • Atlanta, Georgia, USA
[job_card.full_time]
Senior Site Reliability Engineer.Manage and optimize data streaming and API components in OpenShift Onpremise and AWS.Proactively review the applications APIs and processes to identify opportunitie...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Travel Registered Respiratory Therapist (RRT) - $1,535 to $1,735 per week in Fayetteville, GA

Travel Registered Respiratory Therapist (RRT) - $1,535 to $1,735 per week in Fayetteville, GA

AlliedTravelCareers • Fayetteville, GA, US
[job_card.full_time]
AlliedTravelCareers is working with Titan Medical Group to find a qualified RRT in Fayetteville, Georgia, 30214!.Travel - Respiratory Therapist. Fayetteville, GA, United States.BCLS / BLS - American H...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Solutions Architect, Field Services

Solutions Architect, Field Services

Presidio Networked Solutions, LLC • Atlanta, GA, United States
[job_card.full_time]
SEIZE THE OPPORTUNITY TO BE A PART OF SOMETHING GREAT!.Presidio is on the leading edge of a technology-driven movement to transform the way business is done, for our customers and our customers' cu...[show_more]
[last_updated.last_updated_30] • [promoted]
Site CEO

Site CEO

Advanced Recovery Systems • STOCKBRIDGE, Georgia, US
[job_card.full_time]
We're looking for an experience and passionate Executive leader in the Atlanta market!.We put behavioral health front and center, providing assistance to people with substance abuse issues, addicti...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Real Estate Sales Agent

Real Estate Sales Agent

The Hester Group • Fayetteville, GA, US
[job_card.full_time]
If you’re ready to positively change your real estate career in 2025 and moving forward, you’re going to want to keep reading. The Hester Group has more leads than we can handle - it&rsq...[show_more]
[last_updated.last_updated_30] • [promoted]
Sr. Manager, Engineering

Sr. Manager, Engineering

OpenGov • Atlanta, GA, United States
[job_card.full_time]
OpenGov is the leader in AI and ERP solutions for local and state governments in the U.More than 2,000 cities, counties, state agencies, school districts, and special districts rely on the OpenGov ...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Validation Engineering Manager

Senior Validation Engineering Manager

OSI Engineering • Johns Creek, GA, US
[job_card.full_time]
Senior Validation Engineering Manager A leading chip and silicon IP provider is looking to hire a Validation Manager to join its Memory Interface Chip business unit. In this full-time role, you’ll c...[show_more]
[last_updated.last_updated_30] • [promoted]
SRE Architect

SRE Architect

Cortex consultants LLC • Atlanta, Georgia, USA
[job_card.full_time]
Job Title : Site Reliability Engineering (SRE) Architect.As an SRE Architect you will be a pivotal technical leader responsible for designing. SRE function defining standards blueprints and framework...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Manager Site Reliability Engineering

Manager Site Reliability Engineering

RELX • Alpharetta, GA, US
[job_card.full_time]
Are you an experienced site reliability engineering leader ready to shape strategy, inspire teams, and drive innovation at scale? Are you looking to lead a high-impact sre team where your leadershi...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

CD Newco LLC d / b / a Curve Dental • Alpharetta, Georgia, United States, 30009
[job_card.full_time]
At Flex Dental, we go beyond checking boxes; our integration and automation are unparalleled.Every feature serves a purpose, creating seamless collaboration with Open Dental’s practice management s...[show_more]
[last_updated.last_updated_30]
Site Reliability Engineer

Site Reliability Engineer

Donato Technologies, Inc • Atlanta, Georgia, USA
[job_card.full_time]
Senior Site Reliability Engineer.Manage and optimize data streaming and API components in OpenShift Onpremise and AWS.Proactively review the applications APIs and processes to identify opport...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

T-Mobile USA, Inc. • Atlanta, GA, United States
[job_card.full_time] +1
At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees get the same big love we give our customers. All team members receive a competitive base salary and compensation pack...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineering Architect

Site Reliability Engineering Architect

TechniPros • Atlanta, Georgia, USA
[job_card.full_time]
Job Title : Site Reliability Engineering (SRE) Architect.Location : Atlanta Georgia (Hybrid).As an SRE Architect you will be a pivotal technical leader responsible for designing building and evolvi...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
SRE Architect GOEDC5498339 (Atlanta)

SRE Architect GOEDC5498339 (Atlanta)

Compunnel Inc. • Atlanta, GA, United States
[job_card.full_time]
As an SRE Architect, you will be a pivotal technical leader responsible for designing, building, and evolving the foundational systems and practices that ensure the reliability, scalability, perfor...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Travel Registered Respiratory Therapist (RRT) - $1,535 to $1,735 per week in Fayetteville, GA

Travel Registered Respiratory Therapist (RRT) - $1,535 to $1,735 per week in Fayetteville, GA

AlliedTravelNetwork • Fayetteville, GA, US
[job_card.full_time]
AlliedTravelNetwork is working with Titan Medical Group to find a qualified RRT in Fayetteville, Georgia, 30214!.Travel - Respiratory Therapist. Fayetteville, GA, United States.RRT / BCLS / BLS - Americ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineering Manager (Alpharetta)

Site Reliability Engineering Manager (Alpharetta)

LexisNexis Risk Solutions • Alpharetta, GA, US
[job_card.part_time]
Are you an experienced Site Reliability Engineering leader ready to shape strategy, inspire teams, and drive innovation at scale?. Are you looking to lead a high-impact SRE team where your leadershi...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Cloud Infrastructure Site Reliability Engineer (SRE) (Alpharetta)

Cloud Infrastructure Site Reliability Engineer (SRE) (Alpharetta)

Intelliswift - An LTTS Company • Alpharetta, GA, United States
[job_card.full_time]
Job Posting Title : Cloud Infrastructure Site Reliability Engineer (SRE).Location : Alpharetta, GA or Berkeley Heights, NJ. As a Cloud Infrastructure Site Reliability Engineer (SRE) with expertise in ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]