Talent.com
Site Reliability Engineering (SRE) Architect
Site Reliability Engineering (SRE) ArchitectQTech • Atlanta, Georgia, USA
Site Reliability Engineering (SRE) Architect

Site Reliability Engineering (SRE) Architect

QTech • Atlanta, Georgia, USA
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Job Title : Site Reliability Engineering (SRE) Architect

Location : Atlanta Georgia (Hybrid)

Long Term Contract

Looking for W2 Candidates. No C2C

Job Discription :

As an SRE Architect you will be a pivotal technical leader responsible for designing building and evolving the foundational systems and practices that ensure the reliability scalability performance and efficiency of our critical services. Moving beyond day-to-day operations you will focus on the strategic architectural direction of SRE function defining standards blueprints and frameworks that enable development teams and fellow SRE operations team to build and operate highly resilient systems. Leverage deep expertise in software engineering distributed systems cloud infrastructure and SRE principles to influence technology choices establish best practices and foster a proactive culture of reliability across the organization and much beyond observability pillar.

Key Responsibilities :

1. Reliability Strategy & Design :

o Architect and design highly available scalable secure and cost-effective infrastructure and application patterns on AWS

o Define and evangelize SRE best practices standards and blueprints for service design deployment monitoring and operational readiness across the engineering organization

o Review current observability implementation to identify gaps and define steps to reach next level maturity of observability setup to provide deep insights into system health and behaviour

o With overall maturity lead the definition and implementation strategy for Service Level Indicators (SLIs) Service Level Objectives (SLOs) and Error Budgets for critical services

2. Platform Architecture & Automation :

o Design solutions to systematically reduce operational toil through automation and improved system design

o Evaluate current SRE tools and automation frameworks (e.g. CI / CD pipelines Infrastructure as Code modules automated incident remediation chaos engineering platforms) and suggest enhancement that will help overall enhancement of capability

o Evaluate prototype and recommend new technologies tools and methodologies to enhance system reliability developer productivity and operational efficiency

3. Technical Leadership & Consultation :

o Act as a senior technical advisor and subject matter expert on reliability scalability and performance for development and platform teams

o Provide architectural guidance during the design phase of new services and features to ensure reliability principles are embedded early (shift-left)

o Mentor and coach other SREs and engineers fostering technical excellence and adherence to SRE principles

o Lead architectural reviews and production readiness assessments for critical systems

4. Resilience :

o Lead blameless postmortems for significant incidents ensuring root causes are identified and systemic architectural improvements are prioritized and implemented

o Architect and advocate for resilience patterns (e.g. circuit breaking rate limiting graceful degradation chaos engineering) within applications and infrastructure

Required Qualifications :

Proven experience in an architectural role designing solutions for reliability scalability and performance

Deep understanding and practical application of SRE principles (SLIs / SLOs error budgets toil reduction automation incident management postmortems)

Expertise in cloud computing platforms (e.g. AWS) including infrastructure networking and security services

Strong experience with containerization and orchestration technologies (Kubernetes Docker serverless computing)

Solid experience designing and implementing observability solutions (e.g. Dynatrace Prometheus Grafana ELK / EFK Stack Jaeger OpenTelemetry)

Strong programming / scripting skills (e.g. Python Go Bash) for automation and tool development

Excellent analytical problem-solving and strategic thinking skills.

Strong communication collaboration and leadership skills with the ability to influence technical direction across teams

Preferred Qualifications :

Experience designing and implementing chaos engineering practices and platforms

Best Regards

Tarun K

Phone : 1-

Email : Key Skills

Fashion Retail,Highway Design,Apache Web Server,Atl,CAD CAM,ABAP

Employment Type : Full Time

Experience : years

Vacancy : 1

[job_alerts.create_a_job]

Site Reliability Sre • Atlanta, Georgia, USA

[internal_linking.similar_jobs]
Site Reliability Engineer

Site Reliability Engineer

VirtualVocations • Alpharetta, Georgia, United States
[job_card.full_time]
A company is looking for a Site Reliability Engineer to join a dynamic Cloud Services team in a fully remote role.Key Responsibilities Act as a subject matter expert in cloud technologies, guidin...[show_more]
[last_updated.last_updated_30] • [promoted]
SRE Architect

SRE Architect

Blue Ribbon Global Technologies • Atlanta, Georgia, USA
[job_card.full_time] +1
Client : Xebia / Delta Airlines.As an SRE Architect you will be a pivotal technical leader responsible for designing building and evolving the foundational systems and practices that ensure the reli...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Cloudious LLC • Atlanta, Georgia, USA
[job_card.full_time]
Senior Site Reliability Engineer.Manage and optimize data streaming and API components in OpenShift Onpremise and AWS.Proactively review the applications APIs and processes to identify opportunitie...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Canonical • Atlanta, GA, US
[job_card.full_time]
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...[show_more]
[last_updated.last_updated_30] • [promoted]
Lead Site Reliability Engineer - Federal Team

Lead Site Reliability Engineer - Federal Team

Saviynt • Atlanta, GA, US
[job_card.full_time]
Saviynt is an identity authority platform built to power and protect the world at work.In a world of digital transformation, where organizations are faced with increasing cyber risk but cannot affo...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Matlen Silver • Alpharetta, Georgia, United States
[job_card.temporary]
Title : Senior Cloud Security Engineer / Architect.Locations : Alpharetta, GA, Columbus, OH, Berkeley Heights, NJ, Frisco, TX. Duration : 6 month contract to hire.Conversion salary : $150k-$188k.Due to cl...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
SRE Architect

SRE Architect

Cortex consultants LLC • Atlanta, Georgia, USA
[job_card.full_time]
Job Title : Site Reliability Engineering (SRE) Architect.As an SRE Architect you will be a pivotal technical leader responsible for designing. SRE function defining standards blueprints and framework...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Manager Site Reliability Engineering

Manager Site Reliability Engineering

RELX • Alpharetta, GA, US
[job_card.full_time]
Are you an experienced site reliability engineering leader ready to shape strategy, inspire teams, and drive innovation at scale? Are you looking to lead a high-impact sre team where your leadershi...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

CD Newco LLC d / b / a Curve Dental • Alpharetta, Georgia, United States, 30009
[job_card.full_time]
At Flex Dental, we go beyond checking boxes; our integration and automation are unparalleled.Every feature serves a purpose, creating seamless collaboration with Open Dental’s practice management s...[show_more]
[last_updated.last_updated_30]
Site Reliability Engineering Manager (Alpharetta)

Site Reliability Engineering Manager (Alpharetta)

LexisNexis Risk Solutions • Alpharetta, GA, United States
[job_card.full_time]
Are you an experienced Site Reliability Engineering leader ready to shape strategy, inspire teams, and drive innovation at scale?. Are you looking to lead a high-impact SRE team where your leadershi...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Principal Site Reliability Engineer

Principal Site Reliability Engineer

Priority Technology Holdings, LLC • Alpharetta, GA, US
[job_card.full_time]
Job title : Principal Site Reliability Engineer.Reports to : Director, Site Reliability Engineering.Location : Alpharetta, GA or Remote. Priority Technology Holdings, Inc.Our vision is to eliminate the...[show_more]
[last_updated.last_updated_30] • [promoted]
Travel Registered Respiratory Therapist (RRT) - $2,002 per week in Fayetteville, GA

Travel Registered Respiratory Therapist (RRT) - $2,002 per week in Fayetteville, GA

AlliedTravelNetwork • Fayetteville, GA, US
[job_card.full_time]
AlliedTravelNetwork is working with Care Career to find a qualified RRT in Fayetteville, Georgia, 30214!.Respiratory therapists interview and examine patients with breathing or cardiopulmonary diso...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

T-Mobile USA, Inc. • Atlanta, GA, United States
[job_card.full_time] +1
At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees get the same big love we give our customers. All team members receive a competitive base salary and compensation pack...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
DevOps - Site Reliability Engineer ( SRE)

DevOps - Site Reliability Engineer ( SRE)

Resource Informatics Group Inc • Atlanta, GA, US
[job_card.full_time]
Role : Site Reliability Engineer.This Software Engineer will be part of the Site Reliability Engineering (SRE) team.The SRE team is an innovative team devoted to providing automated solutions and se...[show_more]
[last_updated.last_updated_30] • [promoted]
Cloud Solutions Architect (Remote)

Cloud Solutions Architect (Remote)

Scale AI • Atlanta, Georgia, United States
[filters.remote]
[job_card.full_time]
Join a global community of talented professionals to shape the future of AI.Earn up to $15 USD / hr and additional rewards based on quality of submission. Outlier is committed to improving the intelli...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Site Reliability Engineering Architect

Site Reliability Engineering Architect

TechniPros • Atlanta, Georgia, USA
[job_card.full_time]
Job Title : Site Reliability Engineering (SRE) Architect.Location : Atlanta Georgia (Hybrid).As an SRE Architect you will be a pivotal technical leader responsible for designing building and evolvi...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
SRE Architect GOEDC5498339 (Atlanta)

SRE Architect GOEDC5498339 (Atlanta)

Compunnel Inc. • Atlanta, GA, United States
[job_card.full_time]
As an SRE Architect, you will be a pivotal technical leader responsible for designing, building, and evolving the foundational systems and practices that ensure the reliability, scalability, perfor...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Travel Registered Respiratory Therapist (RRT) - $1,937 per week in Fayetteville, GA

Travel Registered Respiratory Therapist (RRT) - $1,937 per week in Fayetteville, GA

AlliedTravelCareers • Fayetteville, GA, US
[job_card.full_time]
Registered Respiratory Therapist.Windsor Healthcare Recruitment Group, Inc.AlliedTravelCareers is working with Windsor Healthcare Recruitment Group, Inc. RRT in Fayetteville, Georgia, 30214!.UNIT DE...[show_more]
[last_updated.last_updated_variable_days] • [promoted]