Talent.com
Staff Site Reliability Engineer
ASCENDING • Fairfax, VA, US

Job Description

U.S. citizenship required / No clearance needed / 100% remote within the US / Eastern Time Zone

Staff Site Reliability Engineer / Cloud SME

Location: 100% remote in the continental US

Type: Long-term contract (3+ years)

Role Summary

As the Staff SRE / Cloud SME, you will be a critical technical leader driving the rearchitecting of our existing monolithic system into a resilient, cloud-native architecture. This role requires deep expertise across multiple cloud platforms (Azure and AWS) and container orchestration (Kubernetes) to ensure the next-generation platform meets the highest standards of scalability, reliability, and security.

Key Responsibilities

Architecture & Transformation Leadership

  • Lead the technical rearchitecting efforts, transforming a large-scale monolithic system into a modern microservices-based, cloud-native application.
  • Collaborate with cross-functional teams (Engineering, Architecture, Product) to define and implement the new system architecture using domain-driven design (DDD) principles.
  • Conduct technology evaluations and provide recommendations for new tools, frameworks, and cloud services to enhance our infrastructure.

Reliability Engineering & Cloud Operations

  • Utilize Kubernetes (K8s) for container orchestration and management, ensuring extreme scalability, reliability, and high availability of the system.
  • Implement robust, highly resilient, and highly available components for the system.
  • Develop and implement comprehensive monitoring, logging, and alerting mechanisms to ensure optimal system performance and availability.
  • Drive the adoption of DevOps principles and practices throughout the software development lifecycle, ensuring seamless integration and continuous deployment processes.

Technical Expertise & Mentorship

  • Stay up-to-date with emerging technologies, frameworks, and industry trends related to systems and cloud computing.
  • Mentor and provide technical guidance to junior team members, fostering a culture of continuous learning and professional growth.

Required Qualifications

  • Cloud Platforms: 7+ years of experience with cloud computing platforms. Strong multi-cloud expertise required with AWS and Azure.
  • Cloud-Native Transformation: 7+ years of experience in rearchitecting large-scale monolithic applications to cloud-native architectures.
  • Container Orchestration: Strong expertise in Kubernetes (K8s) is required, including hands-on experience with both AKS (Azure Kubernetes Service) and EKS (Elastic Kubernetes Service).
  • Networking: Strong experience with cloud networking, with the ability to design and resolve complex cloud networking architecture problems.
  • IaC: Expert knowledge of Terraform for infrastructure-as-code deployment and management.
  • Security: Must possess strong knowledge of security best practices for containers and Kubernetes clusters.
  • Education: Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
  • Bonus Knowledge: Knowledge of load-balancing algorithms.

Thanks for applying!
