Talent.com
Director of Site Reliability Engineering
Director of Site Reliability EngineeringJobot • Houston, TX, US
Director of Site Reliability Engineering

Director of Site Reliability Engineering

Jobot • Houston, TX, US
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

This Jobot Job is hosted by : Merwan Zattam

Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume.

Salary : $220,000 - $260,000 per year

A bit about us :

We are a mission-driven organization dedicated to making AI adoption safe and secure for enterprises worldwide. As the leading provider of Security for AI, our platform protects agentic, generative, and predictive AI applications across the entire lifecycle—safeguarding intellectual property, ensuring compliance, and enabling organizations to innovate with confidence.

Our team was founded by cybersecurity and machine learning veterans who experienced a real adversarial AI attack firsthand. That moment led to the creation of a new category focused entirely on protecting machine learning systems from threats such as prompt injection, adversarial manipulation, model theft, and supply chain compromise.

Backed by strategic investors including Microsoft’s Venture Fund (M12), Moore Strategic Ventures, Booz Allen Ventures, IBM Ventures, and Capital One Ventures, we combine patented technology with industry-leading research to defend the world’s most critical AI systems.

Recognized by Gartner as a “Cool Vendor for AI Security” and trusted by Fortune 500 organizations, government agencies, and enterprises across highly regulated industries, we are shaping the future of AI security in real time. With strong product–market fit and rapid growth, this is an opportunity to join a generational company at a true inflection point—where the mission is bold, the bar is high, and the room for impact and growth is unmatched.

Why join us?

Top Benefits of Working Here

  • Be part of a new, fast-growing category

Work at the forefront of AI security, an emerging space with massive demand and almost no competition.

  • High-impact mission
  • Your work protects mission-critical AI systems for Fortune 500 companies, government agencies, and regulated industries.

  • Cutting-edge engineering
  • Tackle challenges in AI / ML security, adversarial defense, model protection, and large-scale distributed systems.

  • Backed by top-tier investors
  • Strong funding and stability from groups like Microsoft’s venture fund, IBM Ventures, and others.

  • Build from the ground up
  • Shape the SRE, platform, and reliability culture—this is not a legacy environment.

  • High autonomy & ownership
  • Influence roadmap, architecture, tooling, and direction. Your work is visible and meaningful.

  • Fully remote, U.S.-based
  • Flexibility, work-life balance, and a high-performance culture.

  • Competitive pay + real equity upside
  • Top-tier compensation with equity at a company in a hyper-growth phase.

  • Elite team & steep career growth
  • Collaborate with seasoned leaders in cybersecurity, ML, and enterprise infrastructure—and grow as the company grows.

    Job Details

    Director of Site Reliability Engineering

    Remote – United States

    We are seeking a Director of Site Reliability Engineering to lead the broader Platform Engineering organization with a strategic focus on building a world-class SRE function. Reporting to the VP of Engineering, you will be responsible for the reliability, scalability, and operational excellence of the mission-critical AI security platform used by enterprises and government organizations worldwide.

    In this senior leadership role, you will define the SRE strategy, mentor and scale a high-performing team, and implement the systems, practices, and culture required to support rapid growth. You will work at the intersection of cutting-edge AI security technology and enterprise-grade infrastructure, ensuring the platform delivers the always-on performance our customers depend on.

    Your work will directly strengthen the security posture of organizations protecting their most valuable AI assets—from financial institutions and healthcare providers to government and Fortune 500 enterprises.

    What You’ll Do

    Build and Lead the SRE Function

    Define and execute the SRE strategy and roadmap, positioning reliability as a core product feature

    Build, mentor, and scale a high-performing SRE and Platform Engineering team

    Establish SRE principles, culture, and best practices across engineering

    Create clear career development paths and raise the bar for hiring and excellence

    Drive Platform Reliability & Operational Excellence

    Own reliability, availability, latency, and performance across multi-cloud, multi-region deployments (AWS, Azure, GCP)

    Set and achieve SLOs / SLIs aligned with business objectives

    Architect multi-region resiliency : automated failover, graceful degradation, and disaster recovery

    Build robust observability : distributed tracing, metrics, logging, and actionable alerting

    Lead incident management : on-call processes, incident command, blameless post-mortems, and systematic remediation

    Enable Developer Velocity & Platform Excellence

    Own CI / CD pipelines and deployment infrastructure for safe, fast, reliable delivery

    Build internal developer platforms and tooling that reduce toil and improve productivity

    Implement progressive delivery (canaries, feature flags, automated rollbacks)

    Partner with engineering teams to embed reliability requirements and design patterns early in development

    Security, Compliance & Enterprise Requirements

    Ensure alignment with standards such as FedRAMP, SOC 2, ISO 27001, and other regulatory requirements

    Build and support air-gapped and on-premises deployment capabilities

    Implement infrastructure security controls, secrets management, and audit logging

    Support customer-facing SLAs and maintain trust with enterprise and government clients

    Scale & Optimize the Platform

    Lead capacity planning and performance engineering for platform growth

    Drive chaos engineering and resilience testing to validate system behavior under failure

    Optimize cost while maintaining reliability and performance

    Automate operational workflows to eliminate toil and improve efficiency

    What You Bring

    Leadership & Experience

    8+ years in infrastructure, platform engineering, or SRE roles

    4+ years in engineering leadership

    Experience supporting mission-critical, always-on systems at enterprise scale

    Strong people leadership and a track record of building high-performing teams

    Technical Expertise

    Deep knowledge of cloud infrastructure (AWS, Azure, GCP) and multi-region systems

    Strong experience with Kubernetes, Docker, and infrastructure-as-code (Terraform, Pulumi, CloudFormation)

    Proven ability to build and operate large-scale distributed systems

    Expertise in observability tooling (Prometheus, Grafana, Datadog, New Relic, ELK / EFK, distributed tracing)

    Proficiency in Python, Go, or similar languages

    Understanding of databases, data pipelines, message queues, and caching systems

    Strategic & Operational Skills

    Experience driving SRE strategy, SLOs / SLIs, error budgets, and incident management

    Ability to partner across engineering, product, security, and customer success

    Strong communication skills across technical and non-technical audiences

    Pragmatic problem-solving and sound decision-making

    Bonus Experience

    Background in cybersecurity or AI / ML infrastructure

    Familiarity with compliance frameworks (FedRAMP, SOC 2, ISO 27001, NIST)

    Experience supporting air-gapped or on-premise deployments

    Hands-on experience with chaos engineering and game day exercises

    Open-source contributions or SRE community leadership

    Why This Opportunity Stands Out

    Impact : Define reliability strategy for a category-leading AI security platform

    Growth : Build and scale the SRE function from the ground up in a fast-growing, well-funded environment

    Mission : Work on technology that is shaping the future of secure AI adoption

    Team : Join a world-class engineering organization with deep roots in security, ML, and distributed systems

    Innovation : Solve novel problems at the intersection of AI, security, and infrastructure

    Flexibility : Fully remote role with competitive compensation, equity, and benefits

    Location & Work Environment

    This is a fully remote position within the United States. We value flexibility, ownership, collaboration, and excellence. The team operates across time zones with a blend of async communication, regular syncs, and purposeful in-person gatherings.

    Equal Opportunity

    We are an equal opportunity employer and do not discriminate based on race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, veteran status, or any legally protected status. We are committed to fostering an inclusive environment where all team members can thrive.

    If you need accommodations during the application or interview process, please let us know.

    Interested in hearing more? Easy Apply now by clicking the "Apply Now" button.

    Jobot is an Equal Opportunity Employer. We provide an inclusive work environment that celebrates diversity and all qualified candidates receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, age (40 and over), disability, military status, genetic information or any other basis protected by applicable federal, state, or local laws. Jobot also prohibits harassment of applicants or employees based on any of these protected categories. It is Jobot’s policy to comply with all applicable federal, state and local laws respecting consideration of unemployment status in making hiring decisions.

    Sometimes Jobot is required to perform background checks with your authorization. Jobot will consider qualified candidates with criminal histories in a manner consistent with any applicable federal, state, or local law regarding criminal backgrounds, including but not limited to the Los Angeles Fair Chance Initiative for Hiring and the San Francisco Fair Chance Ordinance.

    Information collected and processed as part of your Jobot candidate profile, and any job applications, resumes, or other information you choose to submit is subject to Jobot's Privacy Policy, as well as the Jobot California Worker Privacy Notice and Jobot Notice Regarding Automated Employment Decision Tools which are available at jobot.com / legal.

    By applying for this job, you agree to receive calls, AI-generated calls, text messages, or emails from Jobot, and / or its agents and contracted partners. Frequency varies for text messages. Message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You can reply STOP to cancel and HELP for help. You can access our privacy policy here : jobot.com / privacy-policy

    [job_alerts.create_a_job]

    Director of Site Reliability Engineering • Houston, TX, US

    [internal_linking.similar_jobs]
    Director of Advanced Engineering Solutions

    Director of Advanced Engineering Solutions

    CareerBliss • Houston, TX, US
    [job_card.full_time]
    High Growth + International + Make your mark + Own Strategy + Incredible Leadership.This Jobot Job is hosted by : Chelsea Piekarski. Are you a fit? Easy Apply now by clicking the "Apply" button and s...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Director of Engineering

    Director of Engineering

    The Falcon Group • Houston, TX, US
    [job_card.full_time]
    [filters_job_card.quick_apply]
    WHO WE ARE At The Falcon Group, our mission is to deliver exceptional service across a broad range of industries, including multifamily, industrial, commercial, retail, hospitality, healthcare, and...[show_more]
    [last_updated.last_updated_variable_days]
    Remote Senior Director, Heavy Industry Market NA

    Remote Senior Director, Heavy Industry Market NA

    Hitachi Energy • Houston, TX, United States
    [filters.remote]
    [job_card.full_time]
    A global energy solutions company is seeking a Senior Director for the Heavy Industry Market in North America.This remote role involves developing innovative marketing strategies for Transformers p...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Operations Director

    Operations Director

    BGT Interior Solutions • HOUSTON, TX, US
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Job Summary BGT’s Operations Director will oversee the planning, directing, and coordinating of all material by creating and enforcing Standard Operating Procedures (SOPs).During the producti...[show_more]
    [last_updated.last_updated_30]
    Reliability & Quality Systems Manager

    Reliability & Quality Systems Manager

    Scientific Drilling Inc. • Houston, TX, USA
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Scientific Drilling is looking for a.Reliability & Quality Systems Manager .Scientific Drilling is an independent directional drilling and wellbore navigation, surveying and logging Servic...[show_more]
    [last_updated.last_updated_30]
    Site Safety Manager

    Site Safety Manager

    Qcells • Houston, TX, US
    [filters.remote]
    [job_card.full_time] +1
    Hanwha Qcells USA Corp (Qcells USA), headquartered in Irvine, CA, specializes in providing utility-scale modules, solar photovoltaic (PV), and battery energy storage systems (BESS) project developm...[show_more]
    [last_updated.last_updated_variable_days]
    Remote Senior Director Utility Market NA - Growth Leader

    Remote Senior Director Utility Market NA - Growth Leader

    Hitachi Vantara Corporation • Houston, TX, United States
    [filters.remote]
    [job_card.full_time]
    A leading technology company is seeking a Senior Director for the Utility Market in North America.This remote role involves spearheading marketing and sales strategies for transformers products, op...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Maintenance Reliability Engineer

    Maintenance Reliability Engineer

    Vallourec Star LP • Houston, Texas, US
    [job_card.full_time]
    Job Description Job Description POSITION SUMMARY : Apply Engineering concepts to the optimization of equipment, procedures, and departmental budgets to achieve better Maintainability and Reliabili...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Site Reliability Engineer (IB4)

    Site Reliability Engineer (IB4)

    Foxconn Industrial Internet - FII • Houston, TX, US
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Responsible for supporting production Design systems and processes and support implementation User training and system issue tracking Learn about MES and manufacturing production Design control sys...[show_more]
    [last_updated.last_updated_30]
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Bright Vision Technologies • Houston, TX, US
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Site Reliability Engineer (SRE) Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize t...[show_more]
    [last_updated.last_updated_variable_days]
    Remote Senior Director Utility Market Growth & Strategy

    Remote Senior Director Utility Market Growth & Strategy

    Hitachi Energy • Houston, TX, United States
    [filters.remote]
    [job_card.full_time]
    A leading multinational energy company is seeking a Senior Director Utility Market for North America.This remote role entails spearheading marketing and sales strategies for Transformer products wi...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Electrical Construction Director of Safety (Houston, TX area)

    Electrical Construction Director of Safety (Houston, TX area)

    Gpac • South Houston, Texas, United States
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Commercial Electrical Construction Safety Director.Safety (EHS) Manager or Director .Great company to work with that has a excellent growth opportunities in the local .Projects focus...[show_more]
    [last_updated.last_updated_30]
    Site Lead

    Site Lead

    BB&E Inc. • Houston, Texas, United States
    [job_card.full_time]
    BB&E is an employee-owned full service civil and environmental engineering and consulting firm, headquartered in Northville, Michigan, which services both the Federal and Industrial sectors through...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Executive Director, Dialysis Services - San Antonio, TX (Relocation Assistance Available)

    Executive Director, Dialysis Services - San Antonio, TX (Relocation Assistance Available)

    University Health • HOUSTON, Texas, United States
    [job_card.full_time]
    At University Health, we are dedicated to improving the health of our community through exceptional patient care, education, and innovation. Our team embodies a strong commitment to excellence, and ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Executive Director, Dialysis Services - San Antonio, TX (Relocation Assistance Available) - Lead an Innovative Healthcare Team (HOUSTON)

    Executive Director, Dialysis Services - San Antonio, TX (Relocation Assistance Available) - Lead an Innovative Healthcare Team (HOUSTON)

    University Health • Houston, TX, US
    [job_card.full_time]
    At University Health, we are dedicated to improving the health of our community through exceptional patient care, education, and innovation. Our team embodies a strong commitment to excellence, and ...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Site Safety Manager

    Site Safety Manager

    Sendero Energy Services • Houston, TX, US
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Site Safety Manager – Renewables / Heavy Civil Construction Company : Sendero Energy Services Headquarters : Houston, TX Position Type : Full-Time About Sendero Energy Services : Sendero Energy S...[show_more]
    [last_updated.last_updated_variable_days]
    Director of Wellness

    Director of Wellness

    Aetherflux • Houston, Texas, US
    [job_card.full_time]
    Job Description Job Description Location : Remote Company : Aetherflux About Aetherflux Aetherflux is building an American power grid in space—advancing AI compute in orbit and delivering solar en...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Site Manager PV – Dallas, Texas, USA

    Site Manager PV – Dallas, Texas, USA

    SEYSES Ibérica SL • Houston, Texas, United States
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Make an impact in the Renewable Energy sector.SEYSES is an international engineering and construction company specialized in renewable energy, with offices in Spain, the United States, Mexico, Chil...[show_more]
    [last_updated.last_updated_variable_days]