Talent.com
Senior Site Reliability Engineer
Senior Site Reliability EngineerBraze • San Francisco
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Braze • San Francisco
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

At Braze, we have found our people. We’re a genuinely approachable, exceptionally kind, and intensely passionate crew.

We seek to ignite that passion by setting high standards, championing teamwork, and creating work-life harmony as we collectively navigate rapid growth on a global scale while striving for greater equity and opportunity – inside and outside our organization.

To flourish here, you must be prepared to set a high bar for yourself and those around you. There is always a way to contribute : Acting with autonomy, having accountability and being open to new perspectives are essential to our continued success.

Our deep curiosity to learn and our eagerness to share diverse passions with others gives us balance and injects a one-of-a-kind vibrancy into our culture.

If you are driven to solve exhilarating challenges and have a bias toward action in the face of change, you will be empowered to make a real impact here, with a sharp and passionate team at your back. If Braze sounds like a place where you can thrive, we can’t wait to meet you.

WHAT YOU'LL DO

Site Reliability Engineers (SREs) are responsible for keeping all internal-facing services and platforms running smoothly. In a nutshell, SREs ensure site uptime. SREs blend sensible system administrators and software engineers who apply sound engineering principles, operational discipline, and mature automation to the environments and infrastructure services we provide. We specialize in systems–whether it be networking, the Linux kernel, or some more specific interest in scaling–algorithms or distributed systems.

Our team helps to improve automation, infrastructure reliability, and empowers Braze’s other engineering teams to leverage the infrastructure products and platforms we create easily. Braze operates at a massive scale with over 3.3 billion monthly active users across our customers, collecting hundreds of billions of data points each month, and sending billions of messages to end-users daily. We use a diverse technology stack rooted in Ruby on Rails, MongoDB, Redis, Kafka, Kubernetes, and more. As a Senior Site Reliability Engineer at Braze, you will collaborate with your team and consumer engineering teams to continuously improve the infrastructure, automation, and tooling that build internal products from these technologies.

Main responsibilities :

  • Partner with Braze’s engineering teams on :
  • Architecting products to effectively utilize infrastructure platforms in a scalable, reliable manner
  • Debugging reliability and scalability issues across all stack layers, including the products built using our infrastructure platforms
  • Make monitoring and alerting alerts on symptoms and not on outages
  • Ensure that Braze meets our strict enterprise-grade SLAs with customers
  • Develop Braze’s internal platform infrastructure :
  • Create Infrastructure as code using Chef, Terraform, and Kubernetes
  • Develop deployment pipelines for applications in multiple languages using Docker, Kubernetes, etc
  • Provide centralized / common tooling, services, and automation frameworks that are critical for scaling operations, capacity management, reducing operational pain, and improving the day-to-day workflow of Braze’s engineering teams
  • Manage incidents :
  • Be on a PagerDuty rotation to respond to availability incidents and provide support for other engineers
  • Use your on-call shift to prevent incidents from ever happening
  • Retrospect everything that happens to turn lessons into system improvements / changes, automation, etc

WHO ARE YOU

  • 5+ years of experience as a Software, DevOps, or Site Reliability Engineer
  • You think about systems - interfaces, boundaries, edge cases, failure modes, behaviors, specific implementations
  • Have an urge to collaborate, document, and deliver quickly
  • Collaborating across the global remote teams, often working asynchronously
  • Document everything so you don't need to learn the same thing (or plan the same work) twice
  • Delivering fast to delight our customers - even internal ones
  • Have an enthusiastic, go-for-it attitude. When you see something broken, you can't help but fix it
  • Have a desire to solve everyday challenges facing software engineers and automate their toil away
  • Have an excellent ability to manage multiple tasks and expectations at once
  • Know your way around Linux and Unix Shell.
  • Have strong programming skills - Ruby and / or Go preferred
  • Have experience with Docker, Kubernetes, Terraform, or similar IaC technologies
  • Have experience with MongoDB, Redis, Kafka, Postgres, or similar data technologies
  • For candidates based in the United States, the pay range for this position at the start of employment is expected to be between $128,842 and $232,200 / year with an expected On Target Earnings (OTE) between $144,000 and $258,000 / year (including bonus or commission). Your exact offer may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. In addition to cash compensation, Braze offers full- and part- time employees a comprehensive Total Rewards package that includes equity grants of restricted stock (RSUs) so that all Braze employees own a piece of our company.

    #LI-Hybrid

    WHAT WE OFFER

    Braze benefits vary by location, and we encourage you to review our specific benefits offerings for each country . More details on benefits plans will be provided if you receive an offer of employment.

    From offering comprehensive benefits to fostering hybrid ways of working, we’ve got you covered so you can prioritize work-life harmony. Braze offers benefits such as :

  • Competitive compensation that may include equity
  • Retirement and Employee Stock Purchase Plans
  • Flexible paid time off
  • Comprehensive benefit plans covering medical, dental, vision, life, and disability
  • Family services that include fertility benefits and equal paid parental leave
  • Professional development supported by formal career pathing, learning platforms, and a yearly learning stipend
  • A curated in-office employee experience, designed to foster community, team connections, and innovation
  • Opportunities to give back to your community, including an annual company-wide Volunteer Week and donation matching
  • Employee Resource Groups that provide supportive communities within Braze
  • Collaborative, transparent, and fun culture recognized as a Great Place to Work®
  • ABOUT BRAZE

    Braze is the leading customer engagement platform that empowers brands to Be Absolutely Engaging.™ Braze helps brands deliver great customer experiences that drive value both for consumers and for their businesses. Built on a foundation of composable intelligence, BrazeAI™ allows marketers to combine and activate AI agents, models, and features at every touchpoint throughout the Braze Customer Engagement Platform for smarter, faster, and more meaningful customer engagement. From cross-channel messaging and journey orchestration to Al-powered decisioning and optimization, Braze enables companies to turn action into interaction through autonomous, 1 : 1 personalized experiences.

    The company has repeatedly been recognized as a Leader in marketing technology by industry analysts, and was voted a G2 “Best of Marketing and Digital Advertising Software Product” in 2025.

    Braze was also named a 2025 Best Companies To Work For by U.S. News & World Report, a 2025 America’s Greatest Companies by Newsweek, and a 2025 Fortune Best Workplace in Technology™ by Great Place To Work®, among other accolades. Braze is also proudly certified as a Great Place to Work® in the U.S., the UK, Australia, and Singapore.

    The company is headquartered in New York with offices in Austin, Berlin, Bucharest, Chicago, Dubai, Jakarta, London, Paris, San Francisco, São Paulo, Singapore, Seoul, Sydney and Tokyo.

    BRAZE IS AN EQUAL OPPORTUNITY EMPLOYER

    At Braze, we strive to create equitable growth and opportunities inside and outside the organization.

    Building meaningful connections is at the heart of everything we do, and that includes our recruiting practices. We're committed to offering all candidates a fair, accessible, and inclusive experience – regardless of age, color, disability, gender identity, marital status, maternity, national origin, pregnancy, race, religion, sex, sexual orientation, or status as a protected veteran. When applying and interviewing with Braze, we want you to feel comfortable showcasing what makes you you.

    We know that sometimes different circumstances can lead talented people to hesitate to apply for a role unless they meet 100% of the criteria. If this sounds familiar, we encourage you to apply, as we’d love to meet you.

    Please see our for more information on how Braze processes your personal information during the recruitment process and, if applicable based on your location, how you can exercise any privacy rights.

    [job_alerts.create_a_job]

    Senior Site Reliability Engineer • San Francisco

    [internal_linking.similar_jobs]
    Site Reliability Engineer

    Site Reliability Engineer

    gamma.app • San Francisco, CA, United States
    [job_card.full_time]
    We're building the creative layer for modern communication.Every month, over a billion people make presentations — but the tools they use to make them haven't evolved in decades.We're changing that...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Site Reliability Engineer - Storage

    Site Reliability Engineer - Storage

    xAI • San Francisco, CA, United States
    [job_card.full_time]
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Site Reliability Engineer

    Site Reliability Engineer

    Mercor • San Francisco, CA, United States
    [job_card.full_time]
    Mercor is at the intersection of labor markets and AI research.We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.Our vast talent network ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior / Staff Site Reliability Engineer

    Senior / Staff Site Reliability Engineer

    Circle • San Francisco, CA, United States
    [job_card.full_time]
    Senior Site Reliability Engineer.Circle (NYSE : CRCL) is one of the world’s leading internet financial platform companies, building the foundation of a more open, global economy through digital asse...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    Redwood Materials, Inc. • San Francisco, CA, United States
    [job_card.full_time]
    Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling—keeping critical minerals in circulation and driving the energy transition.Founded in 2...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer - Site Reliability

    Senior Software Engineer - Site Reliability

    Ironclad Inc. • San Francisco, CA, United States
    [job_card.full_time]
    Every dollar earned, relationship formed, and advantage gained comes down to the contract that makes it real.But getting a contract done is more complicated than it should be.And when contract data...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Sr / Staff Site Reliability Engineer, Consumer Apps

    Sr / Staff Site Reliability Engineer, Consumer Apps

    Attain • Redwood City, CA, United States
    [job_card.permanent]
    Prospectus is pleased to partner with our client to find an Executive & Governance Assistant.The charity is a leading occupational benevolent organisation supporting individuals and families who ar...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Alembic Technologies • San Francisco, CA, United States
    [job_card.full_time]
    Senior Site Reliability Engineer.This range is provided by Alembic Technologies.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.We’re looking fo...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ZetaChain • San Francisco, CA, United States
    [job_card.full_time]
    We're building something ambitious at ZetaChain : the first universal blockchain and AI platform that connects everything—Bitcoin, Ethereum, Solana, and more—while pioneering in the GenAI space.We'r...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Reliability Engineer

    Senior Reliability Engineer

    Lunar Energy • San Francisco, CA, United States
    [job_card.full_time]
    Reliability Engineers at Lunar Energy will be responsible for ensuring product reliability throughout the entire lifecycle of our revolutionary home energy products. This includes providing input du...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Site Reliability Engineer — ML Cloud & Observability

    Site Reliability Engineer — ML Cloud & Observability

    Anyscale • San Francisco, CA, United States
    [job_card.full_time]
    A technology company is seeking a Site Reliability Engineer in San Francisco, CA.The role involves ensuring the smooth operation of user-facing services, building monitoring systems, and establishi...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior SRE : Scale Reliability, Observability & CI / CD

    Senior SRE : Scale Reliability, Observability & CI / CD

    Breakout Tools • San Francisco, CA, United States
    [job_card.full_time]
    A tech startup in San Francisco is looking for Site Reliability Engineers to enhance system reliability and performance.Ideal candidates have over 5 years of relevant experience and strong expertis...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Hive • San Francisco, CA, United States
    [job_card.full_time]
    Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Site Reliability Engineer

    Site Reliability Engineer

    Writemed • San Francisco, CA, United States
    [job_card.full_time]
    Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior / Staff Site Reliability Engineer – Cloud-Fintech Infra

    Senior / Staff Site Reliability Engineer – Cloud-Fintech Infra

    Attaindata • Redwood City, CA, United States
    [job_card.full_time]
    A leading fintech platform is seeking a Senior / Staff Site Reliability Engineer.The role involves designing and maintaining infrastructure, automating processes, and collaborating with engineering t...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Software Engineer, Site ReliabilityNew York, NY; Redwood City, CA

    Software Engineer, Site ReliabilityNew York, NY; Redwood City, CA

    Fireworks AI • Redwood City, CA, United States
    [job_card.full_time]
    Senior Site Reliability Engineer.At Fireworks, we're building the future of generative AI infrastructure.Fireworks offers the generative AI platform with the highest-quality models and the fastest,...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Site Reliability Engineer, Government

    Site Reliability Engineer, Government

    Recruiting From Scratch • San Francisco, CA, United States
    [job_card.full_time]
    Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients.Our team is 100% remote and we work with teams across North America, South America, and Europe to...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior+ Site Reliability Engineer

    Senior+ Site Reliability Engineer

    Crusoe Energy Systems LLC • San Francisco, CA, United States
    [job_card.full_time]
    Crusoe's mission is to accelerate the abundance of energy and intelligence.We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, spe...[show_more]
    [last_updated.last_updated_30] • [promoted]