Site Reliability EngineerPrimer • San Francisco, California, United States

Site Reliability Engineer

Primer • San Francisco, California, United States

[job_card.30_days_ago]

[job_preview.job_type]

[job_card.full_time]

[job_card.job_description]

Primer helps B2B products break out of the B2C-centric marketing box. Our platform turns consumer ad channels, data streams, and emerging AI workflows into measurable growth engines for go-to-market teams. We ingest billions of rows from first- and third-party sources, map them to rich company context, and surface hyper-targeted audiences and real-time performance alerts—all without vendor lock-in.

That only works if the lights stay

on , queries stay

fast , and incidents stay

rare . That’s where you come in.

As our first dedicated

Site Reliability Engineer , you’ll be the force multiplier who designs, builds, and operates the infrastructure that powers everything : petabyte-scale data pipelines, LLM-backed services, and the APIs our customers (and engineers!) rely on every day. You’ll pair hard-won ops experience with a mentor’s mindset—levelling up the whole team while keeping us four steps ahead of failure.

YOUR MISSION

Own reliability from design to customer.

Define and uphold SLOs / SLIs, manage error budgets, and lead blameless post-mortems.

Automate toil out of existence—CI / CD, infra-as-code, capacity planning, and chaos testing.

Drive incident response end-to-end : detection, mitigation, root-cause analysis, and long-term fixes.

Scale multi-cloud data pipelines (Prefect, ClickHouse, Iceberg) and GPU / LLM workloads.

Teach best practices, review designs, and coach engineers so reliability becomes a team sport.

WHAT YOU’LL DO

Design, implement, and tune distributed systems that handle

high-throughput B2B traffic .

Harden our AWS stack with IaC (e.g. Terraform)

Instrument everything—logs, traces, metrics, and AI-powered anomaly detection.

Champion security, cost optimization, and disaster-recovery strategies.

Jump into the weeds when something breaks, fix it fast, then automate it away.

WHAT YOU’LL BRING

Must-Haves

5+ years owning production systems at meaningful scale (sub-second latency, “four-nines” targets).

Mastery of SRE fundamentals : SLO / SLI design, error budgets, incident playbooks.

Deep hands-on with Linux, networking, containers / K8s, and at least one major cloud (AWS / GCP / Azure).

Proven track record automating infra with Terraform, Helm, or similar IaC tooling.

Fluency in at least one systems / scripting language (Go, Python, Rust, etc.).

Experience operating complex data pipelines (Prefect, Airflow, Temporal) or real-time streaming systems.

History of mentoring engineers and embedding reliability culture across teams.

Pragmatic decision-maker—balances uptime, velocity, and cost for startup reality.

Curiosity for AI-augmented ops (LLM chat-ops, anomaly detection, self-healing).

Nice-to-Haves

Managed GPU clusters and ML inference workloads.

Operated data lakes / lakehouses at scale (Iceberg, Delta, etc.).

Meaningful open-source contributions in SRE, DevOps, or data-infra projects.

WHY PRIMER

Mission with impact

– We’re unlocking new growth channels for thousands of B2B marketers.

High-trust, low-ego culture

– Fully distributed team, meeting-light weeks, Friday focus days.

Work & life, balanced

– Five weeks PTO, generous parental leave, and flexibility for families.

Career rocket-fuel

– Small team, huge problems, real ownership. Shape the future with bold innovators, driving impact that redefines industries.

Diverse & global

– Teammates span six countries—and counting.

Intro Call with Engineering Manager

– 30 min

System Design

– 60 min

Operational Excellence Drill-down

– 60 min

Strategic Pragmatism Chat with CTO

– 45 min

Technical Coding / Systems Deep Dive

– 30 min

Culture & Values with CEO

– 45 min

Decision typically within 24-48 hrs of final conversation.

READY TO LEVEL UP B2B MARKETING INFRASTRUCTURE?

careers@sayprimer.com

with your résumé, LinkedIn, GitHub, or anything that showcases your reliability superpowers. Let’s build the future—without the fire-drills.

#J-18808-Ljbffr

[job_alerts.create_a_job]

Site Reliability Engineer • San Francisco, California, United States

[internal_linking.similar_jobs]

Site Reliability Engineer

Mercor, Inc. • San Francisco, California, United States

[job_card.full_time]

About Mercor Mercor is at the intersection of labor markets and AI research.We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.Our vast ta...[show_more]

[last_updated.last_updated_variable_hours] • [promoted] • [new]

Senior Technology Site Reliability Engineer

Cooley LLP • San Francisco, CA, United States

[job_card.full_time]

Senior Technology Site Reliability Engineer.Cooley is seeking a Senior Site Reliability Engineer to join the.Infrastructure & Development Operations. The Senior Technology Site Reliability Engineer(...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer I

prosper.com • San Francisco, CA, United States

[job_card.full_time]

As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry‑level position is desi...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior Site Reliability Engineer – Platform

Icon Ventures • San Francisco, CA, United States

[job_card.full_time]

At Quizlet, our mission is to help every learner achieve their outcomes in the most effective and delightful way.We blend cognitive science with machine learning to personalize and enhance the lear...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer (SRE)

SS&C Technologies • San Francisco, CA, United States

[job_card.full_time]

SS&C Technologies is a global investment and financial services software provider, headquartered in Windsor, Connecticut, and supporting more than 28,000 employees across 35 countries.It specialize...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer

Rethink recruit • San Francisco, CA, United States

[job_card.full_time]

Runloop is building the foundational infrastructure for the next generation of AI development.We provide AI engineers and data scientists with lightning-fast, secure, and reproducible code sandboxe...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Site Reliability Engineer

Mercor • San Francisco, CA, United States

[job_card.full_time]

Mercor is at the intersection of labor markets and AI research.We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.Our vast talent network ...[show_more]

[last_updated.last_updated_variable_hours] • [promoted] • [new]

Site Reliability Engineer

WorkOS • San Francisco, CA, United States

[job_card.full_time]

WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. We’re a fully distributed team with employees across...[show_more]

[last_updated.last_updated_30] • [promoted]

Staff Site Reliability Engineer

Redwood Materials, Inc. • San Francisco, CA, United States

[job_card.full_time]

Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling—keeping critical minerals in circulation and driving the energy transition.Founded in 2...[show_more]

[last_updated.last_updated_30] • [promoted]

Principal Site Reliability Engineer

Early Warning Services LLC • San Francisco, CA, United States

[job_card.full_time]

Positions located in Scottsdale, San Francisco, Chicago, or New York follow a hybrid work model to allow for a more collaborative working environment. Candidates responding to this posting must inde...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior Site Reliability Engineer

Alembic Technologies • San Francisco, CA, United States

[job_card.full_time]

Senior Site Reliability Engineer.This range is provided by Alembic Technologies.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.We’re looking fo...[show_more]

[last_updated.last_updated_30] • [promoted]

Staff / Principal Site Reliability Engineer

The Resume Database • Redwood City, CA, United States

[job_card.full_time]

Staff / Principal Site Reliability Engineer.Staff / Principal Site Reliability Engineer.You’ll architect scalable solutions, navigate complex technical challenges independently, and deliver results und...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer

gamma.app • San Francisco, CA, United States

[job_card.full_time]

We're building the creative layer for modern communication.Every month, over a billion people make presentations — but the tools they use to make them haven't evolved in decades.We're changing that...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Site Reliability Engineer

Primer • San Francisco, CA, United States

[job_card.full_time]

[last_updated.last_updated_30] • [promoted]

Senior Site Reliability Engineer

Hive • San Francisco, CA, United States

[job_card.full_time]

Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...[show_more]

[last_updated.last_updated_30] • [promoted]

Principal Site Reliability Engineer

Early Warning® • San Francisco, CA, United States

[job_card.full_time]

At Early Warning, we’ve powered and protected the U.Zelle®, Paze℠, and so much more.As a trusted name in payments, we partner with thousands of institutions to increase access to financial services...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior Site Reliability Engineer

AppOmni • San Francisco, CA, United States

[job_card.full_time]

AppOmni, a leader in SaaS Security, helps customers achieve secure productivity with their applications.Security teams and owners can quickly detect and mitigate threats using unmatched depth of pr...[show_more]

[last_updated.last_updated_30] • [promoted]

Site Reliability Engineer

ConductorOne Inc. • San Francisco, CA, United States

[job_card.full_time]

ConductorOne is the first AI-native identity security platform that protects every identity : human, non-human, and AI.With powerful automation, platform-level AI, and out-of-the-box connectors, it ...[show_more]

[last_updated.last_updated_variable_days] • [promoted]