Talent.com
Site Reliability Engineer
Site Reliability EngineerZoox • Foster City, CA, US
Site Reliability Engineer

Site Reliability Engineer

Zoox • Foster City, CA, US
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Job Description

Job Description
Zoox is seeking a Site Reliability Engineer to help ensure the availability, performance, and resilience of the services that power the development and operation of our autonomous vehicles. In this role, you will own the full lifecycle of our services—from designing fault-tolerant, maintainable systems to deploying, operating, and continuously improving them in production. As a robotics company, Zoox embraces automation at every layer of our infrastructure, and you’ll help drive that ethos forward. You’ll work hands-on with systems that process massive volumes of data and support compute-intensive pipelines running on both CPUs and GPUs.
In this role, you will:
  • Design and implement highly scalable and reliable systems to support Zoox's autonomous vehicle platform.
  • Optimize system performance, reliability, and scalability.
  • Develop and maintain monitoring, alerting, and reporting systems to ensure proactive identification and resolution of issues.
  • Collaborate with software engineering teams to improve software architecture, deployment processes, and automation.
  • Conduct root cause analysis of production issues and implement corrective actions.
  • Implement disaster recovery and business continuity plans.
Qualifications
  • 5+ years of experience in site reliability engineering or a similar role, with a strong background in working with large-scale distributed systems.
  • Proven experience with cloud platforms such as AWS, GCP, or Azure.
  • Expertise in container orchestration technologies like Kubernetes.
  • Deep understanding of networking, storage, and database technologies.
  • Strong programming skills in languages such as Python, Go, C/C++, or Java.
  • Experience with infrastructure as code tools such as Terraform, Ansible, Salt, or CloudFormation.
Bonus Qualifications
  • Experience in the automotive or autonomous vehicle industry.
  • Knowledge of security best practices and compliance requirements.

Base Salary Range

There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.

Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.
About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

Follow us on LinkedIn

Accommodations
If you need an accommodation to participate in the application or interview process please reach out to accommodations@zoox.com or your assigned recruiter.

A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

[job_alerts.create_a_job]

Site Reliability Engineer • Foster City, CA, US

[internal_linking.similar_jobs]
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Canonical • San Francisco, CA, United States
[job_card.full_time]
Senior Site Reliability Engineer role at Canonical.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets.Our platform, Ubuntu...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer — Scale & Resilience for AI Ops

Site Reliability Engineer — Scale & Resilience for AI Ops

HappyRobot • San Francisco, CA, United States
[job_card.full_time]
A high-growth AI startup in San Francisco is seeking a Site Reliability Engineer to lead the scaling of operational resilience.In this role, you will own system stability and debugging workflows wh...[show_more]
[last_updated.last_updated_30] • [promoted]
Infrastructure Site Reliability Engineer (Local only)

Infrastructure Site Reliability Engineer (Local only)

Maxonic Inc. • San Francisco, CA, United States
[job_card.full_time]
Infrastructure Site Reliability Engineer (Local only).Direct message the job poster from Maxonic Inc.Infrastructure Site Reliability Engineer.Contract (4+ months) with strong possibility to convert...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff Site Reliability Engineer, Tech Lead

Staff Site Reliability Engineer, Tech Lead

Unify • San Francisco, CA, United States
[job_card.full_time]
Unify was founded January 17th, 2023 by Austin Hughes and Connor Heggie.Connor was a machine learning research engineer at.The rest of our team comes from companies like.Our mission is to build the...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Henderson Scott • San Francisco, CA, United States
[job_card.full_time]
Site Reliability Engineer (SRE) | San Fran Bay Area | Hybrid, 2 days in Fremont Office.We're partnering with a technology-driven organisation modernising its infrastructure and operations.Infrastru...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Site Reliability Engineer

Senior Site Reliability Engineer

BetterUp • San Francisco, CA, United States
[job_card.full_time]
Let’s face it, a company whose mission is human transformation better have some fresh thinking about the employer/employee relationship.We can’t cram it all in here, but you’ll start noticing it fr...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Plenful • San Francisco, CA, United States
[job_card.full_time]
Plenful is on a mission to transform healthcare operations from the inside out.Fresh off our recent founding round and backed by Notable Capital, Bessemer Venture Partners, TQ Ventures, Susa/Kivu V...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Motive Software • San Francisco, CA, United States
[job_card.full_time]
Senior Site Reliability Engineer.Let’s face it, a company whose mission is human transformation better have some fresh thinking about the employer/employee relationship.We can’t cram it all in here...[show_more]
[last_updated.last_updated_30] • [promoted]
CloudDevs: Senior Site Reliability Engineer (SRE)

CloudDevs: Senior Site Reliability Engineer (SRE)

Breakout Tools • San Francisco, CA, United States
[job_card.full_time]
CloudDevs works with fast-moving, venture-backed startups across the US.We’re building a pool of world-class Site Reliability Engineers for current roles and for upcoming opportunities.You will eit...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer - Platform

Site Reliability Engineer - Platform

CodeRabbit • San Francisco, CA, United States
[job_card.full_time]
CodeRabbit is an innovative research and development company focused on building extraordinarily productive human‑machine collaboration systems.Our primary goal is to create the next generation of ...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Mercor • San Francisco, CA, United States
[job_card.full_time]
Mercor is at the intersection of labor markets and AI research.We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.Our vast talent network ...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff Site Reliability Engineer

Staff Site Reliability Engineer

Redwood Materials, Inc. • San Francisco, CA, United States
[job_card.full_time]
Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling—keeping critical minerals in circulation and driving the energy transition.Founded in 2...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Alembic Technologies • San Francisco, CA, United States
[job_card.full_time]
Senior Site Reliability Engineer.This range is provided by Alembic Technologies.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.We’re looking fo...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Gradle Inc. • San Francisco, CA, United States
[job_card.full_time]
Senior Site Reliability Engineer.Senior Site Reliability Engineer overseeing the reliability, performance, and availability of Develocity instances serving paying customers, open‑source projects, a...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer - Scale & Observability

Site Reliability Engineer - Scale & Observability

gamma.app • San Francisco, CA, United States
[job_card.full_time]
A dynamic tech firm located in San Francisco is seeking a Site Reliability Engineer to enhance operational health across their production systems.This high-impact role demands expertise in AWS and ...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Writemed • San Francisco, CA, United States
[job_card.full_time]
Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care ...[show_more]
[last_updated.last_updated_30] • [promoted]
Sr. Site Reliability Engineer

Sr. Site Reliability Engineer

Apple Inc. • San Francisco, CA, United States
[job_card.full_time]
San Francisco Bay Area, California, United States Software and Services.Apple is where individual imaginations gather together, committing to the values that lead to great work.Every new product we...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Air Apps • San Francisco, CA, United States
[job_card.full_time]
Site Reliability Engineer (SRE).Site Reliability Engineer (SRE).Get AI-powered advice on this job and more exclusive features.At Air Apps, we believe in thinking bigger—and moving faster.We’re a fa...[show_more]
[last_updated.last_updated_30] • [promoted]