Talent.com
Site Reliability Engineer
Zoox • Foster City, California, United States
Posted 30 days ago
Job type: Full-time

Job description
Zoox is looking for a platform/site reliability engineer to measure and maintain the uptime of the many services critical to the autonomous vehicle development process. In this role, you will be heavily involved in every phase of rolling out a service, from designing fault-tolerant systems that are easy to maintain, through deployment and operation, to continual improvement. Zoox is a robotics company, and our ethos of automation extends throughout the infrastructure components we build. Be prepared to work with systems that handle large volumes of data and with data-processing pipelines that run compute-intensive tasks on CPUs and GPUs.

In this role, you will:

    • Design and implement highly scalable and reliable systems to support Zoox's autonomous vehicle platform.
    • Optimize system performance, reliability, and scalability.
    • Develop and maintain monitoring, alerting, and reporting systems to ensure proactive identification and resolution of issues.
    • Collaborate with software engineering teams to improve deployment processes and automation.
    • Conduct root cause analysis of production issues and implement corrective actions.
    • Implement disaster recovery and business continuity plans.

Qualifications

    • 6+ years of experience in site reliability engineering or a similar role, with a strong background in working with large-scale distributed systems.
    • Proven experience with cloud platforms such as AWS, GCP, or Azure.
    • Expertise in container orchestration technologies like Kubernetes.
    • Deep understanding of networking, storage, and database technologies.
    • Strong programming skills in languages such as Python, Go, C/C++, or Java.
    • Experience with infrastructure-as-code tools such as Ansible, Salt, Terraform, or CloudFormation.

Preferred Qualifications

    • Experience in the automotive or autonomous vehicle industry.
    • Knowledge of security best practices and compliance requirements.
    • Previous experience in a leadership or mentorship role.

Compensation
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. The salary range for this position is $165,000 to $222,000. A sign-on bonus may be offered as part of the compensation package. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.
Zoox also offers a comprehensive package of benefits including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.

About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

Accommodations
If you need an accommodation to participate in the application or interview process, please reach out to accommodations@zoox.com or your assigned recruiter.
