Metrology Engineer II, Site Services
Genentech, Inc.
South San Francisco
Full-time
Are you a skilled professional with a passion for precision, innovation, and collaboration? We are seeking a dedicated and experienced individual to join our team as a Metrology Leader. In this dyna...
Site Contracts Associate
Summit Therapeutics Sub, Inc.
Menlo Park, CA, US
Full-time
Location: On-site 4 days per week at our Menlo Park, CA, Princeton, NJ or Miami, FL locations. At Summit, we believe in building a team of world-class professionals who are passionate about this mis...
Python Backend Engineer - 3D / Visualization / API / Software (On-site)
Attis
San Mateo, CA, United States
Full-time
A pioneering and well-funded AI company is seeking a talented. This is a unique opportunity to join an innovative team at the forefront of engineering and artificial intelligence, creating a new cat...
Senior Software Engineer, Site Reliability
IXL Learning
San Mateo, CA, United States
Full-time
Senior Software Engineer, Site Reliability. IXL Learning, developer of personalized learning products used by millions of people globally, is seeking Senior Software Engineers to join our Site Relia...
Staff Site Reliability Engineer
SolarWinds
San Mateo, California
Full-time
Work collaboratively with software engineering on infrastructure and deployment requirements. Contribute actively and assist in our automation and observability initiatives. Build and maintain opera...
Site Reliability Engineer
Zoox
Foster City, California, United States
Full-time
Zoox is looking for a platform / site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous veh...
Senior Site Reliability Engineer
Zipline
South San Francisco, CA, US
Full-time
Do you want to change the world? Zipline is on a mission to transform the way goods move. Our aim is to solve the world's most urgent and complex access challenges by building, manufacturing and...
Utilities / Facilities Site Leader (R&D Site)
Mentor Technical Group
Millbrae, CA, US
Full-time
Mentor Technical Group Job Opportunity. Mentor Technical Group (MTG) provides a comprehensive portfolio of technical support and solutions for the FDA-regulated industry. As a world leader in life sc...
Site Utility Manager, Site Services
Roche
South San Francisco, CA, US
Full-time
Are you ready to lead the charge in shaping and managing critical energy infrastructure for one of the world's leading biotech campuses? Genentech is seeking an experienced and forward-thinking Sit...
Site Reliability Engineer
Replit
Foster City, California, United States
Full-time
Replit is the fastest way to turn ideas into software. With our powerful AI-powered Agent and Assistant, anyone can create and launch apps from natural language in just one click. Build and deploy fu...
Lead Site Reliability Engineer
Visa
Foster City, California, United States
Full-time
Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...
Site Reliability Engineer
Cypress HCM
San Mateo, CA, United States
Full-time
As a Site Reliability Engineer (Contractor), you will be a hands-on contributor, focused on supporting and improving the reliability of our AWS cloud infrastructure. You will apply core SRE principl...
Software Engineer, Site Reliability Engineering
WisdomAI
San Mateo, CA, US
Full-time
WisdomAI has the mission to provide access and insights from data to everyone. We believe in the power of data to drive better decisions and we believe with Generative AI, there is an opportunity to...
Senior Site Reliability Engineer Cloud Platform
Zilliz
Redwood City, California, United States
Full-time
Zilliz is a fast-growing startup developing the industry’s leading .Founded by the engineers behind Milvus, the world’s most popular . On a mission to democratize AI, Zilliz is committed to simplify...
Senior / Lead Site Reliability Engineer Federal
C3 AI
Redwood City, California, United States
Full-time
C3 AI (NYSE: AI) is the Enterprise AI application software company. C3 AI delivers a family of fully integrated products including the C3 Agentic AI Platform, an end-to-end platform for developing,...
Senior Software Engineer, Site Reliability
IXL
San Mateo, CA
Full-time
IXL Learning, developer of personalized learning products used by millions of people globally, is seeking Senior Software Engineers to join our Site Reliability team, and help maintain the reliabil...
The Infrastructure Compute Site Reliability Engineering (SRE) team's mission is to own and manage the successful operation of our underlying cell infrastructure system, along with elements of service discovery, secrets management and related software layers. We’re looking for skilled Site Reliability Engineers with strong programming skills to help us build Roblox's private cloud, productionize our growing Kubernetes-based infrastructure, and institute reliability best practices across the Roblox Compute team.
You will:
Design and develop systems and libraries that promote fault tolerance and resilience, automate much of the management and lifecycle of our clusters, and ensure systems are observable.
Promote and institute reliability best practices across the Infra Compute group and drive common reliability initiatives. Provide collaborative technical reviews and operational guidance to strengthen system reliability.
Build, automate and standardize processes to create a "golden path" of tooling and platform support that powers the fundamental Roblox ecosystem.
Create tooling that provides production guardrails by evaluating release candidate capacity with load testing tooling before deploying to production.
Create performance monitoring services and observability to understand capacity issues and platform degradations, and to monitor production services and their changes, such as generalized canarying services with alerting.
Analyze systems and system designs for production readiness.
You have:
A Bachelor's degree (or equivalent professional experience) in Computer Science or a related engineering field, with a proven track record including at least 4 years as an SRE or Software Engineer.
Fluency with high-level programming languages like Go, Java, or C#.
Experience with Kubernetes, or similar orchestration systems. Experience in Nomad, Vault, and Consul is strongly desired.
Experience and good habits around building software and tools and getting them adopted. Your systems focus informs a view that code needs to be deeply reliable.
You are:
A Partner: You know that the best tools integrate broadly with the tooling ecosystem. You approach partners and processes with curiosity and seek to understand a problem deeply before you start coding.
A Developer: You love building durable and reliable complex systems.
Passionate about problem-solving, finding creative work solutions, and addressing unexpected challenges as part of a team.
Problem Solver: You ask the right questions to tackle issues within your expertise and you use data to test your theories.
Planner: You have experience in large project lifecycles. You have experience working in sprints, breaking down complex tasks into achievements, and reporting status to keep project scheduling accurate.