Staff Systems Engineer, Fault Management

Kodiak • San Francisco, California, USA

Kodiak Robotics, Inc. was founded in 2018 and has become a leader in autonomous ground transportation, committed to a safer and more efficient future for all. The company has developed an artificial intelligence (AI) powered technology stack purpose-built for commercial trucking and the public sector. The company delivers freight daily for its customers across the southern United States using its autonomous technology. In 2024, Kodiak became the first known company to publicly announce delivering a driverless semi-truck to a customer. Kodiak is also leveraging its commercial self-driving software to develop, test, and deploy autonomous capabilities for the U.S. Department of Defense.

The Systems and Safety Engineering team at Kodiak is seeking an experienced Systems Engineer to own the design and execution of Kodiak's next-generation Autonomy Fault Management System. This individual will lead the effort end-to-end: from product and system requirement definition, through architecture and implementation, to verification and validation and safety case integration. This leader will ensure that the Kodiak Driver handles onboard system faults with the desired, correct, and safe response. This role is central to progress toward a scalable driverless deployment and will work closely with autonomy, hardware, software, and system safety teams.

This role directly shapes Kodiak's ability to operate sustainably at commercial scale. Fault management is not only a safety system; it is a primary lever of fleet availability, utilization, and cost per mile. You will own the technical strategies that determine when the system can continue operating safely, when it must degrade, and when it must exit service.

In this role, you will:

  • Lead the end-to-end development of the next-generation Autonomy Fault Management System, driving the collaborative effort across hardware, software, system safety, and operations teams.
  • Own the systems and safety engineering execution for fault management across the full V-model lifecycle.
  • Lead the development of systems engineering artifacts, including requirements, traceability, V&V plans, and V&V evidence.
  • Define and lead the fault management architecture and concept of operations, including detection, isolation, response, safe-state definition, and minimum risk conditions.
  • Generate technical evidence in support of the adequacy, coverage, and sufficiency of the Fault Management System as an element of Kodiak's Driverless Safety Case.
  • Support quantitative and qualitative analyses used to set detection thresholds, prioritize hazards, and evaluate risk associated with fault responses and minimum risk maneuvers.
  • Lead and influence system architecture trade studies that impact fault coverage, system availability, safety risk, and operational continuity.
  • Develop the strategy for managing system availability, degraded operation, and operational continuity through the Fault Management System.
  • Quantify the commercial and safety impact of false positive and false negative detections.
  • Provide analysis to support complex autonomy system design trade-offs and inform system design decisions affecting safety and performance.
  • Serve as the technical leader to align cross-functional teams around a unified fault management strategy.

What you'll bring:

  • B.S., M.S., or PhD in engineering or a related technical field
  • 5 years of experience with real-time, safety-critical applications, preferably in highly automated or autonomous systems (autonomous vehicles, aerospace, nuclear, medical, etc.)
  • Experience with fault management, diagnostic development, and safe state identification and development
  • Experience working with agile software engineering teams
  • Ability to read C / C++ code
  • Experience with the systems engineering V-model and its application within the product life cycle
  • Strong verbal and written communication skills
  • Ability to collaborate effectively with technical stakeholders spanning multiple technical disciplines

What we offer:

  • Competitive compensation package, including equity and annual bonuses
  • Excellent Medical, Dental, and Vision plans through Kaiser Permanente, Cigna, and MetLife (including a medical plan with infertility benefits)
  • MetLife Legal Services, Identity & Fraud Protection, Hospital Indemnity Insurance, Accident Insurance & Critical Illness Insurance
  • Flexible PTO, 10 paid holidays, and generous parental leave policies
  • Our office is centrally located in Mountain View, CA
  • Office perks: dog-friendly, free catered lunch, a fully stocked kitchen, and free EV charging
  • Long Term Disability, Short Term Disability, Life Insurance
  • Wellbeing benefits: Headspace through Cigna, Calm through Kaiser, One Medical, Gympass, Spring Health through Cigna, Rula (mental health navigation)
  • Fidelity 401(k)
  • Commuter FSA, Dependent Care FSA, HSA
  • Various incentive programs (referral bonuses, patent bonuses, etc.)

The pay range listed below reflects the base salary in our SF / Silicon Valley location across several internal levels. Actual starting pay will be based on job-related factors, including work location, experience, relevant training, education, skill level, and performance during the interview. Total compensation at Kodiak includes base pay, equity, bonus, and a competitive benefits package.

California Pay Range

$200,000 - $268,000 USD

At Kodiak, we strive to build a diverse community working towards our common company goals in a safe and collaborative environment where harassment of any kind is strictly prohibited. Kodiak is committed to equal opportunity employment regardless of race, ethnicity, religion, gender identity, sexual orientation, age, disability, or veteran status, or any other basis protected by applicable law.

In alignment with its business operations, Kodiak adheres to all relevant statutes, regulations, and administrative prerequisites. Accordingly, roles that carry more sensitive requirements may be limited to candidates that can satisfy additional scrutiny, and eligibility for such positions may hinge on verification of a candidate's residence, U.S. person status, and/or citizenship status. Should the position so require, and should Kodiak determine that a candidate's residence, U.S. person status, and/or citizenship status necessitate an export license, bar the candidate from the position, or otherwise fall under national security-related restrictions, Kodiak will consider the candidate for alternative positions unaffected by such restrictions, under terms and conditions set forth at Kodiak's sole discretion, or, as an alternative, opt not to proceed with the candidate's application. If applicable, Kodiak may provide visa sponsorship for eligible candidates.

We use a third-party AI tool (Endorsed) to assist in the initial screening of applications. As part of the evaluation process, we provide Endorsed with job requirements and candidate-submitted applications. Final hiring decisions are made by our human recruitment team, and no automated system makes the ultimate decision regarding hiring. Certain features of the platform may qualify it as an Automated Employment Decision Tool (AEDT) under applicable regulations. We began using Endorsed on January 1, 2026. You can review the independent bias audit report covering our use of Endorsed here. By submitting your application, you acknowledge that your application may be processed by AI systems as part of the screening and selection process. If you have any questions or would like to request a separate review of your application, please contact us with "Separate Review Request" in the email subject line.

Required Experience: Staff IC

Key Skills: Computer Science, Docker, Kubernetes, Python, VMware, C / C++, Go, System Architecture, gRPC, OS Kernels, Perl, Distributed Systems

Employment Type: Full-Time

Vacancy: 1

Salary: $200,000 - $268,000
