Talent.com
Staff Software Engineer
Staff Software EngineerCrusoe • San Francisco, CA, United States
[error_messages.no_longer_accepting]
Staff Software Engineer

Staff Software Engineer

Crusoe • San Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About This Role :

We are looking for a highly skilled engineer with deep experience running Kubernetes at scale on bare metal. You will help design, build, and operate Crusoe’s next-generation Kubernetes platform across global datacenters, ensuring performance, reliability, and automation at every layer of the stack.

What You’ll Be Working On :

  • Designing, building, and operating Kubernetes clusters on bare metal at scale
  • Engineering full cluster lifecycle management (Talos bootstrapping, upgrades, node reprovisioning, HA control planes, recovery workflows)
  • Architecting networking, load balancing, and service mesh solutions optimized for bare metal
  • Implementing performant CNIs (Calico, Cilium), integrating L2 / L3 networking, routing (BGP / ECMP), and optimizing traffic across racks and datacenters
  • Automating provisioning via PXE / iPXE, Tinkerbell, MAAS, and managing BMCs / IPMI / Redfish with standardized BIOS / firmware across heterogeneous hardware fleets
  • Designing and operating persistent storage (local disks, block, object) including Ceph, Rook, and openEBS
  • Building automation and tooling (Go, Python, Bash) for provisioning, drift detection, upgrades, and incident response
  • Extending observability with Prometheus, Alertmanager, Grafana, OpenTelemetry, and defining SLOs for cluster health, latency, and workload availability
  • Implementing security best practices : Vault, cert-manager, RBAC hardening, network policies, and OS / K8s patch pipelines
  • Mentoring engineers and shaping technical direction for Crusoe’s Kubernetes platform

What You’ll Bring to the Team :

  • 7+ years in infrastructure engineering, including 3+ years operating Kubernetes in production
  • Strong experience running Kubernetes on bare metal (not just managed services)
  • Expert-level knowledge of Linux internals (cgroups, namespaces, kernel networking)
  • Deep experience with CNIs (Cilium, Calico), load balancers (Envoy, HAProxy, F5), and L3 networking (BGP, ECMP)
  • Proven track record provisioning and operating physical servers at scale (PXE / iPXE, Tinkerbell, MAAS, BMC / IPMI automation)
  • Strong programming skills in Go for building operators, controllers, and automation tooling
  • Hands-on experience with distributed storage systems (Ceph, MinIO, Rook, CSI drivers)
  • Strong background in observability (Prometheus, Alertmanager, metrics autoscaling, logging / ELK)
  • Familiarity with PKI, identity, and secrets management (Vault, cert-manager)
  • Excellent debugging skills for complex distributed systems
  • Strong communication and collaboration across cross-functional teams
  • Bonus Points :

  • Experience with hardware fleet management across multiple datacenters
  • Contributions to open source Kubernetes or related ecosystem projects
  • Experience implementing disaster recovery strategies at scale
  • Familiarity with GPUs, HPC clusters, or large-scale AI / ML workloads
  • Benefits :

  • Industry competitive pay
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Subscription to the Calm app
  • MetLife Legal
  • Company paid commuter benefit; $300 per month
  • Compensation :

    Compensation will be paid in the range of $204,000 - $247,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

    Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex / gender, sexual preference / orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.

    #J-18808-Ljbffr

    [job_alerts.create_a_job]

    Staff Software Engineer • San Francisco, CA, United States

    [internal_linking.similar_jobs]
    Staff Software Engineer

    Staff Software Engineer

    Idler • San Francisco, CA, US
    [job_card.full_time]
    Idler builds reinforcement learning environments that teach AI models to code like 0.Our training environments are based on real-world coding scenarios that frontier models will actually encounter....[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    IPG Mediabrands • San Francisco, California, United States
    [job_card.full_time]
    KINESSO is the technology-driven performance marketing agency that sits at the very heart of IPG Mediabrands, providing actionable growth for both our agency partners and clients.We turn 'action' i...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Software Engineer Metrics US

    Staff Software Engineer Metrics US

    Promote Project • San Francisco, CA, United States
    [job_card.full_time]
    At Weights & Biases, our mission is to build the best tools for AI developers.We founded our company on the insight that while there were excellent tools for developers to build better code, there ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Cleric • San Francisco, CA, US
    [job_card.full_time]
    We're building an autonomous AI SRE that helps software engineering teams reliably investigate production incidents.Our agent combines LLMs with tools to understand systems, reason through prob...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Software Engineer, Database Systems

    Staff Software Engineer, Database Systems

    Zilliz • Redwood City, CA, US
    [job_card.full_time]
    Zilliz is a fast-growing startup developing the industry’s leading vector database company for enterprise-grade AI.Founded by the engineers behind Milvus, the world’s most pop...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Software Engineer AI Agents

    Staff Software Engineer AI Agents

    Goodleap • San Francisco, California, United States
    [job_card.full_time]
    GoodLeap is a technology company delivering best-in-class financing and software products for sustainable solutions, from solar panels and batteries to energy-efficient HVAC, heat pumps, roofing, w...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Collective • San Francisco, CA, United States
    [job_card.full_time]
    Collective is on a mission to redefine the way businesses-of-one work.Our technology and team of trusted advisors help members achieve financial independence by taking care of everything from busin...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer, Platform

    Staff Software Engineer, Platform

    Anthropic • San Francisco, California, USA
    [job_card.full_time]
    Anthropics mission is to create reliable interpretable and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer, Core

    Staff Software Engineer, Core

    Descript • San Francisco, CA, United States
    [job_card.full_time]
    We are building the next‑generation AI‑powered platform and web application for easy and fast creation of audio and video content. Growing this revolutionary product involves unique technical challe...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Braveclojure • San Francisco, CA, United States
    [job_card.full_time]
    Our mission at Onton is to help people make decisions they love, instantly.We’re tackling the most economically impactful decisions first : the average shopping journey takes 79 days, and we’re taki...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Front End Software Engineer

    Staff Front End Software Engineer

    OSI Engineering • Menlo Park, CA, US
    [job_card.full_time]
    Staff Front End Software Engineer Job Summary We are looking for a talented Staff Software Engineer to join our front-end engineering team developing web solutions. You will be part of a dynamic tea...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Software Engineer - Forward Deployed

    Staff Software Engineer - Forward Deployed

    Invisible Technologies • San Francisco, California, United States
    [job_card.full_time]
    Invisible Technologies is the AI operating system for the enterprise.Our end-to-end AI Software Platform structures messy data, builds digital workflows, deploys agentic solutions, evaluates / measur...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Altana AI • San Francisco, CA, United States
    [job_card.full_time]
    AI can be a powerful tool for good in the world – at Altana we apply AI to the world’s largest organized body of supply chain data to power a more resilient, more secure, and more sustainable model...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer, Platform

    Staff Software Engineer, Platform

    Scale • San Francisco, CA, United States
    [job_card.full_time]
    Software is eating the world, but AI is eating software.We live in unprecedented times – AI has the potential to exponentially augment human intelligence. Every person will have a personal tutor, co...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Omada Health • South San Francisco, CA, United States
    [job_card.full_time]
    Omada Health is on a mission to inspire and engage people in lifelong health, one step at a time.Omada Health is a digital care provider that empowers people to achieve their health goals through s...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    VirtualVocations • Oakland, California, United States
    [job_card.full_time]
    A company is looking for a Staff Software Engineer - Typescript.Key Responsibilities Provide technical leadership across multiple teams, setting architectural vision and ensuring consistency in t...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Bio-Rad Laboratories • Hercules, CA, United States
    [job_card.full_time]
    This role is both technical and collaborative.You will work closely with cross-functional teams including systems engineers, mechanical designers, assay development scientists, and quality engineer...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Software Engineer, Full Stack

    Staff Software Engineer, Full Stack

    Verse Medical • San Francisco, CA, US
    [job_card.full_time]
    Hospital-Quality Care, Everywhere.The healthcare industry still relies on faxes and phone tag to coordinate critical care for patients at home. We think patients and the clinicians who serve them de...[show_more]
    [last_updated.last_updated_30] • [promoted]