Talent.com
Sr/Staff Software Engineer, Observability
Sr/Staff Software Engineer, ObservabilityCrusoe Energy Systems LLC • San Francisco, California, United States
[error_messages.no_longer_accepting]
Sr / Staff Software Engineer, Observability

Sr / Staff Software Engineer, Observability

Crusoe Energy Systems LLC • San Francisco, California, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About This Role :

We are looking for a highly skilled engineer with deep expertise in building and operating observability platforms at scale.

As a

Senior / Staff Software Engineer, Observability , you be

supporting

our team of Network Engineers (no network engineering experience required). The Crusoe Cloud Network Engineering Team is responsible for designing, building, and operating the global edge, backbone, and data center network for High Performance Compute (HPC) Clusters with GPUs.

This is a

unique opportunity

wherein you'll be building an observability platform specific to devices; one that is language agnostic and scalable. You will design, develop, and run Crusoe’s next-generation observability stack, enabling engineers to understand the internal state of distributed systems through metrics, logs, and traces.

A Day in the Life :

Designing and operating scalable observability systems (metrics, logging, tracing) across multi-datacenter Kubernetes environments

Architecting end-to-end telemetry pipelines, including ingestion, storage, querying, and visualization

Extending monitoring and alerting with Prometheus, Alertmanager, Thanos / Cortex, Grafana, and OpenTelemetry

Building scalable log collection and processing pipelines with Fluent Bit, Vector, Loki, or ELK / Opensearch stacks

Implementing distributed tracing platforms (Tempo, Jaeger, OpenTelemetry) and integrating with service meshes, load balancers, and APIs

Defining and driving adoption of SLOs, SLIs, and error budgets across services and teams

Automating provisioning and scaling of observability infrastructure with Kubernetes, Terraform, and custom tooling (Go, Python)

Ensuring reliability and cost efficiency of telemetry pipelines while supporting high-volume workloads (AI / ML, HPC clusters, GPU infrastructure)

Embedding security best practices into observability platforms, including RBAC, TLS, secret management, and multi-tenant access controls

Mentoring engineers and shaping Crusoe’s observability strategy and technical roadmap

You Will Thrive In This Role If :

6+ years of experience with distributed systems, with a focus on observability and monitoring systems

Deep expertise with metrics systems (Prometheus, Thanos, Mimir, Cortex), logging pipelines (Fluent Bit, Vector, Loki, ELK / Opensearch), and tracing platforms (Jaeger, Tempo, OpenTelemetry)

Strong programming skills in Go or Python for automation, operators, and custom integrations

Experience running observability platforms on Kubernetes and operating them at scale across multi-datacenter environments

Proven ability to design, optimize, and scale telemetry pipelines handling high cardinality and high throughput data

Solid understanding of distributed systems, performance engineering, and debugging complex workloads

Familiarity with service meshes, networking, and workload instrumentation (Envoy, Istio, OpenTelemetry SDKs)

Strong collaboration skills and the ability to influence engineering teams to adopt observability best practices

Benefits :

Industry competitive pay

Restricted Stock Units in a fast growing, well-funded technology company

Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents

Employer contributions to HSA accounts

Paid Parental Leave

Paid life insurance, short-term and long-term disability

Teladoc

401(k) with a 100% match up to 4% of salary

Generous paid time off and holiday schedule

Cell phone reimbursement

Tuition reimbursement

Subscription to the Calm app

MetLife Legal

Company paid commuter benefit; $300 per month

Compensation Range :

Compensation will be paid in the range of $172,000 - $253,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex / gender, sexual preference / orientation, gender identity, age, veteran status, national origin or any other status protected by law or regulation.

#J-18808-Ljbffr

[job_alerts.create_a_job]

Software Engineer • San Francisco, California, United States

[internal_linking.similar_jobs]
Staff Software Engineer (Incubation)

Staff Software Engineer (Incubation)

GoFundMe • San Francisco, CA, United States
[job_card.full_time]
Want to help us help others? We’re hiring!.GoFundMe is the world’s most powerful community for good, dedicated to helping people help each other. By uniting individuals and nonprofits in one place, ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Sr. Staff Software Engineer, Core Foundations

Sr. Staff Software Engineer, Core Foundations

Pinterest • San Francisco, California, United States
[job_card.full_time]
Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Staff Software Engineer

Staff Software Engineer

Linktree • San Francisco, CA, United States
[job_card.full_time]
United States • Los Angeles, CA • New York, NY • San Francisco, CA.Engineering • Hybrid • Full-time.We are looking for talented and experienced software engineers with a specialization in growth to...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Sr / Staff Software Engineer, Observability

Sr / Staff Software Engineer, Observability

Crusoe Energy Systems LLC • San Francisco, CA, United States
[job_card.full_time]
Crusoe's mission is to accelerate the abundance of energy and intelligence.We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, spe...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Staff Software Engineer - GenAI Reliability & Observability

Staff Software Engineer - GenAI Reliability & Observability

Eden Prescott • San Francisco, CA, United States
[job_card.full_time]
A pioneering tech company in San Francisco is seeking a Senior or Staff Software Engineer for their expanding team.This role focuses on designing and building GenAI evaluation and observability inf...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Sr Staff Software Engineer, Consumer Pricing and Incentives

Sr Staff Software Engineer, Consumer Pricing and Incentives

Uber • San Francisco, California, United States
[job_card.full_time]
Sr Staff Software Engineer, Consumer Pricing and Incentives About The Role And Team The Consumer Pricing and Incentives team is central to the economics of Uber Delivery, directly influencing how w...[show_more]
[last_updated.last_updated_1_hour] • [promoted] • [new]
Staff Software Engineer (San Francisco hybrid)

Staff Software Engineer (San Francisco hybrid)

Pomelo Care • San Francisco, CA, United States
[job_card.full_time]
Staff Software Engineer (San Francisco hybrid).Join to apply for the Staff Software Engineer (San Francisco hybrid) role at Pomelo Care. Pomelo Care is a multidisciplinary team of clinicians, engine...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior / Staff Software Engineer (Platform)

Senior / Staff Software Engineer (Platform)

Pallet • San Francisco, CA, United States
[job_card.full_time]
Senior / Staff Software Engineer (Platform).Pallet is building AI Agents to transform logistics — a $12 trillion global industry. We’ve raised $50M from top investors including General Catalyst, Besse...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior / Staff Software Engineer, Product & Systems

Senior / Staff Software Engineer, Product & Systems

Lightfield • San Francisco, CA, United States
[job_card.full_time]
Lightfield is a next-generation CRM that automatically captures customer interactions like emails, meetings, and support tickets and organizes them into structured CRM data, enabling deep analysis ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Staff Software Engineer

Staff Software Engineer

Ironclad Inc. • San Francisco, CA, United States
[job_card.full_time]
Ironclad is the leading AI contracting platform that transforms agreements into assets.Contracts move faster, insights surface instantly, and agents push work forward, all with you in control.Wheth...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior / Staff Software Engineer

Senior / Staff Software Engineer

Amae Health • San Francisco, CA, United States
[job_card.full_time]
Transforming the lives of those affected by severe mental illness.At Amae Health, we are dedicated to helping the 15.Americans living with severe mental illness (SMI) lead stable, meaningful lives,...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff Software Engineer

Staff Software Engineer

TogetherWeTech • San Francisco, CA, United States
[job_card.full_time]
Staff Software Engineer - SF Bay (4 days onsite) | Up to $195K - $255K + Equity.Get AI-powered advice on this job and more exclusive features. This range is provided by TogetherWeTech.Your actual pa...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior / Staff Software Engineer

Senior / Staff Software Engineer

Embakire Workforce • San Francisco, CA, United States
[job_card.full_time]
Be among the first 25 applicants.This range is provided by Embakire Workforce.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.Join a high-veloci...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff Software Engineer, Experimentation

Staff Software Engineer, Experimentation

Chime • San Francisco, CA, United States
[job_card.full_time]
Staff Software Engineer, Experimentation.The Experimentation Platform team at Chime develops a critical tool that empowers Engineers, Product Managers, Data Scientists, Analysts, ML Engineers, and ...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff Software Engineer San Francisco

Staff Software Engineer San Francisco

Clerk Chat, Inc. • San Francisco, CA, United States
[job_card.full_time]
Clerk Chat's mission is to make every business conversational.We are achieving this by building the leading messaging application, integrating AI where it matters, and crafting our own telecom infr...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Staff Software Engineer

Staff Software Engineer

Bio-Rad Laboratories • Hercules, CA, United States
[job_card.full_time]
This role is both technical and collaborative.You will work closely with cross-functional teams including systems engineers, mechanical designers, assay development scientists, and quality engineer...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff Software Engineer

Staff Software Engineer

Omada Health • South San Francisco, CA, United States
[job_card.full_time]
Omada Health is on a mission to inspire and engage people in lifelong health, one step at a time.Omada Health is a digital care provider that empowers people to achieve their health goals through s...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior / Staff Software Engineer, Metrics Tooling and Automation

Senior / Staff Software Engineer, Metrics Tooling and Automation

Zoox • Foster City, CA, US
[job_card.full_time]
Our team is working to ensure Zoox’s hardware and software are safe for operation in the real world through the use of simulation. As part of this process, the team is responsible for designin...[show_more]
[last_updated.last_updated_30] • [promoted]