Talent.com
Production Engineer, Storage
Crusoe • San Francisco, CA, US
Job type: Full-time

Job Description

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed, or sustainability.

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.

About This Role:

At Crusoe Energy Systems, our Site Reliability Engineering (SRE) team plays a mission-critical role in maintaining the performance and reliability of our AI-optimized cloud infrastructure. The Storage-focused SRE role is responsible for ensuring the availability, performance, and scalability of Crusoe’s cloud storage products and services, which power compute-intensive, latency-sensitive workloads for AI and HPC use cases. This role directly supports our vertically integrated, sustainable cloud platform by building and optimizing distributed, fault-tolerant storage systems at scale.

What You'll Be Working On:

In this role, you will build automation and self-healing tools to monitor and maintain Crusoe’s distributed cloud storage infrastructure, which includes block, file, and object storage systems. You will drive reliability initiatives focused on data replication, encryption, backup and restore strategies, and robust failover mechanisms. Collaborating closely with storage engineers, you will help implement and maintain high-performance NVMe- and SSD-backed volumes that support large-scale AI compute clusters. Your responsibilities will also include supporting user-facing storage services with a focus on availability, performance tuning, and adherence to error budgets. You’ll investigate and resolve storage-related incidents using deep telemetry, logs, and performance profiling, while also partnering with hardware and kernel teams to diagnose low-level I/O issues and optimize I/O paths, cache policies, and file systems. Additionally, you will contribute to the architecture of fault-tolerant, scalable storage backends tailored for AI-first cloud environments.

What You’ll Bring to the Team:

5+ years of professional experience in SRE, systems, or storage engineering.

Hands-on experience with distributed storage systems (e.g., Ceph, GlusterFS, OpenEBS) and deep understanding of object, block, and file storage paradigms.

Proficiency in a programming language such as Python, Go, Java, or C.

Experience with Infrastructure as Code and deployment tooling such as Terraform, Ansible, or Puppet.

Deep knowledge of Linux internals with a focus on I/O subsystems, memory management, and storage scheduling.

Familiarity with storage protocols like NFS, SMB, iSCSI, or NVMe-oF.

Strong experience working with containerized workloads and orchestration platforms (e.g., Kubernetes, Docker).

Excellent incident response, troubleshooting, and documentation practices.

Experience building and operating managed services at scale, such as object, file, and block storage (AWS, GCP, Azure).

Excellent communication skills.

Must be able to pass a background check.

Embody the company values.

Bonus Points:

Contributions to open-source storage projects or the Linux storage stack.

Experience with hybrid storage models across on-prem and cloud environments.

Familiarity with high-throughput network topologies for storage backplanes (e.g., RoCE, RDMA, InfiniBand).

Benefits:

Industry competitive pay

Restricted Stock Units in a fast-growing, well-funded technology company

Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents

Employer contributions to HSA accounts

Paid Parental Leave

Paid life insurance, short-term and long-term disability

Teladoc

401(k) with a 100% match up to 4% of salary

Generous paid time off and holiday schedule

Cell phone reimbursement

Tuition reimbursement

Subscription to the Calm app

MetLife Legal

Company-paid commuter benefit: $300 per month

Compensation:

Compensation will be paid in the range of $166,000 - $201,000 a year + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.

Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.
