Talent.com
Senior Compute SRE — AI Cloud, Kernel & Virtualization
Senior Compute SRE — AI Cloud, Kernel & VirtualizationCrusoe • San Francisco, California, United States
Senior Compute SRE — AI Cloud, Kernel & Virtualization

Senior Compute SRE — AI Cloud, Kernel & Virtualization

Crusoe • San Francisco, California, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

A leading AI-first cloud infrastructure company is seeking a Site Reliability Engineer to enhance their compute infrastructure. This role involves developing automation tools and optimizing performance for AI workloads. Candidates should have over 8 years of experience in SRE or Linux system engineering, with expertise in virtualization technologies like KVM and a strong command of Linux kernel internals. The position offers competitive compensation, including stock options, and a hybrid work schedule.

#J-18808-Ljbffr

[job_alerts.create_a_job]

Senior Cloud Compute • San Francisco, California, United States

[internal_linking.related_jobs]
Senior SRE : Reliability, Automation & Cloud Ops

Senior SRE : Reliability, Automation & Cloud Ops

Cooley LLP • San Francisco, CA, United States
[job_card.full_time]
A leading law firm in San Francisco seeks a Senior Technology Site Reliability Engineer to ensure high availability and performance of the firm's infrastructure. Responsibilities include monitoring ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Manager, REMS Data Programmer

Senior Manager, REMS Data Programmer

Jazz Pharmaceuticals • Redwood City, California, USA
[job_card.full_time]
If you are a current Jazz employee please apply via the Internal Career site.Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Director, Data and AI Architecture Leader

Senior Director, Data and AI Architecture Leader

Dynavax Technologies • Emeryville, CA, United States
[job_card.full_time]
This position can be 100% remote, but must be located in the United States.Dynavax is a commercial-stage biopharmaceutical company developing and commercializing novel vaccines to help protect the ...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior SDE- Kernel Engineer

Senior SDE- Kernel Engineer

Amazon • San Francisco, CA, United States
[job_card.full_time]
The Amazon Devices team designs and engineers high-profile consumer electronics, including the best-selling Kindle family of products. We have also produced groundbreaking devices like Fire tablets,...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior SRE : Cloud-Native Infra for Grid Reliability

Senior SRE : Cloud-Native Infra for Grid Reliability

Gridware • San Francisco, CA, United States
[job_card.full_time]
A leading technology company in San Francisco is seeking a Senior Site Reliability Engineer to design and maintain cloud-native infrastructure on AWS. The ideal candidate will manage Kubernetes clus...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Compute SRE : Kernel & Virtualization for AI / HPC

Senior Compute SRE : Kernel & Virtualization for AI / HPC

Crusoe • San Francisco, CA, United States
[job_card.full_time]
A leading cloud infrastructure provider seeks an experienced Site Reliability Engineer to enhance their compute infrastructure. You'll develop automation tools, optimize performance for AI workloads...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
SRE : Scale AI Infra with Kubernetes & Automation | Equity

SRE : Scale AI Infra with Kubernetes & Automation | Equity

Together • San Francisco, CA, United States
[job_card.full_time]
A research-driven AI company is seeking a Site Reliability Engineer to manage user-facing services and production systems. Responsibilities include participating in on-call rotations, building infra...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior SRE - AI-Driven Cloud Reliability & Automation

Senior SRE - AI-Driven Cloud Reliability & Automation

Crusoe Energy Systems LLC • San Francisco, CA, United States
[job_card.full_time]
A leading energy technology firm seeks a Site Reliability Engineer to enhance its reliable, energy-efficient, AI-optimized cloud platform. In this role, you'll collaborate with cross-functional team...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Director, AI SRE & DevOps – GenAI Reliability

Director, AI SRE & DevOps – GenAI Reliability

Charles Schwab • San Francisco, CA, United States
[job_card.full_time]
A leading financial services firm is seeking a Director of AI SRE & DevOps in San Francisco.In this role, you will lead infrastructure and reliability efforts for GenAI applications.Ideal candidate...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior SRE : Build Scalable, Reliable Systems

Senior SRE : Build Scalable, Reliable Systems

The10minutecareersolution • San Francisco, CA, United States
[job_card.full_time]
A tech solutions provider is seeking a Senior Web site Reliability Engineer to enhance system reliability and performance in fast-paced startup environments across the US.The ideal candidate will h...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior AI Engineering Leader : Scalable Cloud AI & Mentorship

Senior AI Engineering Leader : Scalable Cloud AI & Mentorship

Capital One • San Francisco, CA, United States
[job_card.full_time]
A leading financial services company in San Francisco is seeking an experienced AI / ML Engineer to develop advanced algorithms and systems. Candidates should have extensive experience with programmin...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior SRE — Healthcare Cloud Infra & Networking

Senior SRE — Healthcare Cloud Infra & Networking

Collective Health • San Francisco, CA, United States
[job_card.full_time]
A healthcare technology company based in San Francisco is seeking a Senior Site Reliability Engineer to design and maintain the cloud infrastructure for healthcare applications.This role blends sof...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior SRE : Scale Reliable Cloud Systems & Observability

Senior SRE : Scale Reliable Cloud Systems & Observability

Air Apps, Inc. • San Francisco, CA, United States
[job_card.full_time]
A leading tech company in San Francisco is seeking a Site Reliability Engineer (SRE) to ensure the reliability, availability, and scalability of systems. You will implement automation and monitoring...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
SRE Engineer : AI Infra, Observability & Reliability

SRE Engineer : AI Infra, Observability & Reliability

Sierra • San Francisco, CA, United States
[job_card.full_time]
A technology company in San Francisco seeks a Software Engineer for its Site Reliability team.This role involves defining and building reliability and scalability in an AI-driven infrastructure.Can...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Machine Learning Engineer, Computer Vision - Robotics

Senior Machine Learning Engineer, Computer Vision - Robotics

Scale AI, Inc. • San Francisco, CA, United States
[job_card.full_time]
Scale's Robotics business unit is dedicated to solving the data bottleneck in Physical AI.This position will be a key contributor in conducting applied research in Robotics and developing ML pipeli...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Distributed Systems Engineer – AI Cloud & Kernel

Senior Distributed Systems Engineer – AI Cloud & Kernel

E2b • San Francisco, CA, United States
[job_card.full_time]
A tech startup in San Francisco seeks an experienced infrastructure engineer to build a cloud platform for AI software.The role involves developing a distributed system and an orchestrator for effi...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Hybrid Cloud SRE : Scale AI Infrastructure & Resilience

Hybrid Cloud SRE : Scale AI Infrastructure & Resilience

Atlas • San Francisco, CA, United States
[job_card.full_time]
A leading technology firm in San Francisco is looking for a Site Reliability Engineer to enhance the reliability and performance of its platform. This role will involve designing fault-tolerant infr...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Director of AI SRE & DevOps, AI.x

Director of AI SRE & DevOps, AI.x

Charles Schwab Corporation • San Francisco, CA, United States
[job_card.full_time]
At Schwab, you will build a rewarding career while making a difference in the lives of our millions of clients.Here, innovative thinking meets creative problem solving as we work together to challe...[show_more]
[last_updated.last_updated_variable_days] • [promoted]