About the Team
OpenAI’s Hardware organization develops silicon and system-level solutions designed for the unique demands of advanced AI workloads. The team is responsible for building the next generation of AI-native silicon while working closely with software and research partners to co-design hardware tightly integrated with AI models. In addition to delivering production-grade silicon for OpenAI’s supercomputing infrastructure, the team creates custom design tools and methodologies that accelerate innovation and enable hardware optimized specifically for AI.
About the Role
We’re looking for a Hardware Development Infrastructure Engineer to build and run the infrastructure that powers OpenAI’s hardware development lifecycle. You’ll work closely with hardware teams to translate their workflows into scalable, observable, and automated systems, and then own the platforms that support them over time.
This role sits at the intersection of hardware, cloud, HPC, DevOps, and data. You’ll design regression systems, CI/CD pipelines, cloud and cluster platforms, and the data foundations that make development efficiency visible and measurable.
In this role, you will:
Partner with hardware teams on workflows and tooling: Embed with teams across DV, PD, emulation, formal, and software to understand development flows, identify failure modes, and deliver tooling (CLIs, services, APIs) that reduces manual work and accelerates iteration.
Build and operate regression systems at scale: Own regressions end-to-end, from definition and scheduling through execution, results ingestion, triage, and reporting, while improving throughput and reproducibility and reducing flaky runs.
Own CI/CD for infrastructure and tooling: Design and operate pipelines for infrastructure-as-code, services, images, and cluster configuration changes, including testing, gated deploys, staged rollouts, and safe rollback.
Run cloud and HPC platforms: Design, provision, and operate cloud infrastructure (Azure preferred) and HPC/HTC clusters (e.g., Slurm), tuning scheduling policies, autoscaling, node lifecycles, and cost-performance tradeoffs.
Build data foundations and visibility: Develop ETL pipelines to ingest metrics, logs, and results; operate databases for workflow metadata and outcomes; and build dashboards that surface efficiency, utilization, and reliability trends.
Drive operational excellence: Establish monitoring and alerting, lead incident response and postmortems, maintain runbooks, and produce clear, durable documentation.
You might thrive in this role if you have:
Familiarity with chip development workflows and at least one deep EDA domain (e.g., DV, PD, emulation, or formal verification).
Strong infrastructure fundamentals, including cloud platforms, networking, security, performance, and automation.
Experience operating cloud environments (Azure preferred; AWS, GCP, or OCI acceptable) with strong infrastructure-as-code practices (e.g., Terraform, Bicep; configuration management tools a plus).
Strong programming skills (Python preferred) and solid software engineering and scripting practices.
Experience building and operating CI/CD systems (e.g., Jenkins, Buildkite, GitHub Actions), including testing and release workflows.
Database experience (e.g., Postgres or MySQL), including schema design, migrations, indexing, and operational safety.
Clear communication and strong judgment, with the ability to explain tradeoffs, propose pragmatic solutions, and articulate a realistic vision for scalable infrastructure.
Preferred Qualifications
Experience operating Slurm or other large-scale cluster schedulers.
Experience with enterprise authentication and directory services (e.g., Entra ID, LDAP, FreeIPA, SSSD).
Experience building or operating backend and middleware systems such as message queues, caches, artifact stores, or internal service platforms.
Familiarity with high-performance storage architectures and data movement optimization.
Experience running and monitoring license servers for expensive or capacity-constrained toolchains.
To comply with U.S. export control laws and regulations, candidates for this role may need to meet certain legal status requirements as provided in those laws and regulations.
Hardware Development Infrastructure Engineer • San Francisco