Software Engineer, Collective Communication

OpenAI • San Francisco
30+ days ago
Job type
  • Full-time
Job description

About the Team

The Workload Networking team is responsible for the collective communication stack used in our largest training jobs. Using a combination of C++ and CUDA, we work on novel collective communication techniques that enable efficient training of our flagship models on our largest custom-built supercomputers.

The models we train are key ingredients in AI research progress at OpenAI and in the field as a whole, and we continually incorporate learnings from our entire research org into our training platform.

About the Role

As a Software Engineer, Networking, you will design and implement custom networking collectives that are tightly integrated into our training stack.

We’re looking for people who have a background in low-level, performance-critical software. Experience with collective communication is a bonus.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

  • Collaborate closely with ML researchers to design and implement efficient collective operations in C++ and CUDA.

  • Ensure that our largest training jobs take full advantage of the different network transports used in our supercomputers.

  • Work on simulations to inform our future supercomputer network designs.

You might thrive in this role if you:

  • Have written distributed algorithms using RDMA in the past.

  • Are comfortable writing low-level, performance-sensitive CPU and/or GPU code.

  • Are familiar with network simulation techniques.
