Talent.com
Mid-Level Software Engineer, Reliability
Mid-Level Software Engineer, ReliabilityJobright.ai • San Francisco, CA, US
[error_messages.no_longer_accepting]
Mid-Level Software Engineer, Reliability

Mid-Level Software Engineer, Reliability

Jobright.ai • San Francisco, CA, US
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Mid-Level Software Engineer, Reliability

Join to apply for the

Mid-Level Software Engineer, Reliability

role at

Jobright.ai

Mid-Level Software Engineer, Reliability

2 days ago Be among the first 25 applicants

Join to apply for the

Mid-Level Software Engineer, Reliability

role at

Jobright.ai

Get AI-powered advice on this job and more exclusive features.

Jobright is an AI-powered career platform that helps job seekers discover the top opportunities in the US. We are NOT a staffing agency. Jobright does not hire directly for these positions. We connect you with verified openings from employers you can trust.

Job Summary

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. They are seeking experienced engineers to ensure their systems scale while maintaining performance and reliability. The role involves designing solutions for infrastructure scalability, collaborating with development teams, implementing monitoring systems, and participating in an on-call rotation to ensure system availability.

Responsibilities

  • Design and implement solutions to ensure the scalability of our infrastructure to meet rapidly increasing demands.
  • Collaborate with development teams to make the systems they design and operate more reliable.
  • Implement and manage monitoring systems to proactively identify issues and anomalies in our production environment.
  • Develop and maintain service level objectives (SLOs) and service level indicators (SLIs) to measure and ensure system reliability.
  • Implement fault-tolerant and resilient design patterns to minimize service disruptions.
  • Build and maintain automation tools to streamline repetitive tasks and improve system reliability.
  • Partner with researchers, engineers, product managers, and designers to bring new features and research capabilities to the world.
  • Participate in an on-call rotation to respond to critical incidents and ensure 24 / 7 system availability.

Qualifications

Required

  • Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent work experience).
  • Proven experience as an reliability engineer or a similar role in a fast-paced, rapidly scaling company.
  • Strong proficiency in cloud infrastructure.
  • Proficiency in programming / scripting languages.
  • Experience with containerization technologies and container orchestration platforms like Kubernetes.
  • Knowledge of IaC tools such as Terraform or CloudFormation.
  • Excellent problem-solving and troubleshooting skills.
  • Strong communication and collaboration skills.
  • Experience with observability tools such as DataDog, Prometheus, Grafana, Splunk and ELK stack.
  • Experience with microservices architecture and service mesh technologies.
  • Knowledge of security best practices in cloud environments.
  • Company

    OpenAI creates artificial intelligence technologies to assist with tasks and provide support for human activities. Founded in 2015, the company is headquartered in San Francisco, California, USA, with a team of 201-500 employees. The company is currently Growth Stage. OpenAI has a track record of offering H1B sponsorships.

    Seniority level

    Seniority level Mid-Senior level

    Employment type

    Employment type Full-time

    Job function

    Industries Software Development

    Referrals increase your chances of interviewing at Jobright.ai by 2x

    Inferred from the description for this job

    Medical insurance

    Vision insurance

    401(k)

    Get notified when a new job is posted.

    Sign in to set job alerts for "Software Engineer" roles.

    San Francisco, CA $160,000.00-$180,000.00 2 weeks ago

    San Francisco, CA $130,000.00-$238,000.00 2 weeks ago

    San Francisco, CA $150,000.00-$250,000.00 2 weeks ago

    Full-Stack Software Engineer (Jr / Mid level)

    San Francisco, CA $120,000.00-$180,000.00 1 month ago

    San Francisco, CA $150,000.00-$230,000.00 3 months ago

    San Francisco, CA $57.00-$61.00 4 days ago

    San Francisco, CA $150,000.00-$283,000.00 2 weeks ago

    San Francisco, CA $57.00-$61.00 4 days ago

    Software Development Engineer I - Frontend & Mobile

    San Francisco, CA $99,500.00-$200,000.00 2 weeks ago

    San Francisco, CA $150,000.00-$176,000.00 3 months ago

    San Francisco, CA $120,000.00-$190,000.00 9 months ago

    Software Engineer, AI Intern (Summer 2026)

    San Francisco, CA $57.00-$61.00 4 days ago

    San Francisco, CA $130,000.00-$140,000.00 2 weeks ago

    Software Engineer, AI Intern (Winter 2026)

    San Francisco, CA $57.00-$61.00 4 days ago

    San Francisco, CA $125,000.00-$175,000.00 2 months ago

    San Francisco, CA $163,200.00-$223,200.00 2 weeks ago

    Software Engineer, Frontend (All Levels)

    San Francisco, CA $99,500.00-$200,000.00 2 weeks ago

    San Francisco, CA $165,000.00-$165,000.00 2 years ago

    San Francisco, CA $120,000.00-$200,000.00 2 years ago

    San Francisco, CA $140,000.00-$280,000.00 8 months ago

    Alameda, CA $130,000.00-$160,000.00 2 months ago

    We're unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

    J-18808-Ljbffr

    [job_alerts.create_a_job]

    Reliability Engineer • San Francisco, CA, US

    [internal_linking.similar_jobs]
    Senior Technology Site Reliability Engineer

    Senior Technology Site Reliability Engineer

    Cooley LLP • San Francisco, CA, United States
    [job_card.full_time]
    Senior Technology Site Reliability Engineer.Cooley is seeking a Senior Site Reliability Engineer to join the.Infrastructure & Development Operations. The Senior Technology Site Reliability Engineer(...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Software Engineer - Growth

    Software Engineer - Growth

    Bitgo • San Francisco, California, United States
    [job_card.full_time]
    BitGo is the leading infrastructure provider of digital asset solutions, delivering custody, wallets, staking, trading, financing, and settlement services from regulated cold storage.Since our foun...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Software Engineer, Mid-Level

    Software Engineer, Mid-Level

    Jobright.ai • Menlo Park, CA, United States
    [job_card.full_time]
    Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features.Jobright is an AI-powered career platform that helps job seekers discover the top opportunities in the...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Software Engineer (Mid-Level)

    Software Engineer (Mid-Level)

    Chestnutfi • San Francisco, CA, United States
    [job_card.full_time]
    Chestnut is building the first AI-native operating system for insurance distribution by transforming how the $1T+ insurance industry allocates its largest spend : sales and distribution.Backed by a1...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Software Engineer, Site Reliability

    Software Engineer, Site Reliability

    Fireworks AI • Redwood City, CA, United States
    [job_card.full_time]
    Get AI-powered advice on this job and more exclusive features.Here at Fireworks, we're building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highe...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer - Backend (Modeling)

    Senior Software Engineer - Backend (Modeling)

    Windfall • San Francisco, California, United States
    [job_card.full_time]
    As a Senior Backend Engineer on our Modeling team at Windfall, you will be the architect and builder of the core infrastructure that powers our machine learning and AI initiatives.You will work in ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Software Engineer, IoT Reliability

    Software Engineer, IoT Reliability

    AirGarage • San Francisco, CA, United States
    [job_card.full_time]
    AirGarage is seeking a Software Engineer to own the reliability, health, and observability of our nationwide IoT device fleet. You will work with embedded systems, backend infrastructure, and site r...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Lead Software Engineer- Middleware Reliability Engineering

    Lead Software Engineer- Middleware Reliability Engineering

    Visa • Foster City, CA, US
    [job_card.full_time]
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer - ML Infrastructure

    Senior Software Engineer - ML Infrastructure

    Plaid • San Francisco, CA, US
    [job_card.full_time]
    Plaid is evolving into an AI-first company, where data and machine learning are the key enablers of smarter, more secure insight products built on top of Plaid’s vast financial data network.T...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Site Reliability Engineering

    Site Reliability Engineering

    Forhyre • San Francisco, CA, US
    [job_card.full_time]
    Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to improve our platform through the ever-changin...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Site Reliability Engineer - Platform

    Senior Site Reliability Engineer - Platform

    Quizlet • San Francisco, CA, US
    [job_card.full_time]
    At Quizlet, our mission is to help every learner achieve their outcomes in the most effective and delightful way.Our $1B+ learning platform serves tens of millions of students every month, in...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Software Engineer, Machine Learning Infrastructure

    Software Engineer, Machine Learning Infrastructure

    Datologyai • Redwood City, California, United States
    [job_card.full_time]
    Companies want to train their own large models on their own data.The current industry standard is to train on a random sample of your data, which is inefficient at best and actively harmful to mode...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Software Engineer (Site Reliability Engineer)

    Software Engineer (Site Reliability Engineer)

    Anyscale • San Francisco, CA, United States
    [job_card.full_time]
    Software Engineer (Site Reliability Engineer).Software Engineer (Site Reliability Engineer).At Anyscale, we're on a mission to democratize distributed computing and make it accessible to software d...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Reliability Engineer

    Reliability Engineer

    Robust.ai • San Carlos, CA, US
    [job_card.full_time]
    Robust AI is a fast-growing, early-stage startup founded in 2019 by an unsurpassed team of veterans in robotics, AI and business. We are a collaborative group with a wide range of backgrounds and pe...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Software Engineer, Reliability

    Software Engineer, Reliability

    OpenAI • San Francisco, CA, United States
    [job_card.full_time]
    Join the engineering teams that bring OpenAI’s ideas safely to the world!!.The Applied Engineering team works across research, engineering, product, and design to bring OpenAI’s technology to consu...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Software Engineer - Reliability

    Software Engineer - Reliability

    xAI • San Francisco, CA, United States
    [job_card.full_time]
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Software Engineer, Site Reliability Engineering

    Software Engineer, Site Reliability Engineering

    WisdomAI • San Mateo, CA, US
    [job_card.full_time]
    WisdomAI has the mission to provide access and insights from data to everyone.We believe in the power of data to drive better decisions and we believe with Generative AI, there is an opportunity to...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Site Reliability Engineer I

    Site Reliability Engineer I

    Prosper • San Francisco, CA, US
    [job_card.full_time]
    As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...[show_more]
    [last_updated.last_updated_30] • [promoted]