Talent.com
Principal Database Reliability Engineer
BEDI Partnerships • Austin, Texas, USA

Join Udemy. Help define the future of learning.

Udemy is an AI-powered skills acceleration platform built to help people and teams grow. It's personalized, practical, and focused on real-world impact.

Our mission is simple: to transform lives through learning. Your work helps people around the world build skills they can use, whether they're picking up something new or leveling up to stay ahead.

Over 80 million learners and 17,000 businesses already learn with Udemy. If you're excited by change, energized by learning, and ready to have a real impact, you'll feel right at home.

Learn more about us on our company page.

Principal Database Reliability Engineer

About this role:

As part of Udemy's Platform team, the Datastore Infrastructure (DSI) team is responsible for overseeing all aspects of Databases (MySQL, Aurora, DynamoDB), Message Queues (RabbitMQ), Streaming (Kafka), and Caching (Redis, Memcache) in our infrastructure. This includes ensuring uptime, security and compliance, observability, and performance; improving developer productivity; and developing future growth strategies. The team is split between EU and US regions. You will play a vital role in overseeing the day-to-day activities and engineering strategies of DSI, ensuring that millions of students worldwide achieve greater learning and career outcomes on Udemy. We value teamwork, a good sense of humor, strong ownership, technological curiosity, and a desire to learn.

To be successful in this role, you will collaborate closely with engineering, product, and a diverse set of stakeholders around the world. You are not just interested in maintaining systems but also in writing the software that maintains them. You strongly believe in a no-blame culture and advocate for humane on-call practices. You constantly seek opportunities for improvement and thrive in an environment where you can drive positive change.

What you'll be doing:

  • Lead improvement projects for our datastores and platform teams to align with the company's long-term objectives.
  • Maintain infrastructure uptime, monitor performance, and ensure infrastructure continues scaling as we grow.
  • Develop immutable infrastructure patterns and automate infrastructure provisioning via code (Terraform, Python, Ansible, etc.).
  • Ensure adherence to PCI and ISO 27001 compliance as well as SOC 2 security requirements, modifying CI/CD processes when necessary and upholding policies and standards.
  • Advocate for and implement positive changes in tools and processes through healthy discussions.
  • Participate in the on-call rotation, demonstrating a systematic approach to incident management.
  • Participate in day-to-day activities, support requests, and project-related tasks for the team.
  • Contribute to documentation, maintain ticketing queues, provide project support, troubleshoot, and offer after-hours assistance as required.
  • Provide coaching and mentorship to new hires, fostering their technical growth and integration into the team. Maintain close communication with team members throughout their tenure.

What you'll have:

We do not expect you to have all of the below, but the more of these skills you have, the easier your onboarding will be.

  • 8-10 years of professional experience working in a Cloud Engineering team (or an SRE/DBRE team) with infrastructure responsibilities for managing large production workloads.
  • Proficiency with managing MySQL at scale (horizontal scaling, sharding, InnoDB optimizations, query optimization, HA/DR, monitoring, backup strategy, security, automation).
  • Strong understanding of running production workloads in Kubernetes.
  • Proficiency with tools like Terraform, Ansible, and Git, and with Infrastructure as Code and automated provisioning.
  • Strong experience in Kafka cluster management, topic configuration, performance tuning, and ensuring high availability and fault tolerance. Experience with MSK is also a plus.
  • Experience with Message Queues (MQ/SQS) and Caching (Redis, Memcache) or similar products.
  • Experience in Python.
  • Knowledge of configuration management tools, monitoring systems (Datadog or similar) for database infrastructure, and scaling strategies for handling increased data volumes.
  • Strong troubleshooting skills to diagnose complex database issues.
  • Hands-on experience with AWS cloud infrastructure and a grasp of security best practices.
  • Adaptability and comfort working in a fast-paced, hands-on environment.

Nice to have:

  • Experience with any additional programming languages (Golang, Kotlin, Java).
  • Experience in implementing CDC pipelines for reliable data replication and synchronization.
  • Experience with the Vitess Operator for running MySQL on Kubernetes.
  • Experience writing Kubernetes Helm charts.
  • Experience with tools like ArgoCD/Argo Workflows or similar alternatives in various combinations.
  • Knowledge of security standards, vulnerability patching, TLS/SSL, and related topics.
  • Any additional experience or familiarity with related technologies would be advantageous.

We understand that not everyone will match each of the above qualifications. However, we also realize that everyone has unique experiences that can add value to our company. Even if you think your background might not perfectly align, we'd love to hear from you!

Posting Date: November 05, 2025

Application window: November 05, 2025 - December 05, 2025

At Udemy, we strive to be transparent around compensation. Actual compensation for this role is based on several factors, including but not limited to job-related skills, qualifications, experience, and specific work location due to differences in the cost of labor. In addition to a base salary, this role is also eligible for equity.

Hiring Compensation Range

$184,000 - $230,000 USD

Why work here

You'll grow here.

Learning is part of the job. You'll get full access to Udemy courses, a monthly UDay to invest in yourself, and a budget to spend on whatever helps you improve. Many people are diving into AI lately, but what you focus on is up to you.

AI is real here.

We use it in the way we learn and the way we work. You'll have the space and tools to experiment, apply, and get better at using AI in practical ways.

You'll own your work.

We trust people to lead, make decisions, and follow through. You don't need to wait for permission or layers of approval to have an impact.

You'll build with others.

We collaborate openly and shape ideas together. Everyone has a voice, and good thinking is welcomed from any direction.

You'll see your impact.

What you build helps people grow their skills, change their careers, or find a path forward. You've got the experience, so why not use it to help others gain theirs?

Bring your curiosity. We'll bring the platform and the support. Let's LEARN together.

Our Benefits Start with U

Our benefits start with you and were built to provide you and your family with the protection and care you need, making it easy to access the right coverage when you need it most. Benefits vary by region, and we encourage applicants to review our Australia Benefits, India Benefits, Ireland Benefits, Mexico Benefits, Turkiye Benefits & US Benefits pages to get an understanding of some of the benefits we offer. For details on region-specific benefits, please refer to the information provided during the hiring process.

Benefits outlined are provided as a general overview and may vary depending on the location, role, and employment classification. All benefits are subject to change at the discretion of the organization and in accordance with applicable laws and policies.

At Udemy, we value diversity and inclusion and consider qualified applicants without regard to race, color, religion, sex, national origin, ancestry, age, genetic information, sexual orientation, gender identity, marital or family status, veteran status, medical condition, or disability. We understand that not everyone will match each of the qualifications. However, we also realize that everyone has unique experiences that can add value to our company. Even if you think your background might not perfectly align, we'd love to hear from you!

Information regarding data privacy is available within the Udemy Careers Privacy Notice.

Required Experience: Staff IC

Key Skills

Kubernetes, FMEA, Continuous Improvement, Elasticsearch, Go, Root Cause Analysis, Maximo, CMMS, Maintenance, Mechanical Engineering, Manufacturing, Troubleshooting

Employment Type: Full Time

Vacancy: 1
