Talent.com
Principal Site Reliability Engineer
Principal Site Reliability EngineerEarly Warning Services LLC • San Francisco, CA, United States
Principal Site Reliability Engineer

Principal Site Reliability Engineer

Early Warning Services LLC • San Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Positions located in Scottsdale, San Francisco, Chicago, or New York follow a hybrid work model to allow for a more collaborative working environment.Candidates responding to this posting must independently possess the eligibility to work in the United States, for any employer, at the date of hire. This position is ineligible for employment Visa sponsorship.

  • Overall Purpose
  • The Principal Site Reliability Engineer partners with development teams by designing availability and resiliency patterns in applications and infrastructure.
  • Essential Functions :
  • Design and Implement software and tools to improve the performance - availability, scalability, and latency, while delivering end products to customer with the highest efficiency and meeting all security standards.
  • Supports the company’s commitment to risk management and protecting the integrity and confidentiality of systems and data.
  • Build automation and tooling around application management, such as deployments, configuration changes and disaster recovery scenarios.
  • Design, Implement and evangelize Observability and monitoring systems to proactively detect problems and identify cause.
  • Evaluate capacity of the application on a continuous basis to provide stats to the Product / Business teams and recommend an efficient path to scale for future needs.
  • Identify performance bottlenecks and work with cross-functional teams to troubleshoot and resolve issues.
  • Serve as a technical liaison for the application and provide documents and runbooks to Level 1 and Level 2 teams.
  • Participate in 24 X 7 on-call rotation.
  • Be a champion of excellent processes; take the initiative in developing repeatable patterns and standard, re-usable work across teams.
  • Work directly with application development teams to provide feedback and technical requirements to the software development lifecycle, implementing best-practice microservice design patterns and other modern software development approaches.
  • Understand and support the adoption of best-practice microservice design patterns and other modern software reliability approaches and techniques.
  • Be a thought leader : a senior point of expertise on site reliability engineering issues, industry trends and developing technologies. Be a role model to others on the team. Coach and mentor team members.
  • Minimum Qualifications
  • Education and experience typically obtained through completion of a Bachelor’s Degree in Business and / or Computer Science or related field.
  • 12+ years of related experience managing large complex projects in a technical or software development environment inclusive of post-graduate degree
  • Proven ability to lead a team through high priority Incidents and improve the RCA proces
  • Excellent troubleshooting skills and proven experience resolving technical issues in complex environments
  • Hands-on experience in designing and developing using the one or more of the following technologies - Python, Go, Java - Docker - Experience in Microservices Architecture. - Messaging frameworks such as Kafka, SQS or JMS - Database Technologies like Oracle, Dynamo DB, Aurora etc.. - Caching layers such as Redis and memcached
  • Strong understanding of Linux administration
  • Experience with CI / CD pipeline implementation including GIT, Chef, Maven, Jenkins etc
  • Strong understanding of networking fundamentals
  • Experience in leading cross-functional teams to create technical solutions.
  • Proven track record designing and building complex end-to-end systems (full stack developer)
  • Background and drug screen
  • Preferred Qualifications
  • Good programming skills in one or more of the following languages : Java, ruby, python, JavaScript and GO
  • Hands-on experience in supporting applications in a 24X7 customer-facing production environment.
  • Working knowledge of AWS, Docker, Kubernetes, SwarmThe base pay scale for this position in : Phoenix, AZ / Chicago, IL in USD per year is : $172,000 - $215,000. New York, NY / San Francisco, CA in USD per year is : $206,000 - $258,000. Additionally, candidates are eligible for a discretionary incentive plan and benefits.
  • Physical Requirements
  • Employee must be able to perform essential functions and physical requirements of position with or without reasonable accommodation.Candidates responding to this posting must independently possess the eligibility to work in the United States at the date of hire.Some of the Ways We Prioritize Your Health and Happiness
  • Healthcare Coverage –Competitive medical (PPO / HDHP), dental, and vision plans as well as company contributions to your Health Savings Account (HSA) or pre-tax savings through flexible spending accounts (FSA) for commuting, health & dependent care expenses.
  • 401(k) Retirement Plan –Featuring a 100% Company Safe Harbor Match on your first 6% deferral immediately upon eligibility.
  • Paid Time Off – Unlimited Time Off for Exempt (salaried) employees, as well as generous PTO for Non-Exempt (hourly) employees, plus 11 paid company holidays and a paid volunteer day.
  • 12 weeks of Paid Parental Leave
  • Maven Family Planning – provides support through your Parenting journey including egg freezing, fertility, adoption, surrogacy, pregnancy, postpartum, early pediatrics, and returning to work. And SO much more! We continue to enhance our program, so be sure to for the latest. Our team can share more during the interview process!Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
  • CURRENT EMPLOYEES : Apply for open positions via Job Hub in your Workday Account.
  • for an assistance request.E-Verify
  • ## Privacy Notice
  • Effective :
  • May 2, 2025
  • This privacy notice is intended to inform California residents of the personal information we collect, how it’s used and disclosed, and the rights you have in regard to such information.Click below for the full privacy notice

#J-18808-Ljbffr

[job_alerts.create_a_job]

Site Reliability Engineer • San Francisco, CA, United States

[internal_linking.related_jobs]
Principal Site Reliability Engineer

Principal Site Reliability Engineer

Harrison Clarke • San Francisco, CA, US
[job_card.full_time]
Harrison Clarke are working with several high profile companies that are seeking a Principal Site Reliability Engineer (SRE) , to lead the design, implementation, and scaling of the infrastructur...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Technology Site Reliability Engineer

Senior Technology Site Reliability Engineer

Cooley LLP • San Francisco, CA, United States
[job_card.full_time]
Senior Technology Site Reliability Engineer.Cooley is seeking a Senior Site Reliability Engineer to join the.Infrastructure & Development Operations. The Senior Technology Site Reliability Engineer(...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

gamma.app • San Francisco, CA, United States
[job_card.full_time]
We're building the creative layer for modern communication.Every month, over a billion people make presentations — but the tools they use to make them haven't evolved in decades.We're changing that...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Site Reliability Engineer

Site Reliability Engineer

Together • San Francisco, CA, US
[job_card.full_time]
As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a soft...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Together AI • San Francisco, CA, United States
[job_card.full_time]
As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a soft...[show_more]
[last_updated.last_updated_30] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Alchemy • San Francisco, CA, United States
[job_card.full_time]
Our mission is to bring web3 to a billion people, by providing builders with the tools they need to build exceptional onchain products. Alchemy is the only complete developer platform that offers th...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff / Principal Site Reliability Engineer

Staff / Principal Site Reliability Engineer

The Resume Database • Redwood City, CA, United States
[job_card.full_time]
Staff / Principal Site Reliability Engineer.Staff / Principal Site Reliability Engineer.You’ll architect scalable solutions, navigate complex technical challenges independently, and deliver results und...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Clay • San Francisco, California, United States
[job_card.full_time]
Site Reliability Engineer Join to apply for the Site Reliability Engineer role at Clay.About Clay Clay is a creative tool for growth. Our mission is to help businesses grow — without huge investment...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Alembic Technologies • San Francisco, CA, United States
[job_card.full_time]
Senior Site Reliability Engineer.This range is provided by Alembic Technologies.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.We’re looking fo...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Canonical • San Francisco, CA, United States
[job_card.full_time]
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...[show_more]
[last_updated.last_updated_30] • [promoted]
Principal Site Reliability Engineer

Principal Site Reliability Engineer

Early Warning® • San Francisco, CA, United States
[job_card.full_time]
At Early Warning, we’ve powered and protected the U.Zelle®, Paze℠, and so much more.As a trusted name in payments, we partner with thousands of institutions to increase access to financial services...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Speak • San Francisco, CA, United States
[job_card.full_time]
Our mission is to reinvent the way people learn, starting with language.Learning a language can change a life by opening doors to new cultures, careers, and communities. Two billion people around th...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer I

Site Reliability Engineer I

Prosper • San Francisco, CA, United States
[job_card.full_time]
As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Hive • San Francisco, CA, United States
[job_card.full_time]
Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...[show_more]
[last_updated.last_updated_30] • [promoted]
Infrastructure Site Reliability Engineer (Local only)

Infrastructure Site Reliability Engineer (Local only)

Maxonic Inc. • San Francisco, CA, United States
[job_card.full_time]
Maxonic maintains a close and long-term relationship with our direct client.In support of their needs, we are looking for an. Infrastructure Site Reliability Engineer.Job Title : Infrastructure Site ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

ConductorOne • San Francisco, California, United States
[job_card.full_time]
We’re a hyper-creative, fast-moving team building the future of identity security.If transforming an industry and securing the world’s top companies excites you, we’d love to have you along for the...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Staff / Principal Site Reliability Engineer

Staff / Principal Site Reliability Engineer

Veza • San Francisco, CA, US
[job_card.full_time]
Staff / Principal Site Reliability Engineer We are seeking an exceptional Staff / Principal Site Reliability Engineer to lead critical infrastructure initiatives and drive Innovation across our organiz...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Site Reliability Engineer

Site Reliability Engineer

Cypress HCM • San Francisco, CA, United States
[job_card.full_time]
This range is provided by Cypress HCM.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. As a Site Reliability Engineer (Contractor), you will be a...[show_more]
[last_updated.last_updated_variable_days] • [promoted]