Talent.com
Systems Reliability Engineer (SRE), Edge
Systems Reliability Engineer (SRE), EdgeCloudflare, Inc. • San Francisco, CA, United States
Systems Reliability Engineer (SRE), Edge

Systems Reliability Engineer (SRE), Edge

Cloudflare, Inc. • San Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

About Us

At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine's Top Company Cultures list and ranked among the World's Most Innovative Companies by Fast Company.

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us!

Available Locations : Austin

About the Role

We are looking for talented Systems Reliability Engineers to build and operate our Edge platform running in more than 320 cities in over 120 countries. Our SREs come from diverse technical backgrounds and have built up their knowledge working in different environments, but common factors across all of our reliability-focused engineers include a passion for automation, scalability, and operational excellence. We support our services in a "follow the sun" model with offices in East Asia, Europe and North America.

This is a superb opportunity to join a high-performing team and scale our high-growth network as Cloudflare's business grows. We live at the boundary between systems, network, and software, and love improving the glue that holds them together. Working with us, you will build tools to constantly improve service availability, performance, and operational velocity. You will nurture a passion for an "automate everything" approach that makes systems failure resistant and ready to scale.

SREs focus on the immediate state and functionality of the Cloudflare platform around the world, leveraging an array of monitoring, alerting and diagnostics tools while developing and enhancing the Cloudflare platform and its capabilities. We own a wide portfolio of applications and services, running a tight feedback loop of developer and operator patterns. The ideal SRE candidate has a passionate curiosity about how the Internet fundamentally works and has a strong knowledge of networking, Linux and TLS along with coding ability in Go, Rust, or Python.

Requisite Skills

  • Aptitude for identifying problems, owning them and working with others to solve them
  • Linux systems experience
  • 3 years experience in an SRE role or a role with similar functions
  • Software development skills in some programming language such as Go, Rust, or Python
  • Understanding of distributed software systems and large scale system design tradeoffs
  • Intermediate experience of common network protocols like DNS and HTTP

Examples of desirable skills, knowledge and experience

  • Experience with the Linux kernel and Linux software packaging
  • Performance analysis and debugging
  • Configuration management systems such as Saltstack, Chef, Puppet or Ansible
  • Workflow automation systems such as Temporal or Apache Airflow
  • Load balancing and reverse proxies such as Nginx, Varnish, HAProxy, Squid or Apache
  • SQL databases
  • Time series databases such as OpenTSDB, Graphite, Prometheus or Grafana
  • Key / Value stores
  • Internetworking and BGP
  • Bonus Points

  • Experience with continuous / rapid release engineering
  • Strong tooling and automation development experience
  • Experience working in a 24 / 7 / 365 service environment
  • Experience working with large scale production distributed systems
  • A history of contributing to Open Source Software
  • Some tools that we use

  • Nginx
  • PostgreSQL
  • Docker
  • Prometheus
  • Grafana
  • Consul
  • Nomad
  • Temporal
  • Salt
  • What Makes Cloudflare Special?

    We're not just a highly ambitious, large-scale technology company. We're a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better internet is protecting the free and open Internet.

    Project Galileo

    Since 2014, we've equipped more than 2,400 journalism and civil society organizations in 111 countries with powerful tools to defend themselves against attacks that would otherwise censor their work, technology already used by Cloudflare's enterprise customers–at no cost.

    Athenian Project

    In 2017, we created the Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration. Since the project, we've provided services to more than 425 local government election websites in 33 states.

    1.1.1.1

    We released 1.1.1.1 to help fix the foundation of the Internet by building a faster, more secure and privacy-centric public DNS resolver. This is available publicly for everyone to use – it is the first consumer-focused service Cloudflare has ever released. Here's the deal – we don't store client IP addresses never, ever. We will continue to abide by our privacy commitment and ensure that no user data is sold to advertisers or used to target consumers.

    Sound like something you'd like to be a part of? We'd love to hear from you!

    This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

    Cloudflare is proud to be an equal opportunity employer. We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness. All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law. We are an AA / Veterans / Disabled Employer.

    Cloudflare provides reasonable accommodations to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment. If you require a reasonable accommodation to apply for a job, please contact us via e-mail at hr@cloudflare.com or via mail at 101 Townsend St. San Francisco, CA 94107.

    #J-18808-Ljbffr

    [job_alerts.create_a_job]

    Engineer Sre • San Francisco, CA, United States

    [internal_linking.similar_jobs]
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Gridware • San Francisco, CA, US
    [job_card.full_time]
    Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid.We pioneered a groundbreaking new class of grid management called active grid response...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Technology Site Reliability Engineer

    Senior Technology Site Reliability Engineer

    Cooley LLP • San Francisco, CA, United States
    [job_card.full_time]
    Senior Technology Site Reliability Engineer.Cooley is seeking a Senior Site Reliability Engineer to join the.Infrastructure & Development Operations. The Senior Technology Site Reliability Engineer(...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Systems Engineer

    Systems Engineer

    Renegade • San Francisco, CA, US
    [job_card.full_time]
    Renegade is building an unstoppable network for the anonymous exchange of value.Our core permissionless protocol, the Renegade dark pool, solves many problems in current decentralized exchange desi...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Systems Reliability Engineer (SRE) - Edge

    Systems Reliability Engineer (SRE) - Edge

    Cloudflare • San Francisco, CA, United States
    [job_card.full_time]
    Systems Reliability Engineer (SRE) - Edge.At Cloudflare, we are on a mission to help build a better Internet.Today the company runs one of the world’s largest networks that powers millions of websi...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    SS&C Technologies • San Francisco, CA, United States
    [job_card.full_time]
    SS&C Technologies is a global investment and financial services software provider, headquartered in Windsor, Connecticut, and supporting more than 28,000 employees across 35 countries.It specialize...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Site Reliability Engineering

    Site Reliability Engineering

    Forhyre • San Francisco, CA, US
    [job_card.full_time]
    Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to improve our platform through the ever-changin...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Reliability Engineer

    Senior Reliability Engineer

    Gradient • San Francisco, CA, US
    [job_card.full_time]
    Join us at Gradient, where our purpose is to revolutionize home comfort while championing environmental sustainability.Our mission is to combat the escalating challenge of climate change by redefin...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Site Reliability Engineer - Platform

    Senior Site Reliability Engineer - Platform

    Quizlet • San Francisco, CA, US
    [job_card.full_time]
    At Quizlet, our mission is to help every learner achieve their outcomes in the most effective and delightful way.Our $1B+ learning platform serves tens of millions of students every month, in...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Zipline • South San Francisco, CA, US
    [job_card.full_time]
    Do you want to change the world? Zipline is on a mission to transform the way goods move.Our aim is to solve the world's most urgent and complex access challenges by building, manufacturing and...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Systems Engineer - Body Systems Safety and Availability

    Systems Engineer - Body Systems Safety and Availability

    Zoox • Foster City, CA, US
    [job_card.full_time]
    Zoox is looking for a systems engineer to take ownership of the safety and availability of our body systems functions and components. You will collaborate with world-class hardware, software, and ot...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Air Apps • San Francisco, CA, United States
    [job_card.full_time]
    Site Reliability Engineer (SRE).Site Reliability Engineer (SRE).Get AI-powered advice on this job and more exclusive features. At Air Apps, we believe in thinking bigger—and moving faster.We’re a fa...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior SRE : Scale Reliable Cloud Systems & Observability

    Senior SRE : Scale Reliable Cloud Systems & Observability

    Air Apps, Inc. • San Francisco, CA, United States
    [job_card.full_time]
    A leading tech company in San Francisco is seeking a Site Reliability Engineer (SRE) to ensure the reliability, availability, and scalability of systems. You will implement automation and monitoring...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior+ Site Reliability Engineer

    Senior+ Site Reliability Engineer

    Crusoe • San Francisco, CA, US
    [job_card.full_time]
    Crusoe's mission is to accelerate the abundance of energy and intelligence.We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrif...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Reliability Engineer

    Reliability Engineer

    Robust.ai • San Carlos, CA, US
    [job_card.full_time]
    Robust AI is a fast-growing, early-stage startup founded in 2019 by an unsurpassed team of veterans in robotics, AI and business. We are a collaborative group with a wide range of backgrounds and pe...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Sr. Platform Reliability Engineer

    Sr. Platform Reliability Engineer

    Oscar • San Francisco, CA, United States
    [job_card.permanent]
    This range is provided by Oscar.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. A technology company in the advanced computing space is seeking ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Site Reliability Engineer I

    Site Reliability Engineer I

    Prosper • San Francisco, CA, US
    [job_card.full_time]
    As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Software Engineer, Site Reliability Engineering

    Software Engineer, Site Reliability Engineering

    WisdomAI • San Mateo, CA, US
    [job_card.full_time]
    WisdomAI has the mission to provide access and insights from data to everyone.We believe in the power of data to drive better decisions and we believe with Generative AI, there is an opportunity to...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior SRE Engineer - Reliability & Scale

    Senior SRE Engineer - Reliability & Scale

    Roblox Corporation • San Mateo, CA, United States
    [job_card.full_time]
    A leading gaming platform is seeking a Senior Software Engineer - Site Reliability to ensure system performance, reliability, and efficiency. Responsibilities include creating resilient software, de...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]