Talent.com
Senior Technology Site Reliability Engineering Manager

Cooley LLP • New York, NY, United States

Cooley is seeking a Senior Site Reliability Engineering Manager to join the Infrastructure & Development Operations team.

Position summary: The Senior Technology Site Reliability Engineering ("SRE") Manager is responsible for leading a team of SREs to ensure the reliability, scalability, and performance of the firm's infrastructure and services. This role works with the DevOps, infrastructure, and development teams, applying engineering principles to operations in order to create scalable and resilient systems. In addition to being technically advanced, the SRE Manager will have a high degree of emotional intelligence and the ability to work as part of a team towards complex and layered objectives. Specific duties and responsibilities include, but are not limited to, the following:

Position responsibilities:

  • Define and execute the SRE strategic roadmap aligned with business goals, providing experienced leadership in developing solutions for highly scalable, highly available, hybrid cloud (IaaS, PaaS, SaaS) infrastructure patterns and platform integrations across physical colocations and hyperscalers (AWS and Azure)
  • Build and mentor a high-performing SRE team, fostering a culture of trust, collaboration, and continuous improvement
  • Partner with cross-functional leaders in infrastructure, DevOps, and application development to scale reliability practices across the enterprise
  • Oversee incident response, root cause analysis, and postmortems with a focus on accountability and learning
  • Establish and enforce service-level objectives (SLOs), service-level indicators (SLIs), and service-level agreements (SLAs)
  • Drive proactive monitoring, alerting, observability, and capacity planning
  • Lead automation initiatives for deployment, scaling, failover, and recovery
  • Promote observability practices using tools like Prometheus, Grafana, Datadog, or Splunk
  • Collaborate with development teams to build self-healing, fault-tolerant systems
  • Champion reliability-first thinking across engineering and operations
  • Encourage blameless postmortems and a learning-oriented incident culture
  • Ensure compliance with security, risk, and regulatory requirements
  • Serve as direct supervisor and mentor to direct reports
  • Provide day-to-day supervision of direct reports, ensure compliance with assigned work hours and monitor for compliance with all firm and department policies. Manage staffing coverage, review and process time logs / time off requests
  • Support business professional development and continued educational opportunities
  • In collaboration with immediate supervisor and CN HR, participate in hiring, performance appraisals, counseling, termination and other employee lifecycle events
  • All other duties as assigned or required

Skills and experience:

Required:

  • After orientation at Cooley LLP, exhibit proficiency in the Microsoft Office suite, iManage and other firm applications
  • Ability to work extended and / or weekend hours, as required
  • Ability to travel, as required
  • 7+ years' directly applicable experience (e.g., DevOps or Site Reliability Engineering) with 2+ years of exempt / management experience in relevant roles
  • Experience managing cross-functional projects and SRE planning and programming
  • Proficiency in Terraform and programming languages such as Python, Go, or Java
  • Deep expertise in cloud platforms, particularly AWS, and container orchestration
  • Strong background in distributed systems, performance tuning, and automation
  • Hands-on experience with configuration management tools such as Puppet, Chef, or Salt

Preferred:

  • Bachelor's Degree in Computer Science, Information Technology, Engineering, or associated discipline
  • Experience working with advanced ETL data workflows including technologies such as AWS EMR, Azure Synapse, Azure Data Factory, or Apache Hive / Spark / Airflow
  • Experience with IaC deployment of AKS / EKS / GKE architecture
  • Experience with enterprise Data Lake environments using technologies such as Databricks or Snowflake

Competencies:

  • Expert analytical / quantitative, problem-solving, and deductive reasoning skills, with demonstrated experience performing advanced troubleshooting and root cause analysis of complex technical issues
  • Excellent organizational, planning, and time management skills and ability to work either independently or in a team environment to manage competing priorities and meet deadlines
  • Advanced verbal and written communication skills with the ability to present findings, conclusions, alternatives, and information clearly and concisely
  • Experience working with all levels of business professionals, management, stakeholders, and vendors with demonstrated ability to build effective relationships through trust and diplomacy

Cooley offers a competitive compensation and excellent benefits package and is committed to fair and equitable employment practices.

EOE.

The expected annual pay range for this position with a full-time schedule is $165,000 - $235,000. Please note that final offer amount will be dependent on geographic location, applicable experience and skillset of the candidate.

We offer a full range of elective benefits including medical, health savings account (with applicable medical plan), dental, vision, health and / or dependent care flexible spending accounts, pre-tax commuter benefits, life insurance, AD&D, long-term care coverage, backup care for children and / or adults and other parental support benefits. In addition to elective benefit options, benefited employees receive firm-paid life insurance, AD&D, LTD, short term medical benefits as well as 21 days of Paid Time Off ("PTO") and 10 paid holidays each year. We provide generous parental leave and fertility benefits. New employees will attend a detailed benefit orientation to learn more about our many benefits and resources.
