Build Gallup's observability foundation and shift how we detect, respond to and prevent system issues before they affect customers.
As a founding member of Gallup’s new site reliability engineering team, you’ll define and scale our observability strategy across engineering and bring reliability engineering principles — automation, observability and continuous improvement — to everything we build. You’ll unify different teams’ monitoring solutions into a cohesive, proactive approach, consolidate our tooling, build automated workflows and establish processes that help us catch problems before they become incidents.
In this role, you’ll shape Gallup’s global technology platform to ensure the systems delivering analytics and insights to millions remain fast, resilient and always available. If you’re eager to drive resilience in systems that empower people and organizations worldwide, this is your opportunity — apply today.
What You’ll Do
- Establish the foundation of Gallup’s SRE function by defining standards, best practices and scalable systems that will grow with the organization
- Build and evolve observability infrastructure using tools like Dynatrace, Datadog, Grafana and PagerDuty to monitor applications running on AWS
- Design and implement automated alerting workflows that integrate directly with Slack
- Establish incident response processes that integrate monitoring, alerting and team communication to reduce recovery time and improve service continuity
- Create dashboards and metrics that give engineering teams real-time insight into application performance and system reliability
- Identify opportunities for automation and design self-healing systems in partnership with DevOps engineers
- Enable end-to-end monitoring and faster issue detection by partnering with application teams to embed observability into Java, .NET and Python services
- Lead initiatives that help engineering teams adopt and use observability tools effectively
- Identify patterns in system behavior that indicate potential issues before they affect customers
What Makes You Stand Out
Observability expertise : You've built or scaled monitoring and observability practices, not just maintained existing systems.Tool consolidation experience : You've successfully unified fragmented monitoring solutions across multiple teams.AI mindset : You reduce repetitive operational work through thoughtful automation and workflow design.Incident response leadership : You've designed or improved incident management processes and know how to balance speed with thoroughness.Communication and enablement : You go beyond building dashboards; you guide others in how to instrument their code and interpret metrics.What You Need
Bachelor's degree in computer science, MIS or a related field, or equivalent experience, requiredAt least three years of experience in site reliability engineering, DevOps or infrastructure roles with a focus on monitoring and observability requiredExperience with observability and monitoring tools such as Dynatrace (preferred), Datadog, Grafana or similar platforms requiredExperience with incident management tools like PagerDuty or similar alerting systems requiredStrong understanding of AWS cloud infrastructure and how to monitor distributed systems requiredExperience integrating monitoring and alerting systems with collaboration platforms like Slack requiredAbility to work with application teams across multiple languages and frameworks (e.g., Java, .NET, Python) requiredKnowledge of metrics, logging and tracing as pillars of observability requiredExperience writing scripts or automation (e.g., Python, Bash, PowerShell) to support monitoring workflows requiredExperience with containerized applications and infrastructure as code preferredA commitment to working on-site at Gallup’s San Francisco office at least three days a week requiredAbout Gallup
At Gallup, we change the world, one client at a time, through extraordinary analytics and advice on everything important facing humankind.
Gallup offers a robust benefits package that includes medical, dental, vision, life and other insurance options; a fully vested 401(k) retirement savings plan with company matching; an employee stock ownership program; mass transit reimbursement; family-building benefits; an employee assistance program; and various reimbursements and activities that enhance our associates’ wellbeing. We also offer an estimated annual salary range of $150,000-$200,000 for this role. Salaries are based on a variety of factors, including an individual’s education, experience and skills.
Gallup is an equal opportunity employer. We consider all qualified applicants without regard to race, color, religion, sex, national origin, disability, protected veteran status, sexual orientation, gender identity, or any other legally protected basis, in accordance with applicable law.
To review Gallup’s Privacy Statement, please click this link : https : / / www.gallup.com / privacy . This privacy policy is meant to help you understand what information we collect, why we collect it, and how you can update, manage and delete your information. Your application and the information you provide will be processed and stored in the United States.
#LI-Hybrid
#LI-KW1