Our client is currently seeking an SRE Engineer.
SRE Engineer
Location - Austin, TX (Hybrid)
Full-time role
Role Overview
We are seeking an experienced Site Reliability Engineer (SRE) with strong hands‑on expertise in Golang, Kubernetes, AWS, observability tooling, and production-grade CI/CD practices. The ideal candidate is a self‑driven engineer who can independently deliver high‑quality, tested code and collaborate effectively within cross‑functional teams.
Responsibilities
·Design, develop, and maintain scalable services written in Golang.
·Deliver clean, performant, and well‑tested code with comprehensive unit test coverage.
·Build, deploy, and manage applications on Kubernetes (Deployments, StatefulSets, load balancers).
·Implement and optimize AWS cloud infrastructure aligned with best practices.
·Develop and maintain automated CI/CD pipelines to support continuous delivery.
·Configure and monitor observability stacks including Grafana, Prometheus, Alertmanager, and Loki.
·Ensure high reliability, performance, and availability of services through proactive monitoring and alerting.
·Troubleshoot production issues and drive root‑cause analysis.
·Collaborate with engineering, product, and operations teams to improve system reliability.
·Work independently and take ownership of deliverables end-to-end.
Required Qualifications (Must-Have Skills)
Technical Skills
·Golang: Strong hands‑on development experience.
·Software Quality: Ability to write high‑quality, maintainable code; strong focus on unit testing.
·Kubernetes (E2 or equivalent): Experience with deployments, statefulsets, and load balancers.
·AWS Cloud: Hands-on experience with core AWS services.
·CI/CD: Experience building and maintaining automated pipelines.
·Observability Stack: Grafana, Prometheus, Alertmanager, and Loki (log aggregation).
Professional Skills
·Strong communication skills in technical and cross‑functional environments.
·Ability to drive tasks independently with minimal supervision.
·Effective team collaboration and the ability to influence and guide others.
Preferred Qualifications (Nice to Have)
·Prior SRE or DevOps experience in large‑scale distributed systems.
·Experience supporting production systems in a hybrid environment.