Position Summary
We are seeking a highly skilled and experienced Senior Systems Engineer to join our Infrastructure Operations team. This role is ideal for someone who thrives in designing, building, and optimizing scalable private and public cloud environments using tools like Puppet, ZFS, Kubernetes and OpenStack. You will play a critical role in shaping our private and public cloud strategy, ensuring high availability, performance, and security across our infrastructure.
Key Responsibilities
- Design and implement scalable, secure, and resilient private and public cloud infrastructure using Puppet
- Lead the architecture and deployment of server infrastructure, applications and services using puppet.
- Develop and maintain deployment playbooks on ansible and puppet foreman to support automated deployment and testing.
- Monitor system performance, troubleshoot issues, and ensure system reliability and uptime.
- Collaborate with DevOps, Security, and Application teams to align infrastructure with business goals.
- Document infrastructure designs, processes, and procedures.
- Mentor junior engineers and provide technical leadership across projects.
Required Qualifications
8+ years of experience in systems engineering or cloud infrastructure roles.Strong hands-on experience with OpenStack (compute, storage, networking).Proven experience designing and managing large-scale cloud environments.Expert in Linux systems (RHEL, Ubuntu) with deep knowledge of networking, security, and performance tuning.Must have Strong Puppet skills, including module development, Hiera, custom types / providers, and Puppet DB.Infrastructure as Code mindset, with CI / CD integration and code testing (e.g., rspec-puppet, r10k).Architects scalable systems, with experience in HA, load balancing, and distributed environments.Broad automation experience, including Bash, Python, Ansible, or Terraform.Skilled in monitoring / logging, using tools like Prometheus, Grafana, ELK, or Splunk.Version control expertise, especially with Git and managing config repos for Puppet and infrastructure.Security-focused, with experience in hardening systems, patching, compliance (e.g., CIS, STIG).Cloud and virtualization savvy, with exposure to AWS, GCP, VMware, or KVM-based environments.Strong troubleshooting skills, with a systematic approach to debugging complex infra issues.Preferred Qualifications
Certifications in Kubernetes (CKA / CKAD) or OpenStack or puppet will be added advantageExperience with hybrid cloud or multi-cloud environments.Familiarity with GitOps practices and tools like Jenkins, ArgoCD or Flux.Background in infrastructure cost optimization and capacity planning.