Infrastructure DevOps Engineer
AUSTIN, TX or HERNDON, VA (hybrid, 3 days/week on-site)
contract to hire
Must Be US Citizen
Our client's Cloud Infrastructure Platform Engineering team plays a pivotal role in designing and deploying robust infrastructure solutions to support client products and services. They are an automation-first organization, prioritizing the deployment of all their cloud resources via automation to enhance efficiency and consistency. Their engineers collaborate closely with internal product teams and customer-facing units to deliver adaptable and scalable code that meets the requirements of their diverse clientele. Candidates will have the opportunity to work across various cloud providers using a wide range of Infrastructure as Code (IaC) tooling and practices, developing provider-agnostic solutions that work consistently across different platforms. Their infrastructure supports a broad client base, including local, state, and federal government agencies, as well as private sector organizations engaged in government-related missions.
RESPONSIBILITIES:
- Write, modify, and run Terraform to deploy and maintain infrastructure across multiple cloud service providers. Be able to debug errors when deploying Terraform.
- Run Ansible playbooks to perform configuration and maintenance tasks. Be able to develop new Ansible roles and troubleshoot errors as they occur.
- Use GitLab with multiple repositories; create and improve CI pipelines that use runners to maintain infrastructure.
- Configure, build, and deploy containerized services using Docker and/or Kubernetes.
- Develop standard operating procedures, knowledge base articles, technical bulletins, and other documents in support of the infrastructure.
- Operate with a security-first mindset, performing all other responsibilities with security in mind.
- Implement monitoring, configuration management, and logging capabilities to manage multi-tenant cloud infrastructure across multiple cloud service providers.
- Use generative AI elements to increase efficiency and speed, improve accuracy and consistency, enhance security, and better manage resources where practical and within security boundary guidelines.
KNOWLEDGE AND SKILLS:
- Knowledge of AWS foundational technologies (EC2, S3, IAM, Route53, VPC).
- Knowledge of Azure foundational technologies (Virtual Networks, Application Gateway, Storage Account, Virtual Machines, Load Balancer, Resource Groups).
- Knowledge of GCP foundational technologies (Projects, Compute Engine, GKE, Cloud Storage, VPC).
- Proficient in using Terraform to build modules for deploying infrastructure in AWS, Azure, and/or GCP.
- Proficient in CI/CD technologies including pipeline design and implementation using tools such as GitLab runners and ConcourseCI.
- Working knowledge of HashiCorp Vault for secret management, including secure storage and retrieval of credentials, API keys, and certificates.
- Experience with Ansible for automation, configuration management, and playbook development.
- Proficient in a scripting or programming language for automation (e.g., Bash, Python).
- Experience with RBAC design and Single Sign-On (SSO) for identity management.
- Investigation skills to review detailed infrastructure and configurations.
- Experience with logging and log processing in a cloud environment.
- Working knowledge of networking: IP routing, subnetting, network troubleshooting.
- Strong Linux OS skills (deployment, configuration, and troubleshooting).
- Knowledge of ITIL-compliant ticket management using ServiceNow, including incident resolution, change requests, and sprint planning.
MINIMUM QUALIFICATIONS:
- Bachelor’s degree in Computer Science or equivalent practical experience.
- 5-8+ years of experience with the following technologies: Terraform, Ansible, CI/CD, Secrets Management, Identity Management (SSO).
- Experience with Linux operating system internals and administration (e.g., filesystems, inodes, system calls, hardening) and networking (e.g., TCP/IP, routing, DNS, network topologies, SDN).
PREFERRED QUALIFICATIONS:
- Expertise in designing, analyzing and troubleshooting large-scale distributed systems.
- Ability to debug and optimize code and automate routine tasks.
- Systematic problem-solving approach coupled with strong communication skills and a sense of ownership and drive.
- Hands-on experience developing infrastructure as code (IaC).