Cloud Infrastructure Management: Design, implement, and manage cloud-based infrastructure on AWS and Azure, ensuring optimal scalability, performance, and security.
CI/CD Pipeline Development: Develop and maintain CI/CD pipelines using GitHub Actions for automated code deployments and testing.
System Monitoring and Incident Management:
Implement and configure Datadog for comprehensive system monitoring.
Develop and maintain Datadog dashboards to visualize system performance and metrics.
Set up proactive alerts in Datadog to detect and respond to incidents swiftly, ensuring high system reliability and uptime.
Conduct root cause analysis of incidents and implement corrective actions using Datadog insights.
Collaboration with AI Teams: Work closely with AI teams to support the operational aspects of LLMs, including deployment strategies and performance tuning.
Infrastructure as Code (IaC): Implement IaC using tools like Terraform or AWS CloudFormation to automate infrastructure provisioning and management.
Container Orchestration: Manage container orchestration systems such as Kubernetes or AWS ECS.
Operational Support for LLMs: Provide operational support for LLMs, focusing on performance optimization and reliability.
Scripting and Automation: Utilize scripting languages such as Python and Bash for automation and task management.
Security and Compliance: Ensure compliance with security standards and best practices, implementing robust security measures.
Documentation: Document system configurations, procedures, and best practices for internal and external stakeholders.
DevOps Collaboration: Work with development teams to optimize deployment workflows, introduce best practices for DevOps, and improve overall efficiency.
Technology and Industry Awareness: Stay up to date with emerging technologies and industry trends to suggest improvements and upgrades.
Qualifications and Skills Required:
Extensive experience with AWS and Azure cloud platforms.
Proficiency in developing CI/CD pipelines using GitHub Actions.
Strong experience with Datadog for system monitoring, including implementation, configuration, and maintenance.
Demonstrated ability to create and maintain Datadog dashboards for performance visualization.
Proven expertise in setting up alerts and conducting incident response with Datadog.
Hands-on experience with container orchestration systems such as Kubernetes or AWS ECS.
Proficiency in Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation.
Familiarity with operational aspects of Large Language Models (LLMs) is highly desirable.
Strong scripting skills in Python, Bash, or similar languages.
In-depth knowledge of security standards and best practices.
Excellent documentation skills.
Proven ability to work collaboratively with development and AI teams.
Commitment to staying current with industry trends and emerging technologies.
Engineer • Plano, Texas, United States