Machine Learning Operations (MLOps) Engineer - AWS (with LLM Focus)

Kaav Inc. • Miramar, FL, United States
17 hours ago
Job type
  • Full-time
Job description

Responsibilities:

  • LLM-Optimized MLOps Infrastructure: Design and implement MLOps infrastructure on AWS tailored for LLMs, leveraging services like SageMaker, EC2 (with GPU instances), S3, ECS/EKS, Lambda, and more.
  • LLM Deployment Pipelines: Build and manage CI/CD pipelines specifically for LLM deployment, addressing unique challenges like model size, inference optimization, and versioning.
  • LLMOps Practices: Implement LLMOps best practices for monitoring model performance, drift detection, prompt management, and feedback loops for continuous improvement.
  • RESTful API Development: Design and develop RESTful APIs to expose LLM capabilities to other applications and services, ensuring scalability, security, and optimal performance.
  • Model Optimization: Apply techniques like quantization, distillation, and pruning to optimize LLMs for efficient inference on AWS infrastructure.
  • Monitoring and Observability: Establish comprehensive monitoring and alerting mechanisms to track LLM performance, latency, resource utilization, and potential biases.
  • Prompt Engineering and Management: Develop strategies for prompt engineering and management to enhance LLM outputs and ensure consistency and safety.
  • Collaboration: Work closely with data scientists, researchers, and software engineers to integrate LLMs into production systems effectively.
  • Cost Optimization: Continuously optimize LLMOps processes and infrastructure for cost-efficiency while maintaining high performance and reliability.

Qualifications:

  • Experience: 3+ years in MLOps or a related field, including hands-on work deploying and managing LLMs.
  • AWS Expertise: Strong proficiency in AWS services relevant to MLOps and LLMs, including SageMaker, EC2 (with GPU instances), S3, ECS/EKS, Lambda, and API Gateway.
  • LLM Knowledge: Deep understanding of LLM architectures (e.g., Transformers), training techniques, and inference optimization strategies.
  • Programming Skills: Proficiency in Python and experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation), REST API frameworks (e.g., Flask, FastAPI), and LLM libraries (e.g., Hugging Face Transformers).
  • Monitoring: Familiarity with monitoring and logging tools for LLMs, such as Prometheus, Grafana, and CloudWatch.
  • Containerization: Experience with Docker and container orchestration (e.g., Kubernetes, ECS) for LLM deployment.
  • Problem Solving: Excellent problem-solving and troubleshooting skills in the context of LLMs and MLOps.
  • Communication: Strong communication and collaboration skills to work effectively with cross-functional teams.


Required Skills: GenAI
Additional Skills: AI Developer
