Machine Learning EngineerGEICO • Palo Alto, CA, United States

[error_messages.no_longer_accepting]

Machine Learning Engineer

GEICO • Palo Alto, CA, United States

[job_card.variable_days_ago]

[job_preview.job_type]

[job_card.full_time]

[job_card.job_description]

GEICO . For more information, please .

At GEICO, we offer a rewarding career where your ambitions are met with endless possibilities.
Every day we honor our iconic brand by offering quality coverage to millions of customers and being there when they need us most. We thrive through relentless innovation to exceed our customers’ expectations while making a real impact for our company through our shared purpose.
When you join our company, we want you to feel valued, supported and proud to work here. That’s why we offer The GEICO Pledge : Great Company, Great Culture, Great Rewards and Great Careers.
GEICO AI ML Infrastructure team is seeking an exceptional Senior ML Platform Engineer to build and scale our machine learning infrastructure with a focus on Large Language Models (LLMs) and AI applications. This role combines deep technical expertise in cloud platforms, container orchestration, and ML operations with strong leadership and mentoring capabilities. You will be responsible for designing, implementing, and maintaining scalable, reliable systems that enable our data science and engineering teams to deploy and operate LLMs efficiently at scale. The candidate must have excellent verbal and written communication skills with a proven ability to work independently and in a team environment.KEY RESPONSIBILITIESML Platform & Infrastructure
Design and implement scalable infrastructure for training, fine-tuning, and serving open source LLMs (Llama, Mistral, Gemma, etc.)
Architect and manage Kubernetes clusters for ML workloads, including GPU scheduling, autoscaling, and resource optimization
Design, implement, and maintain feature stores for ML model training and inference pipelines
Build and optimize LLM inference systems using frameworks like vLLM, TensorRT-LLM, and custom serving solutions
Ensure 99.9%+ uptime for ML platforms through robust monitoring, alerting, and incident response procedures
Design and implement ML platforms using DataRobot, Azure Machine Learning, Azure Kubernetes Service (AKS), and Azure Container Instances
Develop and maintain infrastructure using Terraform, ARM templates, and Azure DevOps
Implement cost-effective solutions for GPU compute, storage, and networking across Azure regions
Ensure ML platforms meet enterprise security standards and regulatory compliance requirements
Evaluate and potentially implement hybrid cloud solutions with AWS / GCP as backup or specialized use casesDevOps & Platform Engineering
Design and maintain robust CI / CD pipelines for ML model deployment using Azure DevOps, GitHub Actions, and MLOps tools
Implement automated model training, validation, deployment, and monitoring workflows
Set up comprehensive observability using Prometheus, Grafana, Azure Monitor, and custom dashboards
Continuously optimize platform performance, reducing latency and improving throughput for ML workloads
Design and implement backup, recovery, and business continuity plans for ML platformsTechnical Leadership & Mentoring
Mentor junior engineers and data scientists on platform best practices, infrastructure design, and ML operations
Lead comprehensive code reviews focusing on scalability, reliability, security, and maintainability
Design and deliver technical onboarding programs for new team members joining the ML platform team
Establish and champion engineering standards for ML infrastructure, deployment practices, and operational procedures
Create technical documentation, runbooks, and deliver internal training sessions on platform capabilitiesCross-Functional Collaboration
Work closely with data scientists to understand requirements and optimize workflows for model development and deployment
Collaborate with product engineering teams to integrate ML capabilities into customer-facing applications
Support research teams with infrastructure for experimenting with cutting-edge LLM techniques and architectures
Present technical solutions and platform roadmaps to leadership and cross-functional stakeholdersREQUIRED QUALIFICATIONSExperience & Education
Bachelor’s degree in computer science, Engineering, or related technical field (or equivalent experience)
5+ years of software engineering experience with focus on infrastructure, platform engineering, or MLOps
2+ years of hands-on experience with machine learning infrastructure and deployment at scale
1+ years of experience working with Large Language Models and transformer architecturesTechnical Skills - Core Requirements
Proficient in Python; strong skills in Go, Rust, or Java preferred
Proven experience working with open source LLMs (Llama 2 / 3, Qwen, Mistral, Gemma, Code Llama, etc.)
Proficient in Kubernetes including custom operators, helm charts, and GPU scheduling
Deep expertise in Azure services (AKS, Azure ML, Container Registry, Storage, Networking)
Experience implementing and operating feature stores (Chronon, Feast, Tecton, Azure ML Feature Store, or custom solutions)
Hands-on experience with inference optimization using vLLM, TensorRT-LLM, Triton Inference Server, or similarDevOps & Platform Skills
Advanced experience with Azure DevOps, GitHub Actions, Jenkins, or similar CI / CD platforms
Proficiency with Terraform, ARM templates, Pulumi, or CloudFormation
Deep understanding of Docker, container optimization, and multi-stage builds
Experience with Prometheus, Grafana, ELK stack, Azure Monitor, and distributed tracing
Knowledge of both SQL and NoSQL databases, data warehousing, and vector databasesLeadership & Soft Skills
Demonstrated track record of mentoring engineers and leading technical initiatives
Experience leading design reviews with focus on compliance, performance, and reliability
Excellent ability to explain complex technical concepts to diverse audiences
Strong analytical and troubleshooting skills for complex distributed systems
Experience managing cross-functional technical projects and coordinating with multiple stakeholdersPREFERRED QUALIFICATIONSAdvanced Experience
Master’s degree in computer science, Machine Learning, or related field
6+ years of platform engineering or infrastructure experience
Experience with Staff Engineer or Tech Lead roles in ML / AI organizations
Background in distributed systems and high-performance computing
Open-source contributions to ML infrastructure projects or LLM frameworksSpecialized Skills
Multi-Cloud Experience : Hands-on experience with Azure, AWS (SageMaker, EKS) and / or GCP (Vertex AI, GKE)
Experience with specialized hardware (A100s, H100s, TPUs, TEEs) and optimization
RLHF & Fine-tuning : Experience with Reinforcement Learning from Human Feedback and LLM fine-tuning workflows
Experience with Milvus, Pinecone, Weaviate, Qdrant, or similar vector storage solutions
Deep experience with MLflow, Kubeflow, DataRobot, or similar platformsIndustry Knowledge
Understanding of AI safety principles, model governance, and regulatory compliance
Background in regulated industries with understanding of data privacy requirements
Experience supporting ML research teams and academic partnerships
Deep understanding of GPU optimization, memory management, and high-throughput systems
Annual Salary
$105,000.00 - $300,000.00The above annual salary range is a general guideline. Multiple factors are taken into consideration to arrive at the final hourly rate / annual salary to be offered to the selected candidate. Factors include, but are not limited to, the scope and responsibilities of the role, the selected candidate’s work experience, education and training, the work location as well as market and business considerations.At this time, GEICO will not sponsor a new applicant for employment authorization for this position.
The GEICO Pledge :
Great Company :
At GEICO, we help our customers through life’s twists and turns. Our mission is to protect people when

#J-18808-Ljbffr

[job_alerts.create_a_job]

Machine Learning Engineer • Palo Alto, CA, United States

[internal_linking.similar_jobs]

Staff Machine Learning Engineer

Axiado • San Jose, CA, US

[job_card.full_time]

Axiado is an AI-enhanced security processor company redefining the control and management of every digital system.The company was founded in 2017, and currently has 150+ employees.At Axiado, develo...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer, Recommendation

NewsBreak • Mountain View, CA, US

[job_card.full_time]

Founded in 2015, NewsBreak is the Content Intelligence platform shaping the future content economy.With over 40 million monthly active users, our flagship platform delivers highly personalized loca...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer LLLMGenAI

Adobe • San Jose, California, United States

[job_card.full_time]

Our Company Changing the world through digital experiences is what Adobe’s all about.We give everyone-from emerging artists to global brands-everything they need to design and deliver exceptional d...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Machine Learning Engineer 2

Adobe Inc. • San Jose, CA, United States

[job_card.full_time]

Changing the world through digital experiences is what Adobe's all about.We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital exper...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Lead Machine Learning Engineer

Capital One • San Jose, CA, United States

[job_card.full_time]

Lead Machine Learning Engineer.As a Capital One Machine Learning Engineer (MLE), you’ll be part of an Agile team dedicated to productionizing machine learning applications and systems at scale.You’...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Machine Learning Engineer 2

Intuit • Mountain View, CA, United States

[job_card.full_time]

Embedded inside a vibrant team of data scientists, you’ll conceive, code, and deploy data science models at scale using industry tools. Key skills : data wrangling, feature engineering, model develop...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer

VirtualVocations • San Jose, California, United States

[job_card.full_time]

A company is looking for a Machine Learning Engineer for the Deployments Team.Key Responsibilities Design and deliver advanced solutions for predictions from various Computer Vision models across...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Machine Learning Engineer

RADAR • Sunnyvale, CA, US

[job_card.full_time]

At RADAR, we're transforming the way the world thinks about physical retail.RADAR has raised over $104M from top investors, retailers, and strategics and works with some of the world's reta...[show_more]

[last_updated.last_updated_variable_hours] • [promoted] • [new]

Machine Learning Engineer

Institute of Foundation Models • Sunnyvale, CA, US

[job_card.full_time]

About the Institute of Foundation Models.We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer

ProductNow • Palo Alto, CA, United States

[job_card.full_time]

Machine Learning Engineer ProductNow •Palo Alto, CA, US.Explore and experiment with state-of-the-art AI models and machine learning techniques, contributing to core product features powered by ML.Ow...[show_more]

[last_updated.last_updated_variable_hours] • [promoted] • [new]

Staff Machine Learning Engineer (ML Platform)

EarnIn • Palo Alto, CA, United States

[job_card.full_time]

Get AI-powered advice on this job and more exclusive features.As one of the first pioneers of earned wage access, our passion at EarnIn is building products that deliver real-time financial flexibi...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer

Instrumental Inc. • Palo Alto, CA, United States

[job_card.full_time]

Machine Learning Engineer (Computer Vision).We are looking for a customer-focused ML Engineer to help build and scale our end-to-end ML pipeline. You’ll balance research and productization in a fast...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer

Altera • San Jose, CA, United States

[job_card.full_time]

About the Role • • We are seeking a Machine Learning Engineer to help drive the development, optimization and deployment of Altera FPGA Compiler. In this role, you will work at the intersection of • •m...[show_more]

[last_updated.last_updated_1_day] • [promoted]

Machine Learning Engineer

Clutch Canada • Palo Alto, CA, United States

[job_card.full_time]

Palo Alto, CA - Engineering - Hybrid - Full-time.Building hardware is like writing software with no debugger, no logs, and only three compile attempts — before mass production.This lack of visibili...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer

Amiri Recruiting • Mountain View, CA, US

[job_card.full_time]

This is an opportunity with an early stage startup.We're looking for an ML research-focused software engineer to join us on our mission to build AI superpowers for developers.Train and fine-tun...[show_more]

[last_updated.last_updated_30] • [promoted]

AIML- Machine Learning Engineer, Machine Learning Platform Technologies

Apple Inc. • Santa Clara, CA, United States

[job_card.full_time]

AIML- Machine Learning Engineer, Machine Learning Platform Technologies.Santa Clara, California, United States Machine Learning and AI. Imagine what you could do here.At Apple, great ideas have a wa...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer

Gotion Inc. • Fremont, CA, United States

[job_card.full_time]

Silicon Valley, CA, currently building a manufacturing facility in Manteno, IL and has R&D centers in Ohio, China, Japan and Europe. We innovate in the next generation electric vehicle and energy st...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer

Cisco Systems • Sunnyvale, CA, United States

[job_card.full_time]

Splunk, a Cisco company, is building a safer, more resilient digital world with an end‑to‑end, full‑stack platform designed for hybrid, multi‑cloud environments. The Splunk AI Platform and Services ...[show_more]

[last_updated.last_updated_variable_hours] • [promoted] • [new]