Talent.com
LLM Inference Deployment Engineer
LLM Inference Deployment EngineerVirtualVocations • Ann Arbor, Michigan, United States
[error_messages.no_longer_accepting]
LLM Inference Deployment Engineer

LLM Inference Deployment Engineer

VirtualVocations • Ann Arbor, Michigan, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

A company is looking for an LLM Inference Deployment Engineer to optimize and deploy large language models for high-performance inference.

Key Responsibilities

Deploy and optimize LLMs post-training from libraries like HuggingFace

Utilize inference runtimes for efficient execution

Develop and maintain high-performance inference pipelines using Docker and Kubernetes

Required Qualifications

Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field

Experience in LLM inference deployment and model optimization

Expertise in LLM inference frameworks such as PyTorch and ONNX Runtime

In-depth knowledge of Python for model integration and performance tuning

Experience with containerized AI deployments and LLM memory optimization strategies

[job_alerts.create_a_job]

Deployment Engineer • Ann Arbor, Michigan, United States

[internal_linking.similar_jobs]
Applied ML Engineer, II - GPU Optimization

Applied ML Engineer, II - GPU Optimization

Torc Robotics • Ann Arbor, Michigan, United States
[job_card.full_time]
At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business. A leader in autonomous driving since 2007, Torc has spent over a deca...[show_more]
[last_updated.last_updated_30] • [promoted]
Machine Learning Signal Processing Engineer

Machine Learning Signal Processing Engineer

Leidos Inc • Ann Arbor, MI, United States
[job_card.full_time]
Do you want to join a high performing team that values integrity, innovation, and collaboration with a company whose mission is to make the world safer, healthier, and more efficient through inform...[show_more]
[last_updated.last_updated_30] • [promoted]
Software Engineer, Applications & Customer Solutions

Software Engineer, Applications & Customer Solutions

MemryX • Ann Arbor, Michigan, United States, 48109
[job_card.full_time]
AI accelerators for edge computing.Founded in 2019 and headquartered in Ann Arbor, Michigan, the company also operates existing engineering branches in Taipei, Hsinchu (Taiwan), and Bangalore (Indi...[show_more]
[last_updated.last_updated_30]
Senior ML / AI Engineer

Senior ML / AI Engineer

Source One Technical Solutions • Ann Arbor, MI, United States
[job_card.full_time]
Source One is a consulting services company and we’re currently looking for the following individual to work as a consultant to our client, an autonomous vehicle company in Ann Arbor, MI.We are una...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Member Solutions Engineer & Insights Analyst

Member Solutions Engineer & Insights Analyst

Merit Network, Inc. • Ann Arbor, MI, US
[job_card.full_time]
How to Apply To apply for this position, please visit this link : Include a copy of your resume and cover letter (combined as a single file, with your cover letter as the first page).Making sure you...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Lead Software Engineer (GoLang / TS / CUDA) : $150-215K

Lead Software Engineer (GoLang / TS / CUDA) : $150-215K

IC Resources • Ann Arbor, MI, United States
[job_card.full_time]
We're assisting our European-Based Engineering client identify a.US Headquarters in Ann Arbor, Michigan.This is a very exciting opportunity to. Emerging-Tech team here in the U.We're Only Considerin...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior ML / AI Engineer (Ann Arbor)

Senior ML / AI Engineer (Ann Arbor)

Source One Technical Solutions • Ann Arbor, MI, US
[job_card.part_time]
Source One is a consulting services company and were currently looking for the following individual to work as a consultant to our client, an autonomous vehicle company in Ann Arbor, MI.We are unab...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Engineer I Labs - HDS (Willow Run location)

Engineer I Labs - HDS (Willow Run location)

NSF International • Ypsilanti, MI, United States
[job_card.full_time]
As an Engineer I for Engineering / Plastics, you will be responsible for conducting a wide variety of testing on plastics, pipes, fittings and other related plumbing products to specific standards an...[show_more]
[last_updated.last_updated_30] • [promoted]
AI Engineer & Researcher, Inference

AI Engineer & Researcher, Inference

Speechify • Ann Arbor, Michigan, United States
[job_card.full_time]
PLEASE APPLY THROUGH THIS LINK : .The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify’s text-to-speech products to turn whatever ...[show_more]
[last_updated.last_updated_30] • [promoted]
Machine Learning AI Engineer

Machine Learning AI Engineer

The Mice Groups, Inc. • Ann Arbor, MI, United States
[job_card.full_time]
ML / AI Engineer / Contract, W2 only / Hybrid, 3 days per week onsite in Ann Arbor, MI or Palo Alto, CA / 1 year, extendable. Simplify vehicle software development and increase developer agility by ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
AI Systems Engineer

AI Systems Engineer

Vertex Sigma Software • Superior Township, MI, US
[job_card.full_time]
[filters_job_card.quick_apply]
Deploy and secure an on-premises AI infrastructure for hosting large language models (LLMs).Install, configure, and maintain AI model-serving frameworks on internal GPU-enabled servers.Develop and ...[show_more]
[last_updated.last_updated_variable_days]
Senior Machine Learning Engineer - App Engine (CUDA / C++)

Senior Machine Learning Engineer - App Engine (CUDA / C++)

Torc Robotics • Ann Arbor, Michigan, United States
[job_card.full_time]
The mission of the Application Engine Team is to provide a robust, efficient, and flexible platform for integrating and managing various deep learning models and processes in the context of L4 auto...[show_more]
[last_updated.last_updated_30] • [promoted]
ML AI Engineer

ML AI Engineer

Mice Groups • Ann Arbor, Michigan, United States
[job_card.full_time]
ML / AI Engineer / Contract / Hybrid, 3 days per week onsite in Ann Arbor, MI or Palo Alto, CA / 1 year, extendable.Summary : Simplify vehicle software development and increase developer agility by ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Thermal Systems Simulation Engineer (Ypsilanti)

Thermal Systems Simulation Engineer (Ypsilanti)

RGBSI • Ypsilanti, MI, US
[job_card.part_time]
Develop high fidelity 1-D thermal system and component-level models to support fuel economy and vehicle performance assessments. Perform model correlation and validation against physical data test t...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Thermal Systems Simulation Engineer

Thermal Systems Simulation Engineer

RGBSI • Ypsilanti, MI, US
[job_card.full_time]
Develop high fidelity 1-D thermal system and component-level models to support fuel economy and vehicle performance assessments. Perform model correlation and validation against physical data test t...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
ML / AI Safety Engineer - AD / ADAS Systems

ML / AI Safety Engineer - AD / ADAS Systems

Woven • Ann Arbor, MI / Palo Alto, CA, Texas, United States
[job_card.full_time]
Toyota’s once-in-a-century transformation into a mobility company.Inspired by a legacy of innovating for the benefit of others, our mission is to challenge the current state of mobility through hum...[show_more]
[last_updated.last_updated_30] • [promoted]
Machine Learning Engineer II - App Engine (CUDA)

Machine Learning Engineer II - App Engine (CUDA)

Torc Robotics • Ann Arbor, Michigan, United States
[job_card.full_time]
The mission of the Application Engine Team is to provide a robust, efficient, and flexible platform for integrating and managing various deep learning models and processes in the context of L4 auto...[show_more]
[last_updated.last_updated_30] • [promoted]
Member Solutions Engineer & Insights Analyst

Member Solutions Engineer & Insights Analyst

Merit Network • Ann Arbor, MI, United States
[job_card.full_time]
How to Apply To apply for this position, please visit this link : https : / / careers.Include a copy of your resume and cover letter (combined as a single file, with your cover letter as the first page)...[show_more]
[last_updated.last_updated_variable_days] • [promoted]