Talent.com
Inference Engineer: Scalable AI Model Serving
Inference Engineer: Scalable AI Model ServingVirtue AI • San Francisco, CA, United States
[error_messages.no_longer_accepting]
Inference Engineer : Scalable AI Model Serving

Inference Engineer : Scalable AI Model Serving

Virtue AI • San Francisco, CA, United States
[job_card.variable_hours_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

An innovative AI security company in San Francisco is seeking an Inference Engineer who will be pivotal in optimizing ML model inferences. The role requires deep knowledge of serving LLMs and experience in designing inference APIs. Candidates should be comfortable in a fast-paced startup environment and demonstrate the ability to troubleshoot and solve real-world inference issues effectively. This position presents an opportunity to work at the cutting edge of AI security with competitive compensation and growth potential.

#J-18808-Ljbffr

[job_alerts.create_a_job]

Inference Engineer Scalable AI Model Serving • San Francisco, CA, United States

[internal_linking.similar_jobs]
Model API Engineer Fast, Reliable AI Inference

Model API Engineer Fast, Reliable AI Inference

BaseTen Labs, Inc. • San Francisco, CA, United States
[job_card.full_time]
An innovative AI technology company in San Francisco is seeking a skilled individual to join their Model Performance team. You will design and operate Model APIs, focusing on advanced inference capa...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Artificial Intelligence Engineer

Artificial Intelligence Engineer

The Mice Groups Inc • Redwood City, CA, United States
[job_card.permanent]
AI Engineer, Evaluation and Reliability / Contract-to-Hire or Direct Hire / Redwood City / Hybrid, onsite 3 days per week / This position pays $70-80 / hr. W2 for Contract, $140-190K annually upon con...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
AI Engineer

AI Engineer

LangChain • San Francisco, CA, United States
[job_card.full_time]
We're looking for an AI Engineer to join our Professional Services team.You'll work directly with enterprise customers to design, build, and optimize production-grade AI agent systems.This role com...[show_more]
[last_updated.last_updated_30] • [promoted]
AI Infra Engineer, Scalable LLM Serving Platform

AI Infra Engineer, Scalable LLM Serving Platform

Scale AI • San Francisco, CA, United States
[job_card.full_time]
A leading AI technology company is seeking a Software Engineer for the ML Infrastructure team to design and build platforms for LLMs. You will develop fault-tolerant systems and collaborate with res...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Lead AI Engineer - Build Compassionate, Scalable AI

Lead AI Engineer - Build Compassionate, Scalable AI

Woebot Health • San Francisco, CA, United States
[job_card.full_time]
A digital health startup in San Francisco is seeking a Lead AI Engineer to architect and optimize the cognitive engine of their platform. The ideal candidate has over 5 years of experience in AI / ML ...[show_more]
[last_updated.last_updated_30] • [promoted]
Generative AI Engineer

Generative AI Engineer

Regard • San Francisco, California, US
[job_card.full_time]
Job Description Job Description As a Generative AI Engineer at Regard, you'll work across the full lifecycle of developing and deploying AI-driven features, from ideation and design to prototypin...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Core Infra Engineer for a Scalable AI Platform

Senior Core Infra Engineer for a Scalable AI Platform

Harvey • San Francisco, CA, United States
[job_card.full_time]
A tech-driven company in San Francisco is seeking a Software Engineer to design and build scalable infrastructure systems. This role involves evolving multi-cloud infrastructure and ensuring resilie...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
ML Inference Engineer — PyTorch & Scalable AI

ML Inference Engineer — PyTorch & Scalable AI

Together • San Francisco, CA, United States
[job_card.full_time]
A research-driven AI company is seeking a Machine Learning Engineer to join their Inference Engine team.You'll design and develop production systems to enhance AI inference performance, collaborati...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior ML Inference Engineer - PyTorch Performance

Senior ML Inference Engineer - PyTorch Performance

Comfy • San Francisco, CA, United States
[job_card.full_time]
A leading AI platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal candidate will engage in building efficien...[show_more]
[last_updated.last_updated_30] • [promoted]
AI Engineer

AI Engineer

Ironclad Inc. • San Francisco, CA, United States
[job_card.full_time]
Ironclad is the leading AI contracting platform that transforms agreements into assets.Contracts move faster, insights surface instantly, and agents push work forward, all with you in control.Wheth...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff AI Engineer - Scale AI Platforms & Impact

Staff AI Engineer - Scale AI Platforms & Impact

Medium • San Mateo, CA, United States
[job_card.full_time]
A technology company is seeking a Staff Software Engineer responsible for building, maintaining, and scaling AI products. You will work directly with product managers to define the product roadmap, ...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Senior AI Engineer / ML

Senior AI Engineer / ML

Staffing the Universe • San Francisco, CA, United States
[job_card.full_time]
Full time : Senior AI Engineer / ML - Hybrid / San Francisco, CA.Skill matrix : Total IT experience : Years working with : AI Engineer? Years working with : Generative AI, neural networks, and transfer lear...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Forward Deployed AI Engineer

Forward Deployed AI Engineer

Jenn Nguyen and Friends • San Francisco, CA, USA
[job_card.full_time]
[filters_job_card.quick_apply]
Forward Deployed AI Engineer (ML / Full Stack).Compensation : $115K–$200K base (varies by level and interview performance) + equity. Work Policy : 5 days / week in-office.Sponsorship : TN and OPT visa tr...[show_more]
[last_updated.last_updated_variable_days]
AI Engineer : Build Multimodal LLMs & Scalable AI Infra

AI Engineer : Build Multimodal LLMs & Scalable AI Infra

MLabs • San Francisco, CA, United States
[job_card.full_time]
An AI-focused company in San Francisco is looking for a talented AI Engineer to work on building AI models and features essential to their business operations. This hybrid role will focus on documen...[show_more]
[last_updated.last_updated_30] • [promoted]
In-Office AI Engineer – Prototyping, Scale & Impact (SF)

In-Office AI Engineer – Prototyping, Scale & Impact (SF)

Factory • San Francisco, CA, United States
[job_card.full_time]
A leading tech company in San Francisco is seeking an innovative AI Engineer to design and develop cutting-edge AI systems that enhance productivity. Candidates should have 2+ years of AI / ML experie...[show_more]
[last_updated.last_updated_30] • [promoted]
Inference Engineer : Scalable AI Model Serving

Inference Engineer : Scalable AI Model Serving

Virtue AI • San Francisco, CA, United States
[job_card.full_time]
An innovative AI security company in San Francisco is seeking an Inference Engineer who will be pivotal in optimizing ML model inferences. The role requires deep knowledge of serving LLMs and experi...[show_more]
[last_updated.last_updated_30] • [promoted]
Model API Engineer : Fast, Scalable AI Inference

Model API Engineer : Fast, Scalable AI Inference

Baseten • San Francisco, CA, United States
[job_card.full_time]
A technology startup in San Francisco is seeking a skilled individual to enhance the API infrastructure supporting AI models. The role involves designing and optimizing backend services, focusing on...[show_more]
[last_updated.last_updated_30] • [promoted]
Model Inference Engineer for High-Performance AI

Model Inference Engineer for High-Performance AI

OpenAI • San Francisco, CA, United States
[job_card.full_time]
A technology research company in San Francisco is seeking a Software Engineer for Model Inference to optimize AI models for production environments. The ideal candidate will have over 5 years of exp...[show_more]
[last_updated.last_updated_30] • [promoted]