Talent.com
Inference Engineer: Scalable AI Model Serving
Inference Engineer: Scalable AI Model ServingVirtue AI • San Francisco, CA, United States
[error_messages.no_longer_accepting]
Inference Engineer : Scalable AI Model Serving

Inference Engineer : Scalable AI Model Serving

Virtue AI • San Francisco, CA, United States
[job_card.variable_hours_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

An innovative AI security company in San Francisco is seeking an Inference Engineer who will be pivotal in optimizing ML model inferences. The role requires deep knowledge of serving LLMs and experience in designing inference APIs. Candidates should be comfortable in a fast-paced startup environment and demonstrate the ability to troubleshoot and solve real-world inference issues effectively. This position presents an opportunity to work at the cutting edge of AI security with competitive compensation and growth potential.

#J-18808-Ljbffr

[job_alerts.create_a_job]

Inference Engineer Scalable AI Model Serving • San Francisco, CA, United States

[internal_linking.similar_jobs]
AI Engineer

AI Engineer

Chima • San Francisco, CA, United States
[job_card.full_time]
Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features.This range is provided by Chima. Your actual pay will be based on your skills and experience — talk wit...[show_more]
[last_updated.last_updated_30] • [promoted]
Model API Engineer Fast, Reliable AI Inference

Model API Engineer Fast, Reliable AI Inference

BaseTen Labs, Inc. • San Francisco, CA, United States
[job_card.full_time]
An innovative AI technology company in San Francisco is seeking a skilled individual to join their Model Performance team. You will design and operate Model APIs, focusing on advanced inference capa...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Artificial Intelligence Engineer

Artificial Intelligence Engineer

The Mice Groups Inc • Redwood City, CA, United States
[job_card.permanent]
AI Engineer, Evaluation and Reliability / Contract-to-Hire or Direct Hire / Redwood City / Hybrid, onsite 3 days per week / This position pays $70-80 / hr. W2 for Contract, $140-190K annually upon con...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
AI Engineer

AI Engineer

LangChain • San Francisco, CA, United States
[job_card.full_time]
We're looking for an AI Engineer to join our Professional Services team.You'll work directly with enterprise customers to design, build, and optimize production-grade AI agent systems.This role com...[show_more]
[last_updated.last_updated_30] • [promoted]
AI Infra Engineer, Scalable LLM Serving Platform

AI Infra Engineer, Scalable LLM Serving Platform

Scale AI • San Francisco, CA, United States
[job_card.full_time]
A leading AI technology company is seeking a Software Engineer for the ML Infrastructure team to design and build platforms for LLMs. You will develop fault-tolerant systems and collaborate with res...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
AI Engineer

AI Engineer

SINAI Technologies Inc • San Francisco, CA, United States
[job_card.full_time]
SINAI is a San Francisco–based climate technology company helping enterprises measure, analyze, and reduce carbon emissions. Our platform supports complex reporting, modeling, and regulatory workflo...[show_more]
[last_updated.last_updated_30] • [promoted]
Lead AI Engineer - Build Compassionate, Scalable AI

Lead AI Engineer - Build Compassionate, Scalable AI

Woebot Health • San Francisco, CA, United States
[job_card.full_time]
A digital health startup in San Francisco is seeking a Lead AI Engineer to architect and optimize the cognitive engine of their platform. The ideal candidate has over 5 years of experience in AI / ML ...[show_more]
[last_updated.last_updated_30] • [promoted]
AI Product Engineer for Scalable Hardware Intelligence

AI Product Engineer for Scalable Hardware Intelligence

Apple Inc. • San Francisco, CA, United States
[job_card.full_time]
A leading technology company in San Francisco is seeking an AI Application Engineer to leverage AI / ML expertise in enhancing engineering design processes. The ideal candidate will possess a strong u...[show_more]
[last_updated.last_updated_30] • [promoted]
Generative AI Engineer

Generative AI Engineer

Regard • San Francisco, California, US
[job_card.full_time]
Job Description Job Description As a Generative AI Engineer at Regard, you'll work across the full lifecycle of developing and deploying AI-driven features, from ideation and design to prototypin...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Software Engineer, Intelligence — AI Retrieval

Senior Software Engineer, Intelligence — AI Retrieval

AngelList • San Francisco, CA, United States
[job_card.full_time]
A growing tech company is seeking a Senior Software Engineer to design and implement systems that power data retrieval and search functionalities. The ideal candidate should have extensive experienc...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior ML Inference Engineer - PyTorch Performance

Senior ML Inference Engineer - PyTorch Performance

Comfy • San Francisco, CA, United States
[job_card.full_time]
A leading AI platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal candidate will engage in building efficien...[show_more]
[last_updated.last_updated_30] • [promoted]
Platform Engineer : Scalable AI Infra for Real-World Labs

Platform Engineer : Scalable AI Infra for Real-World Labs

Withmetis • San Francisco, CA, United States
[job_card.full_time]
A cutting-edge AI platform firm in San Francisco is seeking a Platform Engineer to design and build scalable infrastructure for reinforcement learning frameworks. This role involves owning projects ...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff AI Engineer - Scale AI Platforms & Impact

Staff AI Engineer - Scale AI Platforms & Impact

Medium • San Mateo, CA, United States
[job_card.full_time]
A technology company is seeking a Staff Software Engineer responsible for building, maintaining, and scaling AI products. You will work directly with product managers to define the product roadmap, ...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Forward Deployed AI Engineer

Forward Deployed AI Engineer

Jenn Nguyen and Friends • San Francisco, CA, USA
[job_card.full_time]
[filters_job_card.quick_apply]
Forward Deployed AI Engineer (ML / Full Stack).Compensation : $115K–$200K base (varies by level and interview performance) + equity. Work Policy : 5 days / week in-office.Sponsorship : TN and OPT visa tr...[show_more]
[last_updated.last_updated_variable_days]
AI Engineer : Build Multimodal LLMs & Scalable AI Infra

AI Engineer : Build Multimodal LLMs & Scalable AI Infra

MLabs • San Francisco, CA, United States
[job_card.full_time]
An AI-focused company in San Francisco is looking for a talented AI Engineer to work on building AI models and features essential to their business operations. This hybrid role will focus on documen...[show_more]
[last_updated.last_updated_30] • [promoted]
In-Office AI Engineer – Prototyping, Scale & Impact (SF)

In-Office AI Engineer – Prototyping, Scale & Impact (SF)

Factory • San Francisco, CA, United States
[job_card.full_time]
A leading tech company in San Francisco is seeking an innovative AI Engineer to design and develop cutting-edge AI systems that enhance productivity. Candidates should have 2+ years of AI / ML experie...[show_more]
[last_updated.last_updated_30] • [promoted]
Inference Engineer : Scalable AI Model Serving

Inference Engineer : Scalable AI Model Serving

Virtue AI • San Francisco, CA, United States
[job_card.full_time]
An innovative AI security company in San Francisco is seeking an Inference Engineer who will be pivotal in optimizing ML model inferences. The role requires deep knowledge of serving LLMs and experi...[show_more]
[last_updated.last_updated_30] • [promoted]
Model API Engineer : Fast, Scalable AI Inference

Model API Engineer : Fast, Scalable AI Inference

Baseten • San Francisco, CA, United States
[job_card.full_time]
A technology startup in San Francisco is seeking a skilled individual to enhance the API infrastructure supporting AI models. The role involves designing and optimizing backend services, focusing on...[show_more]
[last_updated.last_updated_30] • [promoted]