GenAI Inference Engineer — Scalable LLM ServingDatabricks Inc. • San Francisco, CA, United States

GenAI Inference Engineer — Scalable LLM Serving

Databricks Inc. • San Francisco, CA, United States

[job_card.variable_days_ago]

[job_preview.job_type]

[job_card.full_time]

[job_card.job_description]

A leading AI-focused technology company in San Francisco is seeking a Software Engineer for GenAI inference. In this role, you'll design, develop, and optimize the inference engine powering the Foundation Model API. You will collaborate closely with researchers and engage in performance-critical system challenges, focusing on large-scale LLM applications. A strong background in software engineering, distributed systems, and machine learning techniques is essential. The role offers a competitive compensation package including potential bonuses and equity.

#J-18808-Ljbffr

[job_alerts.create_a_job]

Engineer Llm • San Francisco, CA, United States

[internal_linking.related_jobs]

Senior GenAI ML Engineer - Scalable Inference & APIs

Adobe • San Francisco, CA, US

[job_card.full_time]

A leading digital media company is seeking a Senior Machine Learning Engineer to develop core GenAI services and APIs for products like Firefly and Photoshop. Candidates should have a strong backgro...[show_more]

[last_updated.last_updated_1_day] • [promoted]

ML Research Engineer : Scalable Training & Inference

Aldea • San Francisco, California, United States

[job_card.full_time]

A multi-modal AI company is seeking a Research Engineer (Machine Learning) to develop infrastructure for AI research.You will design and optimize training systems for large-scale models, ensuring h...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

ML Platform Engineer — Scalable Inference & Autoscaling

Together AI • San Francisco, CA, United States

[job_card.full_time]

A leading AI research company in San Francisco is seeking a skilled software engineer to focus on optimizing AI systems and ensuring robust performance. Candidates should have extensive experience i...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Real-Time LLM Inference Engineer (Hybrid, SF)

Pantera Capital • San Francisco, CA, United States

[job_card.full_time]

A technology investment firm in San Francisco seeks an AI Inference Engineer to develop APIs for AI inference used by internal and external customers. Responsibilities include benchmarking the infer...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Senior LLM Applications Engineer for Intelligent Agents

Sahara • San Francisco, CA, United States

[job_card.full_time]

A decentralized AI platform in San Francisco is seeking a talented large language model application engineer.The role involves developing intelligent agents and collaborating with a team to optimiz...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

AI Infra Engineer, Scalable LLM Serving Platform

Scale AI, Inc. • San Francisco, CA, United States

[job_card.full_time]

A leading AI technology company is seeking a Software Engineer for the ML Infrastructure team to design and build platforms for LLMs. You will develop fault-tolerant systems and collaborate with res...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

AI / ML Inference Engineer

Krea • San Francisco, California, United States

[job_card.full_time]

At Krea, we're dedicated to making AI intuitive and controllable for creatives.Our mission is to build tools that empower human creativity, not replace it. We believe AI is a new medium that allows ...[show_more]

[last_updated.last_updated_30] • [promoted]

Software Engineer - ML / LLM Inference

Alldus • San Francisco, CA, United States

[job_card.full_time]

Get AI-powered advice on this job and more exclusive features.Direct message the job poster from Alldus.Principal Recruitment Consultant | AI & Machine Learning | Co-organizer of the AI in Action P...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior ML Ops Engineer – Scalable AI for Audio

Disney • San Francisco, CA, United States

[job_card.full_time]

A major entertainment company is seeking a skilled Sr ML Ops Engineer in San Francisco, CA.The role involves building scalable infrastructure for machine learning and AI frameworks, optimizing CI / C...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

LLM Inference Performance Engineer

Baseten • San Francisco, CA, United States

[job_card.full_time]

A dynamic AI startup in San Francisco is seeking a Software Engineer focused on ML performance.This role involves optimizing large language models, debugging and enhancing ML solutions, and produci...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

AI Engineer LLM Infra

Yutori • San Francisco, California, United States

[job_card.full_time]

Yutori is reimagining how people interact with the web by building AI agents that can reliably do everyday digital tasks. We are building the entire stack to be agent-first, from training our own mo...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior AI Agent Engineer : Build Scalable LLM Pipelines

Tessera Labs • San Francisco, CA, United States

[job_card.full_time]

A leading AI technology company based in San Francisco is seeking an experienced AI Agent Engineer to develop scalable AI systems. The role requires expertise in AI model building, with a strong foc...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Research Engineer – Scalable ML Training & Inference

Aldea Inc • San Francisco, California, United States

[job_card.full_time]

A leading AI company in San Francisco is looking for a Research Engineer (Machine Learning) to enhance their multi-modal AI capabilities. The role involves building and optimizing infrastructure for...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Multimodal Inference Engineer — Scale Real-Time AI

OpenAI • San Francisco, CA, United States

[job_card.full_time]

A leading AI research company in San Francisco is seeking a Software Engineer specialized in multimodal inference systems. Responsibilities include designing high-performance infrastructure for audi...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Senior ML Platform Engineer - Build Scalable AI Systems

Faire • San Francisco, CA, US

[job_card.full_time]

A leading online wholesale marketplace is seeking a Staff Engineer in San Francisco to lead the design and execution of a machine learning platform. The ideal candidate will have over 5 years of exp...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

ML Inference Engineer - Scalable AI Systems

Together • San Francisco, CA, US

[job_card.full_time]

A pioneering AI company in San Francisco seeks a Machine Learning Engineer to join their Inference Engine team.This role involves optimizing AI inference systems, developing high-performance servic...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Senior ML Engineer — Scalable AI & Personalization (Remote)

Quizlet • San Francisco, California, United States

[filters.remote]

[job_card.full_time]

A leading educational technology company in San Francisco is seeking a Machine Learning Engineer to drive AI initiatives. This role requires extensive experience in Python, ML libraries, and a solid...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Senior ML Engineer - Scalable Vision-Language Systems

TwelveLabs • San Francisco, CA, United States

[job_card.full_time]

A pioneering AI technology firm based in San Francisco is seeking a Machine Learning Engineer to enhance its ML systems and engineering workflows. The ideal candidate will have over 6 years of exper...[show_more]

[last_updated.last_updated_variable_days] • [promoted]