Talent.com
Model Efficiency Engineer
Model Efficiency EngineerVirtualVocations • Oakland, California, United States
[error_messages.no_longer_accepting]
Model Efficiency Engineer

Model Efficiency Engineer

VirtualVocations • Oakland, California, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

A company is looking for a Member of Technical Staff, Model Efficiency.

Key Responsibilities

Improve core performance metrics of ML systems by analyzing model execution and identifying bottlenecks

Collaborate with modeling and systems teams to experiment, measure, and implement optimizations that enhance inference efficiency

Develop advanced performance techniques, including GPU / CUDA optimizations and model execution strategies for large-scale architectures

Required Qualifications

5+ years of experience in writing high-performance, production-quality code

Strong programming skills in C++ or Python (Rust / Go also welcome)

Experience with large language models and the LLM inference ecosystem

Ability to diagnose and resolve performance bottlenecks across the model execution stack

A strong bias for action with a focus on shipping quickly, measuring impact, and iterating

[job_alerts.create_a_job]

Model • Oakland, California, United States

[internal_linking.related_jobs]
Founding Applied ML Engineer

Founding Applied ML Engineer

David AI • San Francisco, California, United States
[job_card.full_time]
David AI is the first audio data research company.We bring an R&D approach to data–developing datasets with the same rigor AI labs bring to models. Speech is versatile, accessible, and.To unlock the...[show_more]
[last_updated.last_updated_30] • [promoted]
Performance Modelling Engineer

Performance Modelling Engineer

PageBolt WordPress • San Francisco, CA, US
[job_card.permanent]
The Role We're searching for a Staff Performance Modelling Engineer to create and own the analytical and simulation models that steer OTPU architecture and software evolution.You will build functio...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Engineer, Model Serving & Inference

Senior Engineer, Model Serving & Inference

Databricks • San Francisco, CA, United States
[job_card.full_time]
A leading data and AI company is seeking a Senior Software Engineer, Model Serving to design and implement core systems that ensure scalability and operational excellence.You will drive architectur...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
System Modeling Engineer, Energy Storage

System Modeling Engineer, Energy Storage

Redwood Materials, Inc. • San Francisco, CA, United States
[job_card.full_time]
System Modeling Engineer, Energy Storage.Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling—keeping critical minerals in circulation and d...[show_more]
[last_updated.last_updated_30] • [promoted]
Phantom Works Model Based Systems Engineer (Experienced and Lead))

Phantom Works Model Based Systems Engineer (Experienced and Lead))

Boeing • Berkeley, California, USA
[job_card.full_time] +2
Phantom Works Model Based Systems Engineer (Experienced and Lead)).Boeing Defense Space & Security.As part of the MQ-28 team youll work closely with. Boeing Defense Australia (BDA).This program ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
ML Engineer

ML Engineer

Wispr Flow • San Francisco, California, United States
[job_card.full_time]
Wispr Flow is making it as effortless to interact with your devices as talking to a close friend.Voice is the most natural, powerful way to communicate — and we’re building the interfaces to make t...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
ML Engineer

ML Engineer

Phizenix • Menlo Park, California, United States
[job_card.full_time] +1
Client Opportunity | Through Phizenix.Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an innovative generative AI startup that’s developing diffusion-based larg...[show_more]
[last_updated.last_updated_30] • [promoted]
Founding Engineer - ML

Founding Engineer - ML

Datawizz • San Francisco, California, United States
[job_card.full_time]
Datawizz helps companies reduce LLM costs by 85% while improving accuracy by over 20% by combining distillation, model routing, and pruning to route requests to smaller, more efficient models.We st...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
AI Engineer, Applied ML

AI Engineer, Applied ML

Perplexity • San Francisco, California, United States
[job_card.full_time]
Perplexity is looking for an Applied ML Engineer to design, build, and iterate on cutting-edge AI models powering our core experience. As an expert in machine learning and artificial intelligence, y...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Fuel Cycle Modeling Engineer

Fuel Cycle Modeling Engineer

Hadron Energy, Inc. • San Francisco, CA, United States
[job_card.full_time]
Be among the first 25 applicants.Direct message the job poster from Hadron Energy, Inc.Hadron Energy specializes in Micro Modular Reactor (MMR) development, design, and research based in the San Fr...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Machine Learning Engineer, Distributed & Scalable Training

Machine Learning Engineer, Distributed & Scalable Training

Lila Sciences • San Francisco, California, United States
[job_card.full_time]
We’re seeking a ML Engineer specializing in.You’ll design and maintain large-scale training systems, optimize performance for massive models, and integrate cutting-edge techniques to improve effici...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Reliability / DFX Engineer

Reliability / DFX Engineer

OpenAI • San Francisco, CA, US
[job_card.full_time]
About The Team OpenAI's Hardware organization develops silicon and system-level solutions designed for the unique demands of advanced AI workloads. The team is responsible for building the next gen...[show_more]
[last_updated.last_updated_30] • [promoted]
System Modeling Engineer, Energy Storage

System Modeling Engineer, Energy Storage

Redwood Materials • San Francisco, CA, US
[job_card.full_time]
About Redwood Materials Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling — keeping critical minerals in circulation and driving the ener...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Foundation Model ML Engineer — Remote-Friendly

Foundation Model ML Engineer — Remote-Friendly

Stripe • San Francisco, California, United States
[filters.remote]
[job_card.full_time]
A leading financial technology company is seeking a Machine Learning Engineer for their Foundation Model team.The candidate will develop and optimize machine learning models that enhance payments a...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Performance ML Engineer : CUDA, GPU Systems

Performance ML Engineer : CUDA, GPU Systems

Relace • San Francisco, CA, United States
[job_card.full_time]
A tech company specializing in ML infrastructure is seeking a Machine Learning Engineer who excels at making models faster and more efficient through performance tuning and optimization.The ideal c...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Applied ML Engineer

Senior Applied ML Engineer

Macroscope • San Francisco, California, United States
[job_card.full_time]
Macroscope aims to be the source of truth of what's happening for any company that builds software.Our mission is to give leaders clarity and engineers time. We help leaders understand how their pro...[show_more]
[last_updated.last_updated_30] • [promoted]
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

Intellipro Group • San Francisco, California, United States
[job_card.full_time]
Machine Learning Engineer, Training Infrastructure.We are looking for an ML Engineer with .ML workloads at scale, supporting our 3DVAE and video diffusion models. We encourage you to apply even if y...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Machine Learning Engineer (Modeling), Support

Senior Machine Learning Engineer (Modeling), Support

Block • San Francisco, California, United States
[job_card.full_time]
Block is one company built from many blocks, all united by the same purpose of economic empowerment.The blocks that form our foundational teams — People, Finance, Counsel, Hardware, Information Sec...[show_more]
[last_updated.last_updated_30] • [promoted]