Engineering Manager - Model Performance

Baseten • San Francisco, CA, US

Job type: Full-time

Join Our Dynamic Team at Baseten

Join our dynamic team at Baseten, where we're revolutionizing AI deployment with cutting-edge inference infrastructure. Backed by premier investors such as IVP, Spark Capital, Greylock, and Conviction, we're trusted by leading enterprises and AI-driven innovators, including Descript, Bland.ai, Patreon, Writer, and Robust Intelligence, to deliver top-tier performance, security, and reliability for their production workloads. With our recent $75 million Series C funding, we're poised to accelerate our mission to make AI accessible across all products. If you're passionate about tackling impactful challenges and building transformative solutions from the ground up, we invite you to join us on this exciting journey!

The Role

Are you passionate about advancing the frontiers of artificial intelligence while leading a team of exceptional engineers? We are looking for a Tech Lead Manager focused on ML performance and inference. This role is ideal for someone with a strong engineering background who is eager to lead and mentor a team while remaining hands-on with technology. If you thrive in a fast-paced startup environment and are excited about both leadership and technical challenges, we want to hear from you.

Responsibilities

Lead, mentor, and manage a team of engineers focused on developing and optimizing ML model inference and performance.
Oversee technical strategy and architecture decisions, driving improvements across our engineering organization.
Collaborate with cross-functional teams to ensure seamless integration and scalability of ML models in production environments.
Dive into the codebase of frameworks like TensorRT, PyTorch, CUDA, and others to identify and solve complex performance bottlenecks.
Drive the development and deployment of large-scale optimization techniques for various ML models, especially large language models (LLMs).
Own the full lifecycle of projects from inception through delivery, including planning, execution, and resource management.
Foster a collaborative, inclusive team environment that encourages continuous learning and growth.

Requirements

Bachelor's, Master's, or Ph.D. in Computer Science, Engineering, or a related field.

5+ years of professional experience in software engineering, with at least 2 years in a technical leadership role.

Proven experience managing and mentoring teams of engineers.

Expertise in one or more programming languages, such as Python, C++, or Go.

In-depth understanding of ML model performance optimization, especially using libraries such as PyTorch, TensorRT, and CUDA.

Strong knowledge of containerization (Docker) and orchestration systems (Kubernetes).

Experience with production-level AI / ML solutions, including scaling and deploying large models.

Ability to balance hands-on technical work with team leadership and project management.

Bonus Points

Experience enhancing the performance of large language models (LLMs) or similar AI systems.

Familiarity with LLM optimization techniques such as quantization, speculative decoding, or continuous batching.

Deep knowledge of GPU architecture and performance tuning.

Previous experience in a high-growth startup environment.

Benefits

Competitive compensation package (unlimited PTO, 401(k), covered healthcare premiums).

An opportunity to lead a talented engineering team at a rapidly growing startup in the machine learning space.

Inclusive and supportive work culture with ample opportunities for professional development.

Exposure to a wide range of ML use cases, offering unmatched learning and networking potential.

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.
