Talent.com
Software Engineer - ML Performance
Software Engineer - ML PerformanceBaseten • San Ramon, California, United States
Software Engineer - ML Performance

Software Engineer - ML Performance

Baseten • San Ramon, California, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

ABOUT BASETEN

We’re a growing team of builders backed by top-tier investors, including IVP , Spark Capital , Greylock , and Sarah Guo at Conviction . ML teams at enterprises and category-defining AI-native companies like Descript , Bland.ai , Patreon , Writer , and Robust Intelligence use Baseten to power their core production workloads with best-in-class performance, security, and reliability. While we’ve unlocked PMF and secured Series B funding , the ML infrastructure market is massive, and we’re just getting started. If you’re excited to work on engaging and relevant problems while building something new from the ground up, come join us!

THE ROLE

Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role is ideal for someone who thrives in a fast-paced startup environment and is eager to make significant contributions to the exciting field of LLM Inference. If you are a backend engineer who thrives on making things faster and is excited about open-source ML models, we look forward to your application.

RESPONSIBILITIES :

Implement, refine, and productionize cutting-edge techniques (quantization, speculative decoding, kv cache reuse, chunked prefill and LoRA) for ML model inference and infrastructure.

Deep dive into underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vllm, sglang, CUDA, and other libraries to debug ML performance issues.

Apply and scale optimization techniques across a wide range of ML models, particularly large language models.

Collaborate with a diverse team to design and implement innovative solutions.

Own projects from idea to production.

REQUIREMENTS :

Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field.

Experience with one or more general-purpose programming languages, such as Python or C++.

Familiarity with LLM optimization techniques (e.g., quantization, speculative decoding, continuous batching).

Strong familiarity with ML libraries, especially PyTorch, TensorRT, or TensorRT-LLM.

Demonstrated interest and experience in LLM’s.

Deep understanding of GPU architecture.

BONUS POINTS :

Proficiency in enhancing the performance of software systems, particularly in the context of large language models (LLMs).

Experience with CUDA or similar technologies.

Deep understanding of software engineering principles and a proven track record of developing and deploying AI / ML inference solutions.

Experience with Docker and Kubernetes.

BENEFITS :

Competitive compensation package (Unlimited PTO, 401k, covered healthcare premiums).

This is a unique opportunity to be part of a rapidly growing startup in one of the most exciting engineering fields of our era.

An inclusive and supportive work culture that fosters learning and growth.

Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

Apply Now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

[job_alerts.create_a_job]

Software Engineer ML Performance • San Ramon, California, United States

[internal_linking.similar_jobs]
Software Engineer, ML Infra

Software Engineer, ML Infra

Newsbreak • Mountain View, California, United States
[job_card.full_time]
NewsBreak is redefining the way users interact with local news and their communities.By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibr...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior ML Platform Engineer : Scale LLM Infrastructure

Senior ML Platform Engineer : Scale LLM Infrastructure

GEICO • Palo Alto, CA, United States
[job_card.full_time]
A leading insurance company in California is seeking a Senior ML Platform Engineer to enhance their machine learning infrastructure. This role involves designing scalable systems for Large Language ...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Software Engineer, AI / ML GenAI, Core

Senior Software Engineer, AI / ML GenAI, Core

Google Inc. • Mountain View, CA, United States
[job_card.full_time]
Senior Software Engineer, AI / ML GenAI, Core.Experience driving progress, solving problems, and mentoring more junior team members. deeper expertise and applied knowledge within relevant area.Bachel...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior ML Engineer — GenAI & RAG Platforms

Senior ML Engineer — GenAI & RAG Platforms

Cisco Systems • Santa Clara, CA, United States
[job_card.full_time]
A global technology leader seeks an experienced engineer to develop AI-driven services and APIs for its hybrid, multi-cloud environment. Ideal candidates will have 5+ years in backend or distributed...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Staff Software Engineer, Machine Learning Trust & Safety

Staff Software Engineer, Machine Learning Trust & Safety

Match Group • Palo Alto, California, United States
[job_card.full_time]
Launched in 2012, Tinder® revolutionized how people meet, growing from 1 match to one billion matches in just two years.This rapid growth demonstrates its ability to fulfill a fundamental human nee...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Software Engineer - ML / LLM Serving

Senior Software Engineer - ML / LLM Serving

Alldus • San Jose, CA, United States
[job_card.full_time]
Senior Software Engineer - ML / LLM Serving.This range is provided by Alldus.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.Direct message the jo...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior ML Engineer - Code Understanding & Transformers

Senior ML Engineer - Code Understanding & Transformers

ShareThis • Palo Alto, California, United States
[job_card.full_time]
A technology company in Palo Alto seeks a Sr.Machine Learning Engineer to design, train, and evaluate machine learning models for code understanding and generation. Responsibilities include developi...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Hybrid ML Engineer — Large-Scale Recommender Systems

Hybrid ML Engineer — Large-Scale Recommender Systems

Pinterest • Palo Alto, California, United States
[job_card.full_time]
A leading social media platform in the United States is seeking Machine Learning Engineers to innovate and enhance user experiences. Candidates will contribute to cutting-edge projects in machine le...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Principal Software Engineer, ML Systems

Principal Software Engineer, ML Systems

Waymo • Martinez, CA, United States
[job_card.full_time]
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...[show_more]
[last_updated.last_updated_30] • [promoted]
Software Engineer L4, Machine Learning Platform (Metaflow)

Software Engineer L4, Machine Learning Platform (Metaflow)

Netflix • Los Gatos, California, United States
[job_card.full_time]
Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and lan...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Software Engineer, AI / ML

Senior Software Engineer, AI / ML

Striim, Inc. • Palo Alto, CA, United States
[job_card.full_time]
Striim, (pronounced “stream” with two i’s for integration and intelligence), is a unified data integration and streaming platform that connects clouds, data, and applications with unprecedented spe...[show_more]
[last_updated.last_updated_30] • [promoted]
Principal AI / ML System Software Engineer

Principal AI / ML System Software Engineer

d-Matrix • Santa Clara, CA, United States
[job_card.full_time]
AI to power the transformation of technology.We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. We value humility and believe in direct communic...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior ML Engineer — Pricing Platform & Systems

Senior ML Engineer — Pricing Platform & Systems

Uber • Sunnyvale, CA, United States
[job_card.full_time]
A leading technology company is seeking a Senior Staff Machine Learning Engineer to lead the development of advanced ML systems for courier pricing. The role demands extensive experience in building...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior ML Full-Stack Engineer for GenAI Apps

Senior ML Full-Stack Engineer for GenAI Apps

Adobe Inc. • San Jose, CA, United States
[job_card.full_time]
A leading digital experience company in California seeks a Senior Full-Stack Engineer to develop innovative GenAI features and products. The ideal candidate has over 10 years of experience in web ap...[show_more]
[last_updated.last_updated_30] • [promoted]
ML Platform Engineer : Scale Training Pipelines

ML Platform Engineer : Scale Training Pipelines

Samsung Electronics Perú • Mountain View, CA, United States
[job_card.full_time]
A leading technology company is seeking a Machine Learning Platform Engineer in Mountain View, CA.The role involves designing and developing advanced machine learning platforms for advertising, men...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff ML Engineer : LLM Fine-Tuning for RTL / Verilog

Staff ML Engineer : LLM Fine-Tuning for RTL / Verilog

Highbrow Technology Inc • San Jose, CA, United States
[job_card.full_time]
A prominent tech company in California seeks a Staff Machine Learning Engineer to lead the fine-tuning and deployment of LLM-based solutions for code workflows in secure environments.This role requ...[show_more]
[last_updated.last_updated_30] • [promoted]
Software Engineer Manager, Photos Agent / LLM Infrastructure

Software Engineer Manager, Photos Agent / LLM Infrastructure

jobr.pro • Mountain View, CA, United States
[job_card.full_time]
Bachelor’s degree, or equivalent practical experience.LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (language modeling, computer vision). Master's degree or PhD in Computer ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Software Engineer - AI / ML & Scalable Platform Lead

Senior Software Engineer - AI / ML & Scalable Platform Lead

COMFORT SYSTEMS • Sunnyvale, CA, United States
[job_card.full_time]
A leading retail company is seeking a Senior Software Engineer to lead the delivery of scalable software solutions.The role involves managing feature implementation, integrating AI / ML components, a...[show_more]
[last_updated.last_updated_variable_days] • [promoted]