Talent.com
Staff Software Engineer, Foundational Model Serving
Staff Software Engineer, Foundational Model ServingMenlo Ventures • San Francisco, CA, United States
Staff Software Engineer, Foundational Model Serving

Staff Software Engineer, Foundational Model Serving

Menlo Ventures • San Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Overview

At Databricks, we are passionate about enabling data teams to solve the worlds toughest problems from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the worlds best data and AI infrastructure platform so our customers can use deep data insights to improve their business.

Foundation Model Serving is the API Product for hosting and serving frontier AI model inference for open?source models like Llama, Qwen, and GPT?OSS as well as proprietary models like Claude and OpenAI GPT. For this role, no prior ML or AI experience is necessary. Were looking for engineers who have owned high?scale operationally sensitive systems such as customer?facing APIs, Edge Gateways, ML inference, or similar services and have an interest in building LLM APIs and runtimes at scale.

As a Staff Engineer, youll play a critical role in shaping both the product experience and core infrastructure. You will design and build systems that enable high?throughput, low?latency inference on GPU workloads with frontier models, influence architectural direction, and collaborate closely across platform, product, infrastructure, and research teams to deliver a world?class foundation model API product.

Impact you will have

  • Design and implement core systems and APIs that power Databricks Foundation Model Serving, ensuring scalability, reliability, and operational excellence.
  • Partner with product and engineering leadership to define the technical roadmap and long?term architecture for serving workloads.
  • Drive architectural decisions and trade?offs to optimize performance, throughput, autoscaling, and operational efficiency for GPU serving workloads.
  • Contribute directly to key components across the serving infrastructure from working in systems like vLLM and SGLang to creating token?based rate limiters and optimizers ensuring smooth and efficient operations at scale.
  • Collaborate cross?functionally with product, platform, and research teams to translate customer needs into reliable and performant systems.
  • Establish best practices for code quality, testing, and operational readiness, and mentor other engineers through design reviews and technical guidance.
  • Represent the team in cross?organizational technical discussions and influence Databricks broader AI platform strategy.

What we look for

  • 10+ years of experience building and operating large?scale distributed systems.
  • Experience leading high?scale operationally sensitive backend systems.
  • A track record of up?leveling teams engineering excellence.
  • Strong foundation in algorithms, data structures, and system design as applied to large?scale, low?latency serving systems.
  • Proven ability to deliver technically complex, high?impact initiatives that create measurable customer or business value.
  • Strong communication skills and ability to collaborate across teams in fast?moving environments.
  • Strategic and product?oriented mindset with the ability to align technical execution with long?term vision.
  • Passion for mentoring, growing engineers, and fostering technical excellence.
  • Pay Range Transparency

    Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents the expected salary range for non?commissionable roles or on?target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job?related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilizing the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in visit our page here.

    Local Pay Range : $192,000 $260,000 USD

    About Databricks

    Databricks is the data and AI company. More than 10,000 organizations worldwide including Comcast, Cond?Nast, Grammarly, and over 50% of the Fortune?500 rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San?Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache?Spark, Delta?Lake and MLflow.

    To learn more, follow Databricks on Twitter, LinkedIn and Facebook.

    Benefits : At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit https : / / www.mybenefitsnow.com / databricks.

    Our Commitment to Diversity and Inclusion

    At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio?economic status, veteran status, and other protected characteristics.

    Compliance

    If access to export?controlled technology or source code is required for performance of job duties, it is within Employers discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

    #J-18808-Ljbffr

    [job_alerts.create_a_job]

    Staff Software Engineer Foundational Model Serving • San Francisco, CA, United States

    [internal_linking.similar_jobs]
    Staff Software Engineer - Origination

    Staff Software Engineer - Origination

    PayJoy • San Francisco, California, US
    [job_card.full_time]
    Job Description Job Description About PayJoy PayJoy is a mission-first credit provider dedicated to helping under-served customers in emerging markets to achieve financial stability and success....[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Artera • San Francisco, California, US
    [job_card.full_time]
    Job Description Job Description ABOUT ARTERA Our Mission : Make healthcare #1 in customer service.What We Deliver : Artera, a SaaS leader in digital health, transforms patient experience with AI-p...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Linden Lab • San Francisco, California, US
    [job_card.full_time]
    Job Description Job Description Our Story and Purpose Linden Lab develops platforms that empower people to create, connect, and thrive through transformative virtual experiences.Since our foundin...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    LoopMe • San Francisco, California, US
    [job_card.full_time]
    Job Description Job Description • • •LoopMe is one of Campaign's Best Places to Work 2023 AND 2024! • • • Our vision is to change advertising for the better. LoopMe's technology brings together adverti...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer (Remote)

    Staff Software Engineer (Remote)

    Imply • Burlingame, California, US
    [filters.remote]
    [job_card.full_time]
    Job Description Job Description At Imply, our mission is to empower people and organizations to achieve more with their data. We believe that better insights lead to better decisions, and that the...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior / Staff Software Engineer

    Senior / Staff Software Engineer

    Joyous • Foster City, California, US
    [job_card.full_time]
    Job Description Job Description Join Joyous in our mission to revolutionize mental health care.As pioneers, we leverage very low dose (VLD) ketamine, AI-powered treatments, and advanced technolog...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer, Product

    Staff Software Engineer, Product

    Cleric • San Francisco, CA, United States
    [job_card.full_time]
    We're building an autonomous AI SRE that helps software engineering teams reliably investigate production incidents.Our agent combines LLMs with tools to understand systems, reason through problems...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer - Product Foundations

    Staff Software Engineer - Product Foundations

    Plaid • San Francisco, California, US
    [job_card.full_time]
    Job Description Job Description Network Foundations is Plaid's authoritative source of truth for the user lifecycle, powering user recognition across integration paths, authentication with Plaid,...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer - GenAI inference

    Staff Software Engineer - GenAI inference

    Menlo Ventures • San Francisco, CA, United States
    [job_card.full_time]
    As a staff software engineer for GenAI inference, you will lead the architecture, development, and optimization of the inference engine that powers Databricks Foundation Model API.Youll bridge rese...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Alluxio • Foster City, California, US
    [job_card.full_time]
    Job Description Job Description About Alluxio Alluxio powers the data layer for modern AI and analytics.Proven in production at eight of the top ten internet companies and seven of the ten highes...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer, ML Serving Platform

    Staff Software Engineer, ML Serving Platform

    San Francisco Staffing • San Francisco, CA, United States
    [job_card.full_time]
    DoorDash is building the world's most reliable on-demand logistics engine.Behind the scenes, our Machine Learning Platform (MLP) powers critical real-time decision-making for millions of orders eac...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Idler • San Francisco, California, US
    [job_card.full_time]
    Job Description Job Description What we do Idler builds reinforcement learning environments that teach AI models to code like 0. Our training environments are based on real-world coding scenarios t...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Virco Talent • San Francisco, California, US
    [job_card.full_time]
    Job Description Job Description Employment Type : Full-time Location : San Francisco / New York City Salary : $190K - $230K Equity : Competitive Visa Sponsorship : Sponsors OPT and H1B transfers; no new...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Allergan Aesthetics • San Francisco, California, US
    [job_card.full_time]
    Job Description Job Description Company Description At Allergan Aesthetics, an AbbVie company, we develop, manufacture, and market a portfolio of leading aesthetics brands and products.Our aesthe...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer - Partner Platform, APIs & Ecosystem Services

    Staff Software Engineer - Partner Platform, APIs & Ecosystem Services

    Quizlet • San Francisco, California, US
    [job_card.full_time]
    Job Description Job Description About Quizlet : At Quizlet, our mission is to help every learner achieve their outcomes in the most effective and delightful way. Our $1B+ learning platform serves ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer, Trading Platforms

    Staff Software Engineer, Trading Platforms

    The Voleon Group • Berkeley, California, US
    [job_card.full_time]
    Job Description Job Description Voleon is a technology company that applies state-of-the-art AI and machine learning techniques to real-world problems in finance. For nearly two decades, we have l...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer, Database Systems

    Staff Software Engineer, Database Systems

    Zilliz • Redwood City, California, US
    [job_card.full_time]
    Job Description Job Description Zilliz is a fast-growing startup developing the industry's leading vector database company for enterprise-grade AI. Founded by the engineers behind Milvus, the worl...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Software Engineer

    Staff Software Engineer

    Okta, Inc. • San Francisco, CA, United States
    [job_card.full_time]
    Okta is The World's Identity Company.We free everyone to safely use any technology, anywhere, on any device or app.Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secur...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]