Talent.com
Staff Software Engineer, Foundational Model Serving

Menlo Ventures • San Francisco, CA, United States
Job type: Full-time

Overview

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems, from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business.

Foundation Model Serving is the API product for hosting and serving frontier AI model inference for open-source models like Llama, Qwen, and GPT-OSS as well as proprietary models like Claude and OpenAI GPT. For this role, no prior ML or AI experience is necessary. We're looking for engineers who have owned high-scale, operationally sensitive systems such as customer-facing APIs, edge gateways, ML inference, or similar services and have an interest in building LLM APIs and runtimes at scale.

As a Staff Engineer, you'll play a critical role in shaping both the product experience and core infrastructure. You will design and build systems that enable high-throughput, low-latency inference on GPU workloads with frontier models, influence architectural direction, and collaborate closely across platform, product, infrastructure, and research teams to deliver a world-class foundation model API product.

Impact you will have

  • Design and implement core systems and APIs that power Databricks Foundation Model Serving, ensuring scalability, reliability, and operational excellence.
  • Partner with product and engineering leadership to define the technical roadmap and long-term architecture for serving workloads.
  • Drive architectural decisions and trade-offs to optimize performance, throughput, autoscaling, and operational efficiency for GPU serving workloads.
  • Contribute directly to key components across the serving infrastructure, from working in systems like vLLM and SGLang to creating token-based rate limiters and optimizers, ensuring smooth and efficient operations at scale.
  • Collaborate cross-functionally with product, platform, and research teams to translate customer needs into reliable and performant systems.
  • Establish best practices for code quality, testing, and operational readiness, and mentor other engineers through design reviews and technical guidance.
  • Represent the team in cross-organizational technical discussions and influence Databricks' broader AI platform strategy.

What we look for

  • 10+ years of experience building and operating large-scale distributed systems.
  • Experience leading high-scale, operationally sensitive backend systems.
  • A track record of up-leveling teams' engineering excellence.
  • Strong foundation in algorithms, data structures, and system design as applied to large-scale, low-latency serving systems.
  • Proven ability to deliver technically complex, high-impact initiatives that create measurable customer or business value.
  • Strong communication skills and ability to collaborate across teams in fast-moving environments.
  • Strategic and product-oriented mindset with the ability to align technical execution with long-term vision.
  • Passion for mentoring, growing engineers, and fostering technical excellence.

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilizing the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in, visit our page here.

Local Pay Range: $192,000-$260,000 USD

About Databricks

Databricks is the data and AI company. More than 10,000 organizations worldwide, including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500, rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe, and was founded by the original creators of Lakehouse, Apache Spark, Delta Lake and MLflow.

To learn more, follow Databricks on Twitter, LinkedIn and Facebook.

Benefits: At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit https://www.mybenefitsnow.com/databricks.

Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

Compliance

If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.
