Talent.com
Staff Software Engineer, Model Serving

Databricks Inc. • San Francisco, CA, United States
Job type: Full-time

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business.

Databricks’ Model Serving product provides enterprises with a unified, scalable, and governed platform to deploy and manage AI/ML models — from traditional ML to fine-tuned and proprietary large language models. It offers real-time, low-latency inference, governance, monitoring, and lineage. As AI adoption accelerates, Model Serving is a core pillar of the Databricks platform, enabling customers to operationalize models at scale with strong SLAs and cost efficiency.

As a Staff Engineer, you’ll play a critical role in shaping both the product experience and the foundational infrastructure of Model Serving. You will design and build systems that enable high-throughput, low-latency inference across CPU and GPU workloads, influence architectural direction, and collaborate closely across platform, product, infrastructure, and research teams to deliver a world-class serving platform.

The impact you will have

  • Design and implement core systems and APIs that power Databricks Model Serving, ensuring scalability, reliability, and operational excellence.
  • Partner with product and engineering leadership to define the technical roadmap and long-term architecture for serving workloads.
  • Drive architectural decisions and trade-offs to optimize performance, throughput, autoscaling, and operational efficiency for CPU and GPU serving workloads.
  • Contribute directly to key components across the serving infrastructure — from model container builds and deployment workflows to runtime systems like routing, caching, observability, and intelligent autoscaling — ensuring smooth and efficient operations at scale.
  • Collaborate cross-functionally with product, platform, and research teams to translate customer needs into reliable and performant systems.
  • Lead technical initiatives that improve latency, availability, and cost-effectiveness across both customer-facing and foundational serving layers.
  • Establish best practices for code quality, testing, and operational readiness, and mentor other engineers through design reviews and technical guidance.
  • Represent the team in cross-organizational technical discussions and influence Databricks’ broader AI platform strategy.

What we look for

  • 10+ years of experience building and operating large-scale distributed systems.
  • Deep expertise in model serving, inference systems, and related infrastructure (e.g., routing, scheduling, autoscaling, and observability).
  • Strong foundation in algorithms, data structures, and system design as applied to large-scale, low-latency serving systems.
  • Proven ability to deliver technically complex, high-impact initiatives that create measurable customer or business value.
  • Experience leading architecture for large-scale, performance-sensitive CPU/GPU inference systems.
  • Strong communication skills and ability to collaborate across teams in fast-moving environments.
  • Strategic and product-oriented mindset with the ability to align technical execution with long-term vision.
  • Passion for mentoring, growing engineers, and fostering technical excellence.

    Databricks is committed to fair and equitable compensation practices. The pay range for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location.

    Local Pay Range

    $192,000 — $260,000 USD

    About Databricks

    Databricks is the data and AI company. More than 10,000 organizations worldwide — including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark™, Delta Lake and MLflow. To learn more, follow Databricks on Twitter, LinkedIn and Facebook.

    Benefits

    At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit https://www.mybenefitsnow.com/databricks.

    Our Commitment to Diversity and Inclusion

    At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

    Compliance

    If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.
