Talent.com
Model Deployment Engineer
Rime • San Francisco, CA, United States
Job type: Full-time

Location: San Francisco or Remote (U.S.)

About Rime

Rime builds enterprise-grade voice models that sound truly human — trusted by global telcos, healthcare systems, and leading brands to power billions of real customer interactions. Our mission is to set the standard for natural, controllable, and privacy-preserving voice AI.

We’re an early-stage team backed by top investors, growing quickly as enterprises adopt AI voice across customer experience, automation, and accessibility use cases.

Role Overview

We’re looking for a Model Deployment Engineer to bridge the gap between Rime’s cutting-edge voice models and our enterprise customers. You’ll be responsible for designing, deploying, and optimizing Rime’s on-prem and VPC implementations, ensuring our technology runs seamlessly inside complex enterprise environments.

This is a hands-on, customer-facing technical role — part Sales Engineer, part Forward Deployed Engineer. You’ll work closely with our Sales, Product, and Infrastructure teams to design solutions, troubleshoot deployments, and make sure our customers see value from day one.

What You’ll Do

  • Partner with Sales to scope and architect deployments for enterprise customers (on-prem, VPC, or hybrid).
  • Deploy and configure Rime’s inference runtimes, integrate APIs, and optimize GPU and network performance.
  • Build reference deployments, automation scripts, and Helm templates to streamline future implementations.
  • Train customer technical teams on deployment management, scaling, and observability best practices.
  • Act as the technical voice of the customer, providing structured feedback to Rime’s Product and Engineering teams.
  • Troubleshoot performance issues, manage escalations, and ensure high uptime and reliability post-launch.

Who You Are

  • 6+ years in Sales Engineering, Solutions Architecture, Forward Deployed Engineering, or DevOps roles supporting enterprise AI/ML systems.
  • Experienced with AI/ML inference workloads in on-prem, private cloud, or hybrid environments.
  • Proficient with Kubernetes, Docker, Helm, and major cloud providers (AWS, GCP, Azure).
  • Familiar with TTS or LLM inference, GPU optimization, and model-serving frameworks.
  • Strong communicator who can translate between enterprise IT, AI engineering, and business stakeholders.
  • You thrive in an early-stage environment and love rolling up your sleeves to make deployments work in production.

Why Join Rime

  • Be part of a team defining how enterprises adopt and deploy AI voice technology at scale.
  • Work directly with world-class engineers, linguists, and founders at the forefront of speech AI.
  • High ownership, zero bureaucracy, and the chance to build the deployment playbook for a fast-growing category leader.
  • Competitive compensation package, equity, and the opportunity to shape the future of AI voice infrastructure.