Talent.com
Senior Software Engineer, Model Inference
Senior Software Engineer, Model InferenceApple • San Francisco, CA, United States
Senior Software Engineer, Model Inference

Senior Software Engineer, Model Inference

Apple • San Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]
  • Weekly Hours :
  • 40
  • Role Number :
  • 200638185-3401
  • Summary
  • Join Apple Maps to help build the best map in the world. In this role on ML Platform, you will help bring advanced deep learning and large language models into high-volume, low-latency, highly available production serving, improving search quality and powering experiences across Maps. You will partner closely with research and product teams, take end-to-end ownership, and deliver measurable results at global scale.

    • Description
    • As a Software Engineer on the Apple Maps team, you will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models used across Maps, including deep learning and large language models. You will collaborate closely with research and product partners to bring models into production, with a strong focus on efficiency, reliability, and scalability. Your responsibilities span the full server stack, including onboarding new use cases, optimizing inference across heterogeneous accelerated compute hardware, deploying services on Kubernetes, building and integrating inference engines and control-plane components, and ensuring seamless integration with Maps infrastructure.

    • Minimum Qualifications
    • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
    • 5+ years in software engineering focused on ML inference, GPU acceleration, and large-scale systems.
    • Expertise in deploying and optimizing LLMs for high-performance, production-scale inference.
    • Proficiency in Python, Java or C++.
    • Experience with deep learning frameworks like PyTorch, TensorFlow, and Hugging Face Transformers.
    • Experience with model serving tools (e.g., NVIDIA Triton, TensorFlow Serving, VLLM, etc)
    • Experience with optimization techniques like Attention Fusion, Quantization, and Speculative Decoding.
    • Skilled in GPU optimization (e.g., CUDA, TensorRT-LLM, cuDNN) to accelerate inference tasks.
    • Skilled in cloud technologies like Kubernetes, Ingress, HAProxy for scalable deployment.
    • Preferred Qualifications
    • Master's or PhD in Computer Science, Machine Learning, or a related field.
    • Understanding of ML Ops practices, continuous integration, and deployment pipelines for machine learning models.
    • Familiarity with model distillation, low-rank approximations, and other model compression techniques for reducing memory footprint and improving inference speed.
    • Strong understanding of distributed systems, multi-GPU / multi-node parallelism, and system-level optimization for large-scale inference.
    • Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant (https : / / www.eeoc.gov / sites / default / files / 2023-06 / 22-088\_EEOC\_KnowYourRights6.12ScreenRdr.pdf) .

    [job_alerts.create_a_job]

    Senior Software Engineer • San Francisco, CA, United States

    [internal_linking.similar_jobs]
    Senior Engineer, Model Serving & Inference

    Senior Engineer, Model Serving & Inference

    Databricks • San Francisco, CA, United States
    [job_card.full_time]
    A leading data and AI company is seeking a Senior Software Engineer, Model Serving to design and implement core systems that ensure scalability and operational excellence.You will drive architectur...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer, Model Serving

    Senior Software Engineer, Model Serving

    Databricks Inc. • San Francisco, CA, United States
    [job_card.full_time]
    At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior ML Inference Platform Engineer

    Senior ML Inference Platform Engineer

    Baseten • San Francisco, CA, United States
    [job_card.full_time]
    A prominent AI company in San Francisco is seeking a Senior Software Engineer specializing in Infrastructure.The role involves architecting and developing the ML inference platform to support produ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Software Engineer, AI / ML

    Senior Software Engineer, AI / ML

    GLUE • San Francisco, CA, United States
    [job_card.full_time]
    Glue is a well-funded startup working on the next generation of work communication tools.We believe that today’s work chat is noisy, unstructured, and not designed for productivity.We’re drawing fr...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer - Machine Learning

    Senior Software Engineer - Machine Learning

    Rippling • San Francisco, CA, United States
    [job_card.full_time]
    Senior Software Engineer - Machine Learning.Rippling gives businesses one place to run HR, IT, and Finance.It brings together all of the workforce systems that are normally scattered across a compa...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer, Observability

    Senior Software Engineer, Observability

    Together AI • San Francisco, CA, United States
    [job_card.full_time]
    Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastruct...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer, Machine Learning

    Senior Software Engineer, Machine Learning

    Planet Labs PBC • San Francisco, CA, United States
    [job_card.full_time]
    We believe in using space to help life on Earth.Planet designs, builds, and operates the largest constellation of imaging satellites in history. This constellation delivers an unprecedented dataset ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    1.12 Senior AI Software Engineer — Edge Model Optimization & Deployment

    1.12 Senior AI Software Engineer — Edge Model Optimization & Deployment

    Field AI • San Francisco, CA, United States
    [job_card.full_time]
    Senior AI Software Engineer — Edge Model Optimization & Deployment.Senior AI Software Engineer — Edge Model Optimization & Deployment. Be among the first 25 applicants.Field AIis transforming how ro...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer – AI Agents

    Senior Software Engineer – AI Agents

    FleetWorks • San Francisco, California, United States
    [job_card.full_time]
    Every year, companies spend over a trillion dollars moving freight across the U.We’re building voice agents that transform the chaotic freight booking process into a modern, intelligent marketplace...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Software Engineer, Model Inference

    Software Engineer, Model Inference

    OpenAI • San Francisco, CA, United States
    [job_card.full_time]
    Our Inference team brings OpenAI's most capable research and technology to the world through our products.We empower consumers, enterprise and developers alike to use and access our start-of-the-ar...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer, Intelligence — AI Retrieval

    Senior Software Engineer, Intelligence — AI Retrieval

    AngelList • San Francisco, CA, United States
    [job_card.full_time]
    A growing tech company is seeking a Senior Software Engineer to design and implement systems that power data retrieval and search functionalities. The ideal candidate should have extensive experienc...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Software Engineer ($160K – $250K + Equity) at Series B Multimodal AI Lab

    Senior Software Engineer ($160K – $250K + Equity) at Series B Multimodal AI Lab

    Jack & Jill / External ATS • San Francisco, California, United States
    [job_card.full_time]
    Senior Software Engineer Salary : .Series B backed multimodal AI lab.Job Description You will lead the technical development of a real‑time conversational video interface, bridging the gap between hu...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Software Engineer, Inference

    Software Engineer, Inference

    Trypulse • San Francisco, CA, United States
    [job_card.full_time]
    Pulse is tackling one of the most persistent challenges in data infrastructure : extracting accurate, structured information from complex documents at scale. We have a breakthrough approach to docume...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer, Model Inference

    Senior Software Engineer, Model Inference

    Apple Inc. • San Francisco, CA, United States
    [job_card.full_time]
    Senior Software Engineer, Model Inference.San Francisco Bay Area, California, United States Software and Services.Join Apple Maps to help build the best map in the world. In this role on ML Platform...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior AI / ML Software Engineer – Inference on Neuron

    Senior AI / ML Software Engineer – Inference on Neuron

    Amazon • San Francisco, CA, United States
    [job_card.full_time]
    A leading technology company in Herndon, Virginia is seeking a Senior Software Development Engineer to work on AI / ML projects. You will design and optimize machine learning models for deployment on ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Software Engineer - Intelligence

    Senior Software Engineer - Intelligence

    Hard Yaka • San Francisco, CA, United States
    [job_card.full_time]
    We exist to accelerate innovation.We do this by giving more people the opportunity to participate in the venture economy by building the financial infrastructure that makes it possible for more peo...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Software Engineer, Inference

    Software Engineer, Inference

    Anthropic • San Francisco, CA, United States
    [job_card.full_time]
    Senior / Staff Software Engineer, Inference.Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Software Engineer, 3D Modeling

    Senior Software Engineer, 3D Modeling

    HOVER • San Francisco, CA, United States
    [job_card.full_time]
    Hover helps people design, improve, and protect the properties they love.With proprietary AI built on over a decade of real property data, Hover answers age-old questions like “What will it look li...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]