Talent.com
Senior Software Engineer, Model Inference
Apple • San Francisco, CA, United States
Job type
  • Full-time
Job description
  • Weekly Hours: 40
  • Role Number: 200638185-3401
  • Summary
  • Join Apple Maps to help build the best map in the world. In this role on ML Platform, you will help bring advanced deep learning and large language models into high-volume, low-latency, highly available production serving, improving search quality and powering experiences across Maps. You will partner closely with research and product teams, take end-to-end ownership, and deliver measurable results at global scale.

    • Description
    • As a Software Engineer on the Apple Maps team, you will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models used across Maps, including deep learning and large language models. You will collaborate closely with research and product partners to bring models into production, with a strong focus on efficiency, reliability, and scalability. Your responsibilities span the full server stack, including onboarding new use cases, optimizing inference across heterogeneous accelerated compute hardware, deploying services on Kubernetes, building and integrating inference engines and control-plane components, and ensuring seamless integration with Maps infrastructure.

    • Minimum Qualifications
    • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
    • 5+ years in software engineering focused on ML inference, GPU acceleration, and large-scale systems.
    • Expertise in deploying and optimizing LLMs for high-performance, production-scale inference.
    • Proficiency in Python, Java, or C++.
    • Experience with deep learning frameworks like PyTorch, TensorFlow, and Hugging Face Transformers.
    • Experience with model serving tools (e.g., NVIDIA Triton, TensorFlow Serving, vLLM).
    • Experience with optimization techniques like Attention Fusion, Quantization, and Speculative Decoding.
    • Skilled in GPU optimization (e.g., CUDA, TensorRT-LLM, cuDNN) to accelerate inference tasks.
    • Skilled in cloud technologies such as Kubernetes, Ingress, and HAProxy for scalable deployment.
    • Preferred Qualifications
    • Master's or PhD in Computer Science, Machine Learning, or a related field.
    • Understanding of MLOps practices, continuous integration, and deployment pipelines for machine learning models.
    • Familiarity with model distillation, low-rank approximations, and other model compression techniques for reducing memory footprint and improving inference speed.
    • Strong understanding of distributed systems, multi-GPU / multi-node parallelism, and system-level optimization for large-scale inference.
    • Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf).
