Model [h1.location_city]
[job_alerts.create_a_job]
Model • berkeley ca
Senior Software Engineer, Model Inference
Apple Inc.San Francisco, CA, United StatesBackend Engineer - Third Party Model
FalSan Francisco, California, United StatesPrincipal, Replicate Model Marketplace
CloudflareSan Francisco, California, USA- [promoted]
Engineering Manager - Model Performance
BasetenSan Francisco, CA, USResearch Engineer, Model Evaluations
Menlo VenturesSan Francisco, CA, United States- [promoted]
Product Manager, Model Behavior
OpenAISan Francisco, CA, USScientist, iPSC Model Development
Kelly ServicesSan Francisco, CA, USSenior Software Engineer, Model Inference
AppleSan Francisco, CA, United StatesInference Engineer : Scalable AI Model Serving
Virtue AISan Francisco, CA, United States- [promoted]
Model
TradeJobsWorkForce94109 San Francisco, CA, US- [promoted]
Product Manager, Model Behavior
CartesiaSan Francisco, CA, US- [promoted]
Finance Operating Model Strategist - Manager
San Francisco StaffingSan Francisco, CA, USAI Infrastructure Engineer, Model Serving Platform
Scale AI, Inc.San Francisco, CA, United StatesModel
Academy of Art UniversitySan FranciscoSr. Manager, Engineering - Model Serving
Databricks Inc.San Francisco, CA, United StatesSoftware Engineer, Model Inference
OpenaiSan Francisco, California, United StatesStaff Data Scientist - Model Strategy Management
BlockSan Francisco, California, United States- [promoted]
Model N Flex Project Lead
E-SolutionsSan Francisco, CA, USTechnical Program Manager, Model Evaluations
AnthropicSan Francisco, California, USASenior Software Engineer, Model Inference
Apple Inc.San Francisco, CA, United States- [job_card.full_time]
Senior Software Engineer, Model Inference
San Francisco Bay Area, California, United States Software and Services
Join Apple Maps to help build the best map in the world. In this role on ML Platform, you will help bring advanced deep learning and large language models into high-volume, low-latency, highly available production serving, improving search quality and powering experiences across Maps. You will partner closely with research and product teams, take end-to-end ownership, and deliver measurable results at global scale.
Description
As a Software Engineer on the Apple Maps team, you will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models used across Maps, including deep learning and large language models. You will collaborate closely with research and product partners to bring models into production, with a strong focus on efficiency, reliability, and scalability. Your responsibilities span the full server stack, including onboarding new use cases, optimizing inference across heterogeneous accelerated compute hardware, deploying services on Kubernetes, building and integrating inference engines and control-plane components, and ensuring seamless integration with Maps infrastructure.
Responsibilities
- Own the technical architecture of large-scale ML inference platforms, defining long-term design direction for serving deep learning and large language models across Apple Maps.
- Lead system-level optimization efforts across the inference stack, balancing latency, throughput, accuracy, and cost through advanced techniques such as quantization, kernel fusion, speculative decoding, and efficient runtime scheduling.
- Design and evolve control-plane services responsible for model lifecycle management, including deployment orchestration, versioning, traffic routing, rollout strategies, capacity planning, and failure handling in production environments.
- Drive adoption of platform abstractions and standards that enable partner teams to onboard, deploy, and operate models reliably and efficiently at scale.
- Partner closely with research, product, and infrastructure teams to translate model requirements into production-ready systems, providing technical guidance and feedback to influence upstream model design.
- Optimize inference execution across heterogeneous compute environments, including GPUs and specialized accelerators, collaborating with runtime, compiler, and kernel teams to maximize hardware utilization.
- Establish robust observability and performance diagnostics, defining metrics, dashboards, and profiling workflows to proactively identify bottlenecks and guide optimization decisions.
- Provide technical leadership and mentorship, reviewing designs, setting engineering best practices, and raising the quality bar across teams contributing to the inference ecosystem.
- Continuously evaluate emerging research and industry trends in LLM inference, distributed systems, and ML infrastructure, driving the transition of high-impact ideas into production systems.
Minimum Qualifications
Preferred Qualifications
Compensation and Benefits
At Apple, base pay is one part of our total compensation package and is determined within a range. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. Additional benefits include comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and reimbursement for certain educational expenses—including tuition. This role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.
Note : Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.
Apple accepts applications to this posting on an ongoing basis.
#J-18808-Ljbffr