Talent.com

Model [h1.location_city]

[job_alerts.create_a_job]

Model • berkeley ca

[last_updated.last_updated_1_day]
Senior Software Engineer, Model Inference

Senior Software Engineer, Model Inference

Apple Inc.San Francisco, CA, United States
[job_card.full_time]
Senior Software Engineer, Model Inference.San Francisco Bay Area, California, United States Software and Services.Join Apple Maps to help build the best map in the world. In this role on ML Platform...[show_more][last_updated.last_updated_variable_days]
Backend Engineer - Third Party Model

Backend Engineer - Third Party Model

FalSan Francisco, California, United States
[job_card.full_time]
Backend Engineer - Third Party Model.This role is ideal for engineers who want to be on the forefront of the GenAI media revolution. Utilize your deep experience with backend APIs, robust http clien...[show_more][last_updated.last_updated_30]
Principal, Replicate Model Marketplace

Principal, Replicate Model Marketplace

CloudflareSan Francisco, California, USA
[job_card.full_time]
At Cloudflare we are on a mission to help build a better Internet.Today the company runs one of the worlds largest networks that powers millions of websites and other Internet properties for custom...[show_more][last_updated.last_updated_variable_days]
  • [promoted]
Engineering Manager - Model Performance

Engineering Manager - Model Performance

BasetenSan Francisco, CA, US
[job_card.full_time]
Join Our Dynamic Team at Baseten.Join our dynamic team at Baseten, where we're revolutionizing AI deployment with cutting-edge inference infrastructure. Backed by premier investors such as IVP, Spar...[show_more][last_updated.last_updated_30]
Research Engineer, Model Evaluations

Research Engineer, Model Evaluations

Menlo VenturesSan Francisco, CA, United States
[job_card.full_time]
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...[show_more][last_updated.last_updated_variable_days]
  • [promoted]
Product Manager, Model Behavior

Product Manager, Model Behavior

OpenAISan Francisco, CA, US
[job_card.full_time]
Product Manager For Model Behavior Team.The Model Behavior team is responsible for how OpenAI's models behave.We're focused on making current and future models better for people at scaleimproving e...[show_more][last_updated.last_updated_30]
Scientist, iPSC Model Development

Scientist, iPSC Model Development

Kelly ServicesSan Francisco, CA, US
[job_card.full_time]
Kelly® Science & Clinical is seeking a Scientist for a future opportunity with a client in the Bay Area.If you are driven by the prospect of translating cutting-edge scientific discoveries into...[show_more][last_updated.last_updated_30]
Senior Software Engineer, Model Inference

Senior Software Engineer, Model Inference

AppleSan Francisco, CA, United States
[job_card.full_time]
Weekly Hours : • • 40 • •Role Number : • • 200638185-3401 • •Summary • • Join Apple Maps to help build the best map in the world. In this role on ML Platform, you will help bring advanced deep learning a...[show_more][last_updated.last_updated_1_day]
Inference Engineer : Scalable AI Model Serving

Inference Engineer : Scalable AI Model Serving

Virtue AISan Francisco, CA, United States
[job_card.full_time]
An innovative AI security company in San Francisco is seeking an Inference Engineer who will be pivotal in optimizing ML model inferences. The role requires deep knowledge of serving LLMs and experi...[show_more][last_updated.last_updated_variable_days]
  • [promoted]
Model

Model

TradeJobsWorkForce94109 San Francisco, CA, US
[job_card.full_time]
Pose for artists and photographers in various outfits and collaborate with directors, photographers, and designers to achieve the desired outcome. Interact with agents, designers, photographers, and...[show_more][last_updated.last_updated_30]
  • [promoted]
Product Manager, Model Behavior

Product Manager, Model Behavior

CartesiaSan Francisco, CA, US
[job_card.full_time]
We're seeking an exceptional Product Manager to drive model quality and behavior excellence for our text-to-speech and speech-to-text products at Cartesia. As our Model Behavior PM, you'll be the br...[show_more][last_updated.last_updated_30]
  • [promoted]
Finance Operating Model Strategist - Manager

Finance Operating Model Strategist - Manager

San Francisco StaffingSan Francisco, CA, US
[job_card.full_time]
We partner with finance executives to drive value across the enterprise.As finance leaders move into business partner roles, they need processes, technology, and people to help drive efficiencies, ...[show_more][last_updated.last_updated_30]
AI Infrastructure Engineer, Model Serving Platform

AI Infrastructure Engineer, Model Serving Platform

Scale AI, Inc.San Francisco, CA, United States
[job_card.full_time]
As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and product...[show_more][last_updated.last_updated_30]
Model

Model

Academy of Art UniversitySan Francisco
[job_card.part_time]
Do these words motivate you? If so, then we want to talk with you.Academy of Art University offers a rewarding employment experience for those who excel in a dynamic environment and who can consist...[show_more][last_updated.last_updated_30]
Sr. Manager, Engineering - Model Serving

Sr. Manager, Engineering - Model Serving

Databricks Inc.San Francisco, CA, United States
[job_card.full_time]
At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical ...[show_more][last_updated.last_updated_variable_days]
Software Engineer, Model Inference

Software Engineer, Model Inference

OpenaiSan Francisco, California, United States
[job_card.full_time]
Our team brings OpenAI’s most capable research and technology to the world through our products.We empower consumers, enterprise and developers alike to use and access our start-of-the-art AI model...[show_more][last_updated.last_updated_30]
Staff Data Scientist - Model Strategy Management

Staff Data Scientist - Model Strategy Management

BlockSan Francisco, California, United States
[job_card.full_time]
Block is one company built from many blocks, all united by the same purpose of economic empowerment.The blocks that form our foundational teams — People, Finance, Counsel, Hardware, Information Sec...[show_more][last_updated.last_updated_30]
  • [promoted]
Model N Flex Project Lead

Model N Flex Project Lead

E-SolutionsSan Francisco, CA, US
[job_card.full_time]
Provide end-to-end leadership for Model N Flex programs within the Pharma Access landscape.[show_more][last_updated.last_updated_30]
Technical Program Manager, Model Evaluations

Technical Program Manager, Model Evaluations

AnthropicSan Francisco, California, USA
[job_card.full_time]
Anthropics mission is to create reliable interpretable and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of ...[show_more][last_updated.last_updated_variable_days]
Senior Software Engineer, Model Inference

Senior Software Engineer, Model Inference

Apple Inc.San Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Senior Software Engineer, Model Inference

San Francisco Bay Area, California, United States Software and Services

Join Apple Maps to help build the best map in the world. In this role on ML Platform, you will help bring advanced deep learning and large language models into high-volume, low-latency, highly available production serving, improving search quality and powering experiences across Maps. You will partner closely with research and product teams, take end-to-end ownership, and deliver measurable results at global scale.

Description

As a Software Engineer on the Apple Maps team, you will lead the design and implementation of large-scale, high-performance inference services that support a wide range of models used across Maps, including deep learning and large language models. You will collaborate closely with research and product partners to bring models into production, with a strong focus on efficiency, reliability, and scalability. Your responsibilities span the full server stack, including onboarding new use cases, optimizing inference across heterogeneous accelerated compute hardware, deploying services on Kubernetes, building and integrating inference engines and control-plane components, and ensuring seamless integration with Maps infrastructure.

Responsibilities

  • Own the technical architecture of large-scale ML inference platforms, defining long-term design direction for serving deep learning and large language models across Apple Maps.
  • Lead system-level optimization efforts across the inference stack, balancing latency, throughput, accuracy, and cost through advanced techniques such as quantization, kernel fusion, speculative decoding, and efficient runtime scheduling.
  • Design and evolve control-plane services responsible for model lifecycle management, including deployment orchestration, versioning, traffic routing, rollout strategies, capacity planning, and failure handling in production environments.
  • Drive adoption of platform abstractions and standards that enable partner teams to onboard, deploy, and operate models reliably and efficiently at scale.
  • Partner closely with research, product, and infrastructure teams to translate model requirements into production-ready systems, providing technical guidance and feedback to influence upstream model design.
  • Optimize inference execution across heterogeneous compute environments, including GPUs and specialized accelerators, collaborating with runtime, compiler, and kernel teams to maximize hardware utilization.
  • Establish robust observability and performance diagnostics, defining metrics, dashboards, and profiling workflows to proactively identify bottlenecks and guide optimization decisions.
  • Provide technical leadership and mentorship, reviewing designs, setting engineering best practices, and raising the quality bar across teams contributing to the inference ecosystem.
  • Continuously evaluate emerging research and industry trends in LLM inference, distributed systems, and ML infrastructure, driving the transition of high-impact ideas into production systems.

Minimum Qualifications

  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
  • 5+ years in software engineering focused on ML inference, GPU acceleration, and large-scale systems.
  • Expertise in deploying and optimizing LLMs for high-performance, production-scale inference.
  • Proficiency in Python, Java or C++.
  • Experience with deep learning frameworks like PyTorch, TensorFlow, and Hugging Face Transformers.
  • Experience with model serving tools (e.g., NVIDIA Triton, TensorFlow Serving, VLLM, etc).
  • Experience with optimization techniques like Attention Fusion, Quantization, and Speculative Decoding.
  • Skilled in GPU optimization (e.g., CUDA, TensorRT-LLM, cuDNN) to accelerate inference tasks.
  • Skilled in cloud technologies like Kubernetes, Ingress, HAProxy for scalable deployment.
  • Preferred Qualifications

  • Master’s or PhD in Computer Science, Machine Learning, or a related field.
  • Understanding of ML Ops practices, continuous integration, and deployment pipelines for machine learning models.
  • Familiarity with model distillation, low-rank approximations, and other model compression techniques for reducing memory footprint and improving inference speed.
  • Strong understanding of distributed systems, multi-GPU / multi-node parallelism, and system-level optimization for large-scale inference.
  • Compensation and Benefits

    At Apple, base pay is one part of our total compensation package and is determined within a range. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.

    Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. Additional benefits include comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and reimbursement for certain educational expenses—including tuition. This role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.

    Note : Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

    Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

    Apple accepts applications to this posting on an ongoing basis.

    #J-18808-Ljbffr