Talent.com
Machine Learning Engineer - Fine-Tuning and On-device AI

HP IQ • Palo Alto, CA, US
Job type: Full-time

HP IQ is HP's new AI innovation lab. Combining startup agility with HP's global scale, we're building intelligent technologies that redefine how the world works, creates, and collaborates.

We're assembling a diverse, world-class team—engineers, designers, researchers, and product minds—focused on creating an intelligent ecosystem across HP's portfolio. Together, we're developing intuitive, adaptive solutions that spark creativity, boost productivity, and make collaboration seamless.

We create breakthrough solutions that make complex tasks feel effortless, teamwork more natural, and ideas more impactful—always with a human-centric mindset.

By embedding AI advancements into every HP product and service, we're expanding what's possible for individuals, organisations, and the future of work.

Join us as we reinvent work, so people everywhere can do their best work.

About the Role

We are seeking a Machine Learning Engineer to lead the fine-tuning, optimization, and deployment of AI models for diverse tasks, with a strong emphasis on on-device inference. You will work on cutting-edge applications such as orchestration, planning, multi-agent coordination, and other intelligent decision-making systems.

You will be responsible for adapting foundation models (LLMs, multimodal models) to specialized domains, making them fast, accurate, and efficient for resource-constrained environments—while ensuring robustness and safety.

What You Might Do

Fine-tune large language models, multimodal models, and task-specific models for orchestration, planning, and other workflows as defined.

Design and run experiments to improve task accuracy, robustness, and generalization.

Explore and apply methods such as full fine-tuning, LoRA, QLoRA, and other parameter-efficient fine-tuning techniques.

Employ advanced techniques such as QAT, DPO, and GRPO to further improve model quality.

On-Device Optimization

Prune, quantize, and compress models (e.g., INT8, INT4, mixed-precision) for CPU, GPU, NPU, and edge accelerators.

Optimize models for low-latency inference using frameworks such as OpenVINO, ONNX Runtime, QNN, etc.

Build robust data pipelines for domain-specific datasets, including synthetic data generation and annotation.

Define evaluation metrics, run evaluations, and analyze the results.

Establish best practices for versioning, reproducibility, and continuous improvement of model performance.

AI Orchestration & Planning

Develop and refine models to support multi-step reasoning, tool orchestration, and decision planning.

Work with stakeholders on orchestrator architecture.

Collaborate with product and research teams to design intelligent, context-aware assistant capabilities.

Essential Qualifications

5+ years of experience in applied machine learning, including at least 3 years in LLM fine-tuning.

Proficiency in Python and the ML framework ecosystem (Hugging Face, PyTorch).

Strong understanding of transformer architectures, attention mechanisms, and PEFT techniques.

Experience with on-device inference optimization (OpenVINO, ONNX, QNN).

Familiarity with orchestration / planning architectures and techniques for AI assistants.

Track record of delivering production-ready ML solutions in latency-sensitive environments.

Preferred Qualifications

Experience with multi-agent systems or AI assistant orchestration.

Familiarity with advanced inference optimization techniques such as KV cache paging and flash attention.

Knowledge of common inference engines, including but not limited to llama.cpp and vLLM.

Salary Range: $120,000 - $215,000

Compensation & Benefits (Full-Time Employees)

The salary range for this role is listed above. Final salary offered is based upon multiple factors including individual job-related qualifications, education, experience, knowledge and skills.

Health insurance

Vision insurance

Long-term / short-term disability insurance

Employee assistance program

Flexible spending account

Life insurance

Generous time-off policies, including:

4-12 weeks fully paid parental leave based on tenure

11 paid holidays

Additional flexible paid vacation and sick leave

HP IQ is HP's new AI innovation lab, building the intelligence to empower humanity—reimagining how we work, create, and connect to shape the future of work.

Innovative Work: Help shape the future of intelligent computing and workplace transformation.

Autonomy and Agility: Work with the speed and focus of a startup, backed by HP's scale.

Meaningful Impact: Build AI-powered solutions that help people and organisations thrive.

Flexible Work Environment: Freedom and flexibility to do your best work.

Forward-Thinking Culture: We learn fast, stay future-focused, and imagine what comes next, together.

Equal Opportunity Employer (EEO) Statement

HP, Inc. provides equal employment opportunity to all employees and prospective employees, without regard to race, color, religion, sex, national origin, ancestry, citizenship, sexual orientation, age, disability, or status as a protected veteran, marital status, familial status, physical or mental disability, medical condition, pregnancy, genetic predisposition or carrier status, uniformed service status, political affiliation or any other characteristic protected by applicable national, federal, state, and local law(s).

Please be assured that you will not be subject to any adverse treatment if you choose to disclose the information requested. This information is provided voluntarily. The information obtained will be kept in strict confidence.
