Machine Learning Engineer, LLM Fine-TuningFirst Soft Solutions LLC • San Jose, CA, US

Machine Learning Engineer, LLM Fine-Tuning

First Soft Solutions LLC • San Jose, CA, US

[job_card.variable_days_ago]

[job_preview.job_type]

[job_card.full_time]

[job_card.job_description]

Machine Learning Engineer, LLM Fine-Tuning

We are actively hiring for a Machine Learning Engineer focused on LLM fine-tuning for Verilog / RTL applications.

Location : San Jose, CA (Onsite)

Skills : LLM fine-tuning, Verilog / RTL, AWS, Bedrock, SageMaker

Responsibilities

Own the technical roadmap for Verilog / RTL-focused LLM capabilities—from model selection and adaptation to evaluation, deployment, and continuous improvement.

Lead a hands-on team of applied scientists / engineers : set direction, unblock technically, review designs / code, and raise the bar on experimentation velocity and reliability.

Fine-tune and customize models using state-of-the-art techniques (LoRA / QLoRA, PEFT, instruction tuning, preference optimization / RLAIF) with robust HDL-specific evals :

Compile- / lint- / simulate-based pass rates, pass@k for code generation, constrained decoding to enforce syntax, and "does-it-synthesize" checks.

Design privacy-first ML pipelines on AWS :

Training / customization and hosting using Amazon Bedrock and SageMaker (or EKS + KServe / Triton / DJL) for bespoke training needs.

Artifacts in S3 with KMS CMKs; isolated VPC subnets & PrivateLink (including Bedrock VPC endpoints), IAM least-privilege, CloudTrail auditing, and Secrets Manager for credentials.

Enforce encryption in transit / at rest, data minimization, no public egress for customer / RTL corpora.

Stand up dependable model serving : Bedrock model invocation where it fits, and / or low-latency self-hosted inference (vLLM / TensorRT-LLM), autoscaling, and canary / blue-green rollouts.

Build an evaluation culture : automatic regression suites that run HDL compilers / simulators, measure behavioral fidelity, and detect hallucinations / constraint violations; model cards and experiment tracking (MLflow / Weights & Biases).

Partner deeply with hardware design, CAD / EDA, Security, and Legal to source / prepare datasets (anonymization, redaction, licensing), define acceptance gates, and meet compliance requirements.

Drive productization : integrate LLMs with internal developer tools (IDEs / plug-ins, code review bots, CI), retrieval (RAG) over internal HDL repos / specs, and safe tool-use / function-calling.

Mentor & uplevel : coach ICs on LLM best practices, reproducible training, critical paper reading, and building secure-by-default systems.

Qualifications

10+ years total engineering experience with 5+ years in ML / AI or large-scale distributed systems; 3+ years working directly with transformers / LLMs.

Proven track record shipping LLM-powered features in production and leading ambiguous, cross-functional initiatives at Staff level.

Deep hands-on skill with PyTorch, Hugging Face Transformers / PEFT / TRL, distributed training (DeepSpeed / FSDP), quantization-aware fine-tuning (LoRA / QLoRA), and constrained / grammar-guided decoding.

AWS expertise to design and defend secure enterprise deployments : Bedrock, SageMaker, S3, EC2 / EKS / ECR, VPC / Subnets / Security Groups, IAM, KMS, PrivateLink, CloudWatch / CloudTrail, Step Functions, Batch, Secrets Manager.

Strong software engineering fundamentals : testing, CI / CD, observability, performance tuning; Python a must (bonus for Go / Java / C++).

Demonstrated ability to set technical vision and influence across teams; excellent written and verbal communication for execs and engineers.

Seniority Level

Mid-Senior level

Employment Type

Full-time

Job Function

Engineering and Information Technology

Industries

IT Services and IT Consulting

J-18808-Ljbffr

[job_alerts.create_a_job]

Machine Learning Engineer • San Jose, CA, US

[internal_linking.related_jobs]

Machine Learning Engineer - LLM, AI & Robotics

XPENG & Volkswagen Group • Santa Clara, CA, United States

[job_card.full_time]

XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electri...[show_more]

[last_updated.last_updated_30] • [promoted]

Sr Machine Learning Engineer - GenAI, LLM, Agentic AI

Eightfold • Santa Clara, California, United States

[job_card.full_time]

Research, design, development, and deployment of advanced AI agents and agentic systems.Architect and implement complex multi-agent systems, including planning, decision-making, and execution capab...[show_more]

[last_updated.last_updated_1_day] • [promoted]

Machine Learning Engineer - GenAI, LLM, Agentic AI

Nutanix • Santa Clara, CA, United States

[job_card.full_time]

We are building the next generation of our AI-powered talent platform, aiming to match the right career for everyone in the world. Our AI-native enterprise talent intelligence platform leverages Gen...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Machine Learning Engineer — Relevance & Production ML

hackajob • Palo Alto, California, United States

[job_card.full_time]

A leading technology company located in Palo Alto is seeking a Machine Learning Engineer for their Digital Intelligence team. The role includes leveraging large-scale computation and machine learnin...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Machine Learning Engineer - Relevance & Production ML

hackajob • Palo Alto, CA, US

[job_card.full_time]

[last_updated.last_updated_variable_days] • [promoted]

Machine Learning Engineer – LLM, AI & Robotics

XPENG • Santa Clara, CA, United States

[job_card.full_time]

AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics. With a strong focus on intelligent...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Machine Learning Engineer (Hybrid)

Cisco Systems, Inc. • Milpitas, CA, United States

[job_card.full_time]

Applications are accepted until 1 / 12 / 2026.Join us at Cisco to shape the future of enterprise AI.We are building an AI Platform Team focused on creating the next-generation foundation that powers in...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Machine Learning Engineer - GenAI, LLM, Agentic AI

Eightfold • Santa Clara, California, United States

[job_card.full_time]

[last_updated.last_updated_1_day] • [promoted]

Machine Learning Engineer : LLM, VLM / VLA and reasoning models

Tensor • San Jose, CA, US

[job_card.full_time]

Machine Learning Engineer : LLM, VLM / VLA and reasoning models Tensor is an agentic AI company dedicated to building agentic products that empower individual consumers. Our flagship product, the Tenso...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Staff Machine Learning Engineer / Principal ML Engineer

SRS Consulting Inc • San Jose, CA, United States

[job_card.full_time]

Role : Staff Machine Learning Engineer.Location : San Jose, CA (Onsite) Locals.Mode of Interview : Virtual & Final In-person. We're building privacy‐preserving LLM capabilities that help hardware desig...[show_more]

[last_updated.last_updated_variable_hours] • [promoted] • [new]

Staff Machine Learning Engineer (Applied ML)

Earnin • Mountain View, California, United States

[job_card.full_time]

As one of the first pioneers of earned wage access, our passion at EarnIn is building products that deliver real-time financial flexibility for those with the unique needs of living paycheck to pay...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer - CV / NLP / Multimodal LLM (TikTok Trust and Safety) - 2025 Start(Master[...]

TikTok • San Jose, CA, US

[job_card.full_time]

Overview Join to apply for the Machine Learning Engineer - CV / NLP / Multimodal LLM (TikTok Trust and Safety) - 2025 Start(Master / Bachelor) role at TikTok. Responsibilities The algorithm team is r...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer, ML Resources

Waymo • Mountain View, California, United States

[job_card.full_time]

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Machine Learning Engineer, Recommendation

Newsbreak • Mountain View, California, United States

[job_card.full_time]

NewsBreak is redefining the way users interact with local news and their communities.By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibr...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior Machine Learning Engineer II - LLM

Moveworks • Mountain View, California, United States

[job_card.full_time]

We are looking for a Machine Learning Engineer to help build cutting edge ML infrastructure for building and serving LLM’s at Moveworks. This role will be critical in building, optimizing and scali...[show_more]

[last_updated.last_updated_30] • [promoted]

Machine Learning Engineer 756

Protegrity • Palo Alto, California, United States

[filters.remote]

[job_card.full_time]

At Protegrity, we lead innovation by using AI and quantum-resistant cryptography to transform data protection across cloud-native, hybrid, on-premises, and open source environments.We leverage adva...[show_more]

[last_updated.last_updated_30] • [promoted]