Staff Machine Learning Engineer – SMLENGG 25-33679
Compu-Vision Consulting, Inc. • San Jose, California, United States
Job type : Full-time

Staff Machine Learning Engineer, LLM Fine-Tuning (Verilog / RTL Applications)

Level : Staff

Location : San Jose, CA

Cloud : AWS (primary : Bedrock and SageMaker)

Why this role exists

You will architect and lead privacy-preserving LLM capabilities that support hardware design teams working with Verilog / SystemVerilog and RTL artifacts. This includes code generation, refactoring, lint explanation, constraint translation, and spec-to-RTL assistance. You'll lead a small, high-leverage team focused on fine-tuning and productizing LLMs in a strict enterprise data-privacy environment.

You do not need deep RTL expertise to start; curiosity, LLM craftsmanship, and strong engineering rigor matter most. Exposure to HDL / EDA tooling is a plus.

Responsibilities

Technical Leadership & Roadmap

Own the end-to-end roadmap for Verilog / RTL-focused LLM capabilities, covering model selection, fine-tuning, evals, deployment, and continuous improvement.

Lead a hands-on team of applied ML engineers / scientists, unblock technically, review designs and code, and drive experimentation velocity and reliability.

Model Training & Customization

Fine-tune and customize models using modern techniques (LoRA / QLoRA, PEFT, instruction tuning, RLAIF / preference optimization).

Build HDL-aware evaluation workflows :

Compile / lint / simulate-based pass rates

Pass@k for code generation

Constrained decoding enforcing HDL syntax

Does-it-synthesize? checks
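
As an illustration of the Pass@k item above, the metric is commonly computed with the unbiased estimator 1 - C(n - c, k) / C(n, k), where n candidates are sampled per problem and c of them pass. The sketch below is a minimal version under stated assumptions: the function name pass_at_k is hypothetical, and "pass" is assumed to mean that a generated module cleared whatever compile / lint / simulate check the harness applies. It is not taken from this posting or from any specific team's tooling.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem.

    n: number of sampled completions (assumed: generated HDL candidates)
    c: number of those samples that passed the check
    k: budget of attempts being evaluated
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # Fewer than k failing samples: every size-k subset contains a pass.
    if n - c < k:
        return 1.0
    # Probability that a random size-k subset contains no passing sample.
    all_fail = comb(n - c, k) / comb(n, k)
    return 1.0 - all_fail


if __name__ == "__main__":
    # Hypothetical counts for illustration only.
    print(pass_at_k(n=10, c=5, k=1))
```

A suite-level score would then be the mean of pass_at_k over all problems in the benchmark.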

Privacy-First AWS ML Pipelines

Design secure training & inference environments using AWS services such as :

Amazon Bedrock (incl. Anthropic models)

SageMaker or EKS with KServe / Triton / DJL for bespoke training

Implement strict privacy controls :

Artifacts in S3 with KMS CMKs

VPC-only infrastructure with PrivateLink (incl. Bedrock endpoints)

IAM least-privilege, CloudTrail auditing

Secrets Manager for credential handling

Full encryption in transit / at rest

No public egress for customer / RTL corpora

Inference & Deployment

Stand up scalable, reliable LLM serving :

Bedrock model invocation where applicable

Low-latency self-hosted inference (vLLM / TensorRT-LLM)

Autoscaling and canary / blue-green rollouts

Evaluation Culture & Tooling

Build automated regression suites running HDL compilers / simulators to measure correctness and detect hallucinations.

Track experiments and produce model cards using MLflow / W&B.

Cross-Functional Collaboration

Work with hardware design teams, CAD / EDA, Security, and Legal to :

Prepare / anonymize datasets

Define acceptance gates

Meet licensing, compliance, and security requirements

Productization

Integrate models into engineering workflows : IDE plugins, CI bots, code review assistants, retrieval over internal HDL repos / specs, and safe function-calling.

Mentorship

Develop team capabilities in LLM training, reproducibility, secure pipelines, and research literacy.

Minimum Qualifications

10+ years total engineering experience; 5+ years in ML / AI or large-scale distributed systems; 3+ years with transformers / LLMs.

Proven record shipping LLM-powered features and leading cross-functional technical initiatives at Staff level.

Deep, hands-on experience with :

PyTorch, Hugging Face Transformers / PEFT / TRL

Distributed training (DeepSpeed / FSDP)

LoRA / QLoRA, grammar-guided decoding

Strong AWS expertise :

Bedrock (model customization, Guardrails, Knowledge Bases, VPC endpoints)

SageMaker (Training / Inference / Pipelines)

S3, EC2 / EKS / ECR, IAM, VPC, KMS, CloudWatch / CloudTrail, Step Functions, Secrets Manager

Strong Python engineering fundamentals (testing, CI / CD, observability, performance tuning).

Excellent technical communication and ability to set vision across teams.

Preferred Qualifications

Familiarity with Verilog / SystemVerilog / RTL workflows (lint, simulation, synthesis, timing closure, test benches).

Experience with static-analysis / AST-aware tokenization and grammar-constrained decoding.

RAG over code / spec repos; tool-use / function-calling for code transformation.

Inference optimization (TensorRT-LLM, KV-cache tuning, speculative decoding).

Experience with enterprise model governance and security frameworks (SOC2 / ISO 27001 / NIST).

Background in data anonymization, DLP scanning, and code de-identification.

What success looks like

90 Days

Stand up HDL-aware eval harness with compile / simulate checks.

Establish secure AWS training & inference environments (VPC-only, KMS encryption, no public egress).

Deliver initial fine-tuned model with measurable performance gains.

180 Days

Expand training coverage using Bedrock and SageMaker / EKS.

Add constrained decoding and retrieval over design specs.

Productionize inference with SLOs and rollout to pilot teams.

12 Months

Reduce RTL review / iteration cycles using measurable metrics : lint-clean time, defect reductions, suggestion acceptance rates.

Establish a stable MLOps pathway for continuous improvements.

Security & Privacy by Design

All sensitive data remains within private AWS VPCs with IAM-controlled access and CloudTrail auditing.

Bedrock access via VPC PrivateLink endpoints only.

Strict data minimization, tagging, retention, reproducibility, and DLP scanning.

Model cards, lineage, and evaluation artifacts for each release.

Tech Stack

Modeling :

PyTorch, HF Transformers / PEFT / TRL, DeepSpeed / FSDP, vLLM, TensorRT-LLM

AWS / MLOps :

Bedrock, SageMaker, ECR, EKS / KServe / Triton, MLflow / W&B, Step Functions

Platform / Security :

S3 with KMS, IAM, VPC / PrivateLink, CloudWatch / CloudTrail, Secrets Manager

Bonus :

HDL toolchains, vector stores (pgvector / OpenSearch), GitHub / GitLab CI
