Talent.com
Machine Learning Engineer, LLM Fine-Tuning
Machine Learning Engineer, LLM Fine-TuningFirst Soft Solutions LLC • San Jose, CA, United States
Machine Learning Engineer, LLM Fine-Tuning

Machine Learning Engineer, LLM Fine-Tuning

First Soft Solutions LLC • San Jose, CA, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Machine Learning Engineer, LLM Fine‑Tuning

We are actively hiring for a Machine Learning Engineer focused on LLM fine‑tuning for Verilog / RTL applications.

Location : San Jose, CA (Onsite)

Skills : LLM fine‑tuning, Verilog / RTL, AWS, Bedrock, SageMaker

Responsibilities

  • Own the technical roadmap for Verilog / RTL‑focused LLM capabilities—from model selection and adaptation to evaluation, deployment, and continuous improvement.
  • Lead a hands‑on team of applied scientists / engineers : set direction, unblock technically, review designs / code, and raise the bar on experimentation velocity and reliability.

Fine‑tune and customize models using state‑of‑the‑art techniques (LoRA / QLoRA, PEFT, instruction tuning, preference optimization / RLAIF) with robust HDL‑specific evals :

  • Compile‑ / lint‑ / simulate‑based pass rates, pass@k for code generation, constrained decoding to enforce syntax, and “does‑it‑synthesize” checks.
  • Design privacy‑first ML pipelines on AWS :

  • Training / customization and hosting using Amazon Bedrock and SageMaker (or EKS + KServe / Triton / DJL) for bespoke training needs.
  • Artifacts in S3 with KMS CMKs; isolated VPC subnets & PrivateLink (including Bedrock VPC endpoints), IAM least‑privilege, CloudTrail auditing, and Secrets Manager for credentials.
  • Enforce encryption in transit / at rest, data minimization, no public egress for customer / RTL corpora.
  • Stand up dependable model serving : Bedrock model invocation where it fits, and / or low‑latency self‑hosted inference (vLLM / TensorRT‑LLM), autoscaling, and canary / blue‑green rollouts.
  • Build an evaluation culture : automatic regression suites that run HDL compilers / simulators, measure behavioral fidelity, and detect hallucinations / constraint violations; model cards and experiment tracking (MLflow / Weights & Biases).
  • Partner deeply with hardware design, CAD / EDA, Security, and Legal to source / prepare datasets (anonymization, redaction, licensing), define acceptance gates, and meet compliance requirements.
  • Drive productization : integrate LLMs with internal developer tools (IDEs / plug‑ins, code review bots, CI), retrieval (RAG) over internal HDL repos / specs, and safe tool‑use / function‑calling.
  • Mentor & uplevel : coach ICs on LLM best practices, reproducible training, critical paper reading, and building secure‑by‑default systems.
  • Qualifications

  • 10+ years total engineering experience with 5+ years in ML / AI or large‑scale distributed systems; 3+ years working directly with transformers / LLMs.
  • Proven track record shipping LLM‑powered features in production and leading ambiguous, cross‑functional initiatives at Staff level.
  • Deep hands‑on skill with PyTorch, Hugging Face Transformers / PEFT / TRL, distributed training (DeepSpeed / FSDP), quantization‑aware fine‑tuning (LoRA / QLoRA), and constrained / grammar‑guided decoding.
  • AWS expertise to design and defend secure enterprise deployments : Bedrock, SageMaker, S3, EC2 / EKS / ECR, VPC / Subnets / Security Groups, IAM, KMS, PrivateLink, CloudWatch / CloudTrail, Step Functions, Batch, Secrets Manager.
  • Strong software engineering fundamentals : testing, CI / CD, observability, performance tuning; Python a must (bonus for Go / Java / C++).
  • Demonstrated ability to set technical vision and influence across teams; excellent written and verbal communication for execs and engineers.
  • Seniority Level

    Mid‑Senior level

    Employment Type

    Full‑time

    Job Function

    Engineering and Information Technology

    Industries

    IT Services and IT Consulting

    #J-18808-Ljbffr

    [job_alerts.create_a_job]

    Machine Learning Engineer • San Jose, CA, United States

    [internal_linking.related_jobs]
    Machine Learning Engineer - LLM, AI & Robotics

    Machine Learning Engineer - LLM, AI & Robotics

    XPENG & Volkswagen Group • Santa Clara, CA, United States
    [job_card.full_time]
    XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electri...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Founding Machine Learning Engineer

    Founding Machine Learning Engineer

    Key Technology • Santa Clara, CA, United States
    [job_card.full_time]
    You’ll design, build, and ship ranking and recommendation systems that make every match feel more personal and improve week after week. Train and fine-tune LLMs / encoders.Collaborate across ML, platf...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Machine Learning Engineer (Applied ML)

    Staff Machine Learning Engineer (Applied ML)

    EarnIn • Mountain View, CA, United States
    [job_card.full_time]
    Mountain View, US – Salary : $272,700 – $333,300 plus equity and benefits, hybrid in Mountain View.One of the first pioneers of earned wage access – building products that deliver real‑time financia...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Engineer - GenAI, LLM, Agentic AI

    Machine Learning Engineer - GenAI, LLM, Agentic AI

    Nutanix • Santa Clara, CA, United States
    [job_card.full_time]
    We are building the next generation of our AI-powered talent platform, aiming to match the right career for everyone in the world. Our AI-native enterprise talent intelligence platform leverages Gen...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Engineer - Relevance & Production ML

    Machine Learning Engineer - Relevance & Production ML

    hackajob • Palo Alto, CA, US
    [job_card.full_time]
    A leading technology company located in Palo Alto is seeking a Machine Learning Engineer for their Digital Intelligence team. The role includes leveraging large-scale computation and machine learnin...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Machine Learning Engineer — Relevance & Production ML

    Machine Learning Engineer — Relevance & Production ML

    hackajob • Palo Alto, California, United States
    [job_card.full_time]
    A leading technology company located in Palo Alto is seeking a Machine Learning Engineer for their Digital Intelligence team. The role includes leveraging large-scale computation and machine learnin...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AIML – Machine Learning Engineer – Special Projects

    AIML – Machine Learning Engineer – Special Projects

    NLP PEOPLE • Cupertino, CA, United States
    [job_card.full_time]
    Apple is where individual imaginations gather together, committing to the values that lead to great work.Every new product we build, service we create, or experience we deliver is the result of us ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Engineer - Privacy-Preserving Personalization

    Machine Learning Engineer - Privacy-Preserving Personalization

    Apple Inc. • Cupertino, CA, United States
    [job_card.full_time]
    Machine Learning Engineer - Privacy-Preserving Personalization.Cupertino, California, United States Machine Learning and AI. The future of personalization is private, and it lives on the device.Our ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Engineer (Hybrid)

    Machine Learning Engineer (Hybrid)

    Cisco Systems, Inc. • Milpitas, CA, United States
    [job_card.full_time]
    Applications are accepted until 1 / 12 / 2026.Join us at Cisco to shape the future of enterprise AI.We are building an AI Platform Team focused on creating the next-generation foundation that powers in...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Engineer – LLM, AI & Robotics

    Machine Learning Engineer – LLM, AI & Robotics

    XPENG • Santa Clara, CA, United States
    [job_card.full_time]
    AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics. With a strong focus on intelligent...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Engineer

    Machine Learning Engineer

    Amazon Jobs • Mountain View, CA, United States
    [job_card.full_time]
    The Generative AI Innovation Center at AWS empowers customers to harness state of the art AI technologies for transformative business opportunities. Our multidisciplinary team of strategists, scient...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Machine Learning Engineer (ML Platform)

    Staff Machine Learning Engineer (ML Platform)

    EarnIn • Palo Alto, CA, United States
    [job_card.full_time]
    Get AI-powered advice on this job and more exclusive features.As one of the first pioneers of earned wage access, our passion at EarnIn is building products that deliver real-time financial flexibi...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Machine Learning Engineer : LLM, VLM / VLA and reasoning models

    Machine Learning Engineer : LLM, VLM / VLA and reasoning models

    Tensor • San Jose, CA, US
    [job_card.full_time]
    Machine Learning Engineer : LLM, VLM / VLA and reasoning models Tensor is an agentic AI company dedicated to building agentic products that empower individual consumers. Our flagship product, the Tenso...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Machine Learning Engineer / Principal ML Engineer

    Staff Machine Learning Engineer / Principal ML Engineer

    SRS Consulting Inc • San Jose, CA, United States
    [job_card.full_time]
    Role : Staff Machine Learning Engineer.Location : San Jose, CA (Onsite) Locals.Mode of Interview : Virtual & Final In-person. We're building privacy‐preserving LLM capabilities that help hardware desig...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Machine Learning Engineer - CV / NLP / Multimodal LLM (TikTok Trust and Safety) - 2025 Start(Master[...]

    Machine Learning Engineer - CV / NLP / Multimodal LLM (TikTok Trust and Safety) - 2025 Start(Master[...]

    TikTok • San Jose, CA, US
    [job_card.full_time]
    Overview Join to apply for the Machine Learning Engineer - CV / NLP / Multimodal LLM (TikTok Trust and Safety) - 2025 Start(Master / Bachelor) role at TikTok. Responsibilities The algorithm team is r...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Machine Learning Engineer

    Machine Learning Engineer

    Clutch Canada • Palo Alto, CA, United States
    [job_card.full_time]
    Palo Alto, CA - Engineering - Hybrid - Full-time.Building hardware is like writing software with no debugger, no logs, and only three compile attempts — before mass production.This lack of visibili...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Machine Learning Engineer, Adobe Firefly Services

    Staff Machine Learning Engineer, Adobe Firefly Services

    Adobe • San Jose, CA, US
    [job_card.full_time]
    Our Company Changing the world through digital experiences is what Adobe's all about.We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional d...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Machine Learning Engineer, LLM Fine-Tuning

    Machine Learning Engineer, LLM Fine-Tuning

    First Soft Solutions LLC • San Jose, CA, US
    [job_card.full_time]
    Machine Learning Engineer, LLM Fine-Tuning We are actively hiring for a Machine Learning Engineer focused on LLM fine-tuning for Verilog / RTL applications. Location : San Jose, CA (Onsite).Skills : ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]