Talent.com

Program evaluation h1.location_city

[job_alerts.create_a_job]

Program evaluation • sunnyvale ca

[last_updated.last_updated_variable_hours]

LLM Evaluation Engineer

The Fountain GroupMountain View, CA
[job_card.full_time]

This isn’t your typical QA role – it’s a unique blend of technical engineering, machine learning evaluation, and data analysis.You’ll work closely with cutting-edge conversational AI technology, de...[internal_linking.show_more]

Software Engineer, Simulator Evaluation

WaymoMountain View, CA, United States
[job_card.full_time]

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...[internal_linking.show_more]

 • [job_card.promoted]

AIML - Sr. Software Development Engineer, Evaluation

AppleCupertino, CA, United States
[job_card.full_time]

At Apple, we create world-class innovative products that seamlessly combine cutting-edge hardware with intelligent software experiences, powered by advanced machine learning technologies.The Evalua...[internal_linking.show_more]

 • [job_card.promoted]

Program Manager

InsideHigherEdStanford, California, United States
[job_card.full_time]

Graduate School of Business, Stanford, California, United States.Administration📅Mar 27, 2026 Post Date📅108598 Requisition #.Stanford Graduate School of Business.GSB) has built a global repu...[internal_linking.show_more]

 • [job_card.promoted]

Human Evaluation & Content Quality Vendor Operations Manager

US Tech SolutionsMountain View, CA, United States
[job_card.temporary]

Location: Mountain View, CA (Hybrid).As a Human Evaluation & Content Quality Vendor Operations Manager, you will play a key role in scaling and optimizing our global human evaluation ecosystem - th...[internal_linking.show_more]

 • [job_card.promoted]

Program Manager

Foxconn Industrial InternetSan Jose, CA, US
[job_card.full_time] +1
[filters_job_card.quick_apply]

Program Manager San Jose, CA About the job: FULL-TIME/PERMANENT DEPARTMENT: Program Management POSITION: Program Manager JOB FUNCTION: The Program Manager will be mainly responsible for managing th...[internal_linking.show_more]

AI Safety & Evaluation Engineer

DeWinter GroupCampbell, CA
[job_card.full_time]

AI Safety and Evaluations Engineer.Our client, a leader in AI testing and Generative AI solutions, is looking for a skilled.AI Safety and Evaluations Engineer.This project involves designing and bu...[internal_linking.show_more]

Research Scientist (Model Evaluation)

SanasPalo Alto, California, United States, 94301
[job_card.full_time]
[filters_job_card.quick_apply]

Sanas is pioneering the future of human communication.Founded by a team of Stanford researchers and entrepreneurs with deep industry experience, Sanas has developed the world's first real-time spee...[internal_linking.show_more]

Program Manager

Amphenol HSIOSanta Clara, CA, US
[job_card.full_time]

Amphenol Communications Solutions (ACS), a division of Amphenol Corporation, is a world leader in interconnect solutions for Communications, Mobile, RF, Optics, and Commercial electronics markets.A...[internal_linking.show_more]

 • [job_card.new]

Program Associate

StanfordStanford, CA, United States
[job_card.full_time]

The Hoover Institution at Stanford University is seeking qualified candidates for the full-time position of Program Associate.A cover letter and resume are required for full consideration.About Sta...[internal_linking.show_more]

Program Manager

SupermicroSan Jose, CA, United States
[job_card.full_time]

Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers...[internal_linking.show_more]

 • [job_card.promoted]

Software Engineer, Metrics, GenAI Model Evaluation

TeslaPalo Alto, CA, United States
[job_card.full_time]

The AI Evaluation team is the main line of defense in ensuring customer safety.Working alongside our AI team, you will design metrics that utilize fleet data and run on large inference clusters to ...[internal_linking.show_more]

 • [job_card.promoted]

Program Administrator*

SanminaSan Jose, CA, United States
[job_card.full_time]

Sanmina Corporation (Nasdaq: SANM) is a leading integrated manufacturing solutions provider serving the fastest-growing segments of the global Electronics Manufacturing Services (EMS) market.Recogn...[internal_linking.show_more]

 • [job_card.promoted]

Program Analyst

QualcommSanta Clara, CA, United States
[job_card.full_time]

Operations Group, Operations Group >.The Program Analyst will support the Oryon CPU Engineering team developing and delivering key Qualcomm SoCs.This is the team behind Snapdragon X Elite laptop an...[internal_linking.show_more]

 • [job_card.promoted]

Program Manager

ACL DigitalSan Jose, CA, United States
[job_card.permanent]

The Program Manager is responsible for customer product delivery schedules, planning, managing, and tracking.The Program Manager reports to the Chief Executive Officer.Leads all phases of assigned ...[internal_linking.show_more]

 • [job_card.promoted]

Full-stack Engineer, Data Platform - Experimentation & Evaluation

Tik TokSan Jose, CA, United States
[job_card.full_time]

Team Introduction Our mission in experimentation and evaluation team is to build the next-gen A/B testing platform, that empowers the company to make data-driven decision for the products.The suppo...[internal_linking.show_more]

 • [job_card.promoted]

Program Manager

Hays RecruitmentSanta Clara, CA, United States
[job_card.permanent]

Disclaimer: This is an evergreen job posting designed to connect with top talent for future opportunities.While this role is not actively hiring at the moment, we welcome applications to be conside...[internal_linking.show_more]

 • [job_card.promoted]

Program Manager

CyngnMountain View, CA, United States
[job_card.full_time]

Based in Mountain View, CA, Cyngn is a publicly-traded autonomous technology company.We deploy self-driving industrial vehicles like forklifts and tuggers to factories, warehouses, and other facili...[internal_linking.show_more]

 • [job_card.promoted]

Director, Simulation and Evaluation - Autonomous Driving

Bosch GroupSunnyvale, California, United States
[job_card.full_time]

Company Description* *We Are Bosch.At Bosch, we shape the future by inventing high-quality technologies and services that spark enthusiasm and enrich people’s lives.Our areas of activity are every ...[internal_linking.show_more]

LLM Evaluation Engineer

LLM Evaluation Engineer

The Fountain GroupMountain View, CA
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]
About the Role:
We are seeking a LLM Evaluation Engineer to join a forward-thinking team responsible for developing a sophisticated voice assistant platform. This isn’t your typical QA role – it’s a unique blend of technical engineering, machine learning evaluation, and data analysis. You’ll work closely with cutting-edge conversational AI technology, designing evaluation frameworks, building custom scripts, and creating data visualizations to assess platform performance.

Key Responsibilities:
  • Design and implement evaluation strategies for voice and language models, including automated testing approaches.
  • Analyze unstructured data from log store systems to identify performance gaps and optimize user experiences.
  • Build and maintain custom Python scripts to streamline data processing and generate actionable insights.
  • Develop visual reports to communicate findings and drive continuous improvement.
  • Collaborate with cross-functional teams globally to identify and address pain points in conversational AI performance.
  • Use prompt engineering techniques to refine LLM outputs and articulate system health.

Ideal Candidate:
  • 3+ years of experience in machine learning evaluation, data analysis, or related technical roles.
  • Intermediate to advanced Python scripting, including log parsing and API testing.
  • Familiarity with GenAI and LLMs, including automated workflows and API integrations.
  • Strong analytical mindset, capable of working independently and identifying innovative solutions.
  • Excellent communication skills, able to present complex findings clearly to both technical and non-technical stakeholders.