Talent.com

Program evaluation [h1.location_city]

[job_alerts.create_a_job]

Program evaluation • seattle wa

[last_updated.last_updated_variable_hours]
  • [new]
GenAI Evaluation Engineer

GenAI Evaluation Engineer

Diverse LynxBellevue, WA, United States
[job_card.full_time]
Strong understanding of LLMs and generative AI concepts, including model behavior and output evaluation.Experience with AI evaluation and benchmarking methodologies, including baseline creation and...[show_more][last_updated.last_updated_variable_hours]
  • [promoted]
Advanced Practice Provider : Acute Clinical Evaluation

Advanced Practice Provider : Acute Clinical Evaluation

UnavailableSeattle, WA, United States
[job_card.full_time]
Fred Hutchinson Cancer Center is an independent, nonprofit organization providing adult cancer treatment and groundbreaking research focused on cancer and infectious diseases.Based in Seattle, Fred...[show_more][last_updated.last_updated_30]
Test & Evaluation Engineer 3

Test & Evaluation Engineer 3

Indotronix International CorporationSeattle, Washington
[job_card.full_time]
Test & Evaluation Engineer 3 Seattle, Washington, United States | Posted : 8 / 11 / 2025.[show_more][last_updated.last_updated_30]
Test & Evaluation Lab Technician

Test & Evaluation Lab Technician

SSi PeopleTukwila / Washington
[job_card.full_time]
Job Title : Test & Evaluation Lab Technician.Conduct tests and evaluations in our client's lab, supporting various projects across the enterprise. Collaborate effectively with team members and partne...[show_more][last_updated.last_updated_30]
Program Manager

Program Manager

CHIEF SEATTLE CLUBSeattle, WA, US
[job_card.full_time] +2
Program Manager (Permanent Supportive Housing).Director of Supportive Housing.Status : ☒Regular ☐Temporary ☒Full-Time ☐Part-Time FLSA : ☒ Exempt OR.This position re...[show_more][last_updated.last_updated_30]
Program Manager

Program Manager

Cypress HCMSeattle, WA, US
[job_card.full_time]
Seattle-based candidates required to support face to face customer meetings if needed.We are seeking a seasoned Program Manager contractor with deep experience in Enterprise B2B software to support...[show_more][last_updated.last_updated_30]
  • [promoted]
  • [new]
Python Insfrastructure Engineer - Model Evaluation

Python Insfrastructure Engineer - Model Evaluation

AlignerrSeattle, WA, United States
[job_card.full_time]
AI labs to build, evaluate, and improve next-generation models.We work on real production systems and high-impact research workflows across data, tooling, and infrastructure.Senior Python Full-Stac...[show_more][last_updated.last_updated_variable_hours]
  • [new]
Term Limited - Evaluation and Process Improvement Advisor

Term Limited - Evaluation and Process Improvement Advisor

City of Seattle, WASeattle, WA, United States
[job_card.full_time]
Civil Service Exempt, Term Limited, Full-Time.Aging and Disabilities Services.Aging and Disability Services (ADS) is dedicated to helping older adults and adults with disabilities live with indepen...[show_more][last_updated.last_updated_variable_hours]
  • [promoted]
Program Manager

Program Manager

F5Seattle, WA, US
[job_card.full_time]
At F5, we strive to bring a better digital world to life.Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital...[show_more][last_updated.last_updated_30]
  • [new]
AI Evaluation Engineer - Health

AI Evaluation Engineer - Health

AppleSeattle, WA, United States
[job_card.full_time]
The Health Sensing team builds outstanding technologies to support our users in living their healthiest, happiest lives by providing them with objective, accurate, and timely information about thei...[show_more][last_updated.last_updated_variable_hours]
  • [promoted]
Head of Evaluation and Oversight Research

Head of Evaluation and Oversight Research

Scale AI, Inc.Seattle, WA, United States
[job_card.full_time]
Scale is the leading data and evaluation partner for frontier AI companies, playing an integral role in advancing the science of evaluating and characterizing large language models (LLMs).Our resea...[show_more][last_updated.last_updated_30]
  • [promoted]
Senior Program Manager, Program Services

Senior Program Manager, Program Services

Washington StaffingSeattle, WA, US
[job_card.full_time]
Northwest Center has been a vendor at Amazon for over 20 years.The NWC @ Amazon division has grown to over 360 employees, providing a wide variety of customer service support on Amazon's thriving c...[show_more][last_updated.last_updated_variable_days]
Program Manager

Program Manager

Blueprint TechnologiesBellevue, Washington, USA
[job_card.full_time]
We are a technology solutions firm headquartered in Bellevue Washington with a strong presence across the United States.Unified by a shared passion for solving complicated problems our people are o...[show_more][last_updated.last_updated_variable_days]
  • [promoted]
  • [new]
Staff Software Engineer, Perception Evaluation

Staff Software Engineer, Perception Evaluation

WaymoBellevue, WA, United States
[job_card.full_time]
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...[show_more][last_updated.last_updated_variable_hours]
  • [promoted]
Program Manager

Program Manager

Blue OriginSeattle, WA, US
[job_card.full_time] +1
Applications will be accepted on an ongoing basis until the requisition is closed.At Blue Origin, we envision millions of people living and working in space for the benefit of Earth.We're working t...[show_more][last_updated.last_updated_30]
  • [new]
AIML - Sr Data Engineer, Evaluation

AIML - Sr Data Engineer, Evaluation

Seattle StaffingSeattle, WA, United States
[job_card.full_time]
The SWE Data organization seeks to improve products by using data as the voice of our customers.Within this organization, the Search Data Engineering team builds systems that process data reliably ...[show_more][last_updated.last_updated_variable_hours]
  • [promoted]
Program Manager

Program Manager

AZXSeattle, WA, US
[job_card.full_time]
Our mission is to accelerate positive impact in critical industries through AI transformation.We're growing quickly and already work with category-leaders in real estate (CBRE), energy (LevelTen En...[show_more][last_updated.last_updated_variable_days]
Program Manager

Program Manager

VirtualVocationsRenton, Washington, United States
[job_card.full_time]
A company is looking for a Program Manager to provide support through program management in various functional areas.Key Responsibilities Ensure well-documented policies, workflows, program contr...[show_more][last_updated.last_updated_30]
  • [promoted]
Program Manager

Program Manager

SamprasoftSeattle, WA, US
[job_card.full_time]
Who we are an innovative performance apparel company for yoga, running, training, and other athletic pursuits.[show_more][last_updated.last_updated_30]
GenAI Evaluation Engineer

GenAI Evaluation Engineer

Diverse LynxBellevue, WA, United States
[job_card.variable_hours_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Job Description :

  • Strong understanding of LLMs and generative AI concepts, including model behavior and output evaluation
  • Experience with AI evaluation and benchmarking methodologies, including baseline creation and model comparison
  • Hands-on expertise in Eval testing, creating structured test suites to measure accuracy, relevance, safety, and performance
  • Ability to define and apply evaluation metrics (precisionrecall, BLEUROUGE, F1, hallucination rate, latency, cost per output)Prompt engineering and prompt testing experience across zero-shot, few-shot, and system prompt scenarios
  • Python other programming languages, for automation, data analysis, batch evaluation execution, and API integration
  • Experience with evaluation tools / frameworks (OpenAI Evals, HuggingFace evals, Promptfoo, Ragas, DeepEval, LM Eval Harness)
  • Ability to create datasets, test cases, benchmarks, and ground truth references for consistent scoring
  • Test design and test automation experience, including reproducible evaluation pipelines
  • Knowledge of AI safety, bias, security testing, and hallucination analysis

Nice-to-Have :

  • RAG evaluation experience
  • Azure OpenAI
  • OpenAI
  • Anthropic
  • Google AI platforms
  • Performance benchmarking (speed, throughput, cost)
  • Domain knowledge Office apps enterprise systems networking
  • Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.