Talent.com
AI Evaluation Analyst
AI Evaluation AnalystVirtualVocations • Renton, Washington, United States
AI Evaluation Analyst

AI Evaluation Analyst

VirtualVocations • Renton, Washington, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

A company is looking for an AI Agent Evaluation Analyst (Freelance).

Key Responsibilities

Review evaluation tasks and scenarios for logic, completeness, and realism

Identify inconsistencies, missing assumptions, or unclear decision points

Help define clear expected behaviors (gold standards) for AI agents

Required Qualifications

Excellent analytical thinking with the ability to reason about complex systems

Familiarity with structured data formats, such as JSON / YAML

Experience with policy evaluation, logic puzzles, or structured scenario design

Background in consulting, academia, or research

Exposure to LLMs, prompt engineering, or AI-generated content

[job_alerts.create_a_job]

Analyst • Renton, Washington, United States

[internal_linking.similar_jobs]
GenAI Evaluation Engineer

GenAI Evaluation Engineer

Diverse Lynx • Bellevue, WA, United States
[job_card.full_time]
Strong understanding of LLMs and generative AI concepts, including model behavior and output evaluation.Experience with AI evaluation and benchmarking methodologies, including baseline creation and...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Competitive Intelligence Analyst

Competitive Intelligence Analyst

Cooley LLP • Seattle, WA, United States
[job_card.full_time]
Competitive Intelligence Analyst.Cooley is seeking a Competitive Intelligence Analyst to join the Data Delivery team.Cooley Innovation embraces a culture of customer service excellence and all memb...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Applied Scientist IV : 25-04131 (No C2C)

Applied Scientist IV : 25-04131 (No C2C)

Akraya Inc • Bellevue, Washington, United States
[job_card.full_time]
[filters_job_card.quick_apply]
Primary Skills : Deep Learning (Expert); Machine Learning (Expert), LLM(Expert), AI (Advance), Data Manipulation(Proficient), Model Development-(Proficient). Duration : 12 Months with possible extensi...[show_more]
[last_updated.last_updated_30]
Technical Business Analyst

Technical Business Analyst

Relativity • Seattle, WA, United States
[job_card.full_time]
Relativity's Problem Management is seeking a Technical Business Analyst who excels at using data analytics to uncover trends in quality, client workflows, product performance and efficiency.Your in...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Contact Center AI Automation Analyst

Contact Center AI Automation Analyst

GEICO • Seattle, WA, United States
[job_card.full_time]
Job Description : Contact Center AI & Machine Learning Automation.Contact Center AI & Machine Learning Business Analyst.Richardson, TX, Palo Alto, CA, Seattle, WA, Tampa, FL,.This role is not eligib...[show_more]
[last_updated.last_updated_30] • [promoted]
Travel Physical Therapist

Travel Physical Therapist

Mader MedX • Graham, WA, US
[job_card.full_time]
Mader MedX is seeking a travel Physical Therapist for a travel job in Graham, Washington.Job Description & Requirements.At Mader MedX, our people always come first. With over 20 years of global ...[show_more]
[last_updated.last_updated_30] • [promoted]
AI Evaluation Engineer - Health

AI Evaluation Engineer - Health

Apple • Seattle, WA, United States
[job_card.full_time]
The Health Sensing team builds outstanding technologies to support our users in living their healthiest, happiest lives by providing them with objective, accurate, and timely information about thei...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Lead, Frontier AI Evaluation & Oversight

Lead, Frontier AI Evaluation & Oversight

Scale AI • Seattle, WA, United States
[job_card.full_time]
A leading technology company in Seattle seeks a Head of Evaluation and Oversight Research to lead a team in advancing AI evaluation science. The ideal candidate has a strong research background in m...[show_more]
[last_updated.last_updated_30] • [promoted]
Head of Evaluation and Oversight Research

Head of Evaluation and Oversight Research

Scale AI, Inc. • Seattle, WA, United States
[job_card.full_time]
Scale is the leading data and evaluation partner for frontier AI companies, playing an integral role in advancing the science of evaluating and characterizing large language models (LLMs).Our resea...[show_more]
[last_updated.last_updated_30] • [promoted]
Director, Data and AI Alliances

Director, Data and AI Alliances

Anaplan • Seattle, WA, United States
[job_card.full_time]
At Anaplan, we are a team of innovators focused on optimizing business decision-making through our leading AI-infused scenario planning and analysis platform so our customers can outpace their comp...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Systems Programmer - AI Data Pipelines

Systems Programmer - AI Data Pipelines

Alignerr • Seattle, WA, United States
[job_card.full_time]
AI labs to build, evaluate, and improve next-generation models.We work on real production systems and high-impact research workflows across data, tooling, and infrastructure.Senior Rust Full-Stack ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
AI Developer

AI Developer

Davis Wright Tremaine • Seattle, WA, United States
[job_card.full_time]
AI Developer | De Novo Innovation Incubator.Davis Wright Tremaine LLP - Building legal solutions at the intersections of clients, law, and operations. Davis Wright Tremaine LLP is looking for an.DeN...[show_more]
[last_updated.last_updated_30] • [promoted]
Sr. Machine Learning Engineer, Applied Research Science

Sr. Machine Learning Engineer, Applied Research Science

Pinterest • Seattle, WA, United States
[job_card.full_time]
Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...[show_more]
[last_updated.last_updated_30] • [promoted]
Master Social Worker - MSW

Master Social Worker - MSW

Fresenius Medical Care • South Hill, WA, US
[job_card.full_time]
This position will be full time at 32 hours.About this role : As a Social Worker with Fresenius Medical Care, you will provide psychosocial services for our dialysis clinic patients.You will w ork w...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Remote Investment Analyst - AI Model Trainer

Remote Investment Analyst - AI Model Trainer

Data Annotation • Renton, WA, United States
[job_card.full_time] +1
We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
AI Engineer - Bellevue

AI Engineer - Bellevue

Aircall • Seattle, WA, United States
[job_card.full_time]
Aircall is a unicorn AI-powered customer communications platform used by 22,000+ companies worldwide to drive revenue, faster resolutions, and scale. We're redefining what a customer communications ...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
AI Governance Lead - Remote

AI Governance Lead - Remote

Symetra • Bellevue, WA, United States
[filters.remote]
[job_card.full_time]
Symetra has an exciting opportunity to join our team as an.The AI Governance Lead will be responsible for evolving and managing Symetra's framework governing the ethical, transparent, secure, and c...[show_more]
[last_updated.last_updated_30] • [promoted]
Healthcare Data Analyst, Data Ecosystem Team

Healthcare Data Analyst, Data Ecosystem Team

Truveta • Seattle, Washington, United States
[filters.remote]
[job_card.full_time]
Healthcare Data Analyst, Data Ecosystem Team .Truveta is the world’s first health provider led data platform with a vision of Saving Lives with Data. Our mission is to enable researchers to find cur...[show_more]
[last_updated.last_updated_30] • [promoted]