AI Evaluation Research Assistant

Scale AI • South Boston, Massachusetts, United States

Job type: Full-time

Job description

Join a global community of talented professionals to shape the future of AI. Earn up to $15 USD/hr, plus additional rewards based on the quality of your submissions.

Outlier is committed to improving the intelligence and safety of AI models. Owned and operated by Scale AI, we’ve recently been featured in Forbes for partnering experts with top AI labs to provide high-quality data for LLMs. We believe AI can only perform as well as the data it’s trained on. That’s why we work with contributors from all over the world, who help improve AI models by providing expert human feedback. This data has led to AI advancements for the world's leading AI labs and large language model builders.

We’ve built a best-in-class remote work platform for our freelance contributors to provide valuable, specialized skills, and we in turn strive to provide them with a positive experience based on our core pillars of reliability, transparency, and flexibility.

What you will be doing

We are looking for someone who speaks fluent English to contribute their expertise toward training and refining cutting-edge AI systems.

Adopt a “user mindset” to produce natural data that reflects realistic needs you have, or tasks you would use AI for.
Use rubrics to address user needs in a structured way.
Evaluate AI outputs by reviewing and ranking reasoning and problem-solving responses from large language models.
Contribute across projects depending on your specific skillset and experience.

What we’re looking for

Education: Bachelor’s degree or higher (or currently enrolled).

Analytical and Problem-Solving Skills: Ability to develop complex, professional-level prompts and evaluate nuanced AI reasoning.

Strong Writing: Clear, concise, and engaging writing to explain decisions or critique responses.

Attention to Detail: Commitment to accuracy and ability to assess technical aspects of model outputs.

Nice to Haves:

Experience in fields like literature, creative writing, history, philosophy, theology, etc.

Prior writing or editorial experience (content strategist, technical writer, editor, etc.).

Interest or background in AI, machine learning, or creative tech tools.

Compensation and benefits

Earn up to $15 USD/hr, paid out weekly

Rates vary based on quality, accuracy, and time spent. Paid via PayPal and AirTM.

Free access to Model Playground

Interact, experiment, and engage with leading large language models free of cost.

Flexible schedule and time commitment

No contracts, no 9-to-5. You control your schedule. (Most experts spend 5-10 hours/week, up to 40 hours, working from home.)

Join a global community of coding experts

Join a global network of experts contributing to advanced AI tools.

Disclaimer: For non-core work, such as during initial project onboarding or project overtime phases, lower rates may apply. Certain projects offer incentive payments. Please review the payment terms for each project.

Equal Opportunity Employer: Outlier is committed to fostering a diverse and inclusive work environment. We welcome applicants from all backgrounds and celebrate diversity in our workforce.
