Talent.com
AI Evaluation Research Assistant
AI Evaluation Research AssistantScale AI • South Boston, Massachusetts, United States
AI Evaluation Research Assistant

AI Evaluation Research Assistant

Scale AI • South Boston, Massachusetts, United States
[job_card.variable_hours_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Join a global community of talented professionals to shape the future of AI. Earn up to $15 USD / hr and additional rewards based on quality of submission.

Outlier is committed to improving the intelligence & safety of AI models. Owned and operated by Scale AI , we’ve recently been featured in Forbes for partnering experts with top AI labs to provide the high quality data for LLMs. We believe AI can only perform as well as the data it’s trained on. That’s why we work with contributors from all over the world , who help improve AI models by providing expert human feedback . This data has led to AI advancements for the world's leading AI labs and large language model builders.

We’ve built a best-in-class remote work platform for our freelance contributors to provide valuable, specialized skills, and we in turn strive to provide them with a positive experience based on our core pillars of reliability, transparency, and flexibility.

What you will be doing

We are looking for someone who speaks fluent English to contribute their expertise toward training and refining cutting-edge AI systems.

  • Adopt a “user mindset” to produce natural data to meet the realistic needs you have or would use AI for.
  • Use the tool of rubrics to address user needs in a structured way.
  • Evaluate AI outputs by reviewing and ranking reasoning and problem-solving responses from large language models.
  • Contribute across projects depending on your specific skillset and experience.

What we’re looking for

  • Education : Bachelor’s degree or higher (or currently enrolled).
  • Analytical and Problem-Solving Skills : Ability to develop complex, professional-level prompts and evaluate nuanced AI reasoning.
  • Strong Writing : Clear, concise, and engaging writing to explain decisions or critique responses.
  • Attention to Detail : Commitment to accuracy and ability to assess technical aspects of model outputs.
  • Nice to Haves :

  • Experience in fields like literature, creative writing, history, philosophy, theology, etc.
  • Prior writing or editorial experience (content strategist, technical writer, editor, etc.).
  • Interest or background in AI, machine learning, or creative tech tools.
  • Compensation and benefits

    Earn up to $15 USD / hr, paid out weekly

    Rates vary based on quality, accuracy, and time spent. Paid via PayPal & AirTM

    Free access to

    Model Playground

    Interact, experiment and engage with leading large language models free of cost

    Flexible schedule and

    time commitment

    No contracts, no 9-to-5. You control your schedule. (Most experts spend 5-10 hours / week, up to 40 hours working from home

    Join a global community of

    Coding experts

    Join a global network of experts contributing to advanced AI tools

    Disclaimer : For non-core work, such as during initial project onboarding or project overtime phases, lower rates may apply. Certain projects offer incentive payments. Please review the payment terms for each project.

    Equal Opportunity Employer : Outlier is committed to fostering a diverse and inclusive work environment. We welcome applicants from all backgrounds and celebrate diversity in our workforce.

    [job_alerts.create_a_job]

    Research Assistant • South Boston, Massachusetts, United States

    [internal_linking.similar_jobs]
    Statistical Research Assistant (Remote)

    Statistical Research Assistant (Remote)

    Scale AI • South Boston, Massachusetts, United States
    [filters.remote]
    [job_card.full_time]
    Join a global community of talented professionals to shape the future of AI.Earn up to $15 USD / hr and additional rewards based on quality of submission. Outlier is committed to improving the intelli...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    AI Cyber Testing & Evaluation Research Lead

    AI Cyber Testing & Evaluation Research Lead

    RAND Corporation • Boston, MA, United States
    [job_card.full_time]
    A leading research organization is seeking a Research Lead - AI Cyber Testing & Evaluation to manage a comprehensive research portfolio on AI cyber capabilities. This role involves significant proje...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Analyst, Applied Research and Evaluation

    Analyst, Applied Research and Evaluation

    National Institute For Children's Health Quality, Inc. • Boston, MA, United States
    [job_card.full_time]
    If you are unable to complete this application due to a disability, contact this employer to ask for an accommodation or an alternative application process. Analyst, Applied Research and Evaluation....[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Remote Financial Analyst - AI Trainer ($150 per hour)

    Remote Financial Analyst - AI Trainer ($150 per hour)

    Mercor • Brockton, Massachusetts, US
    [filters.remote]
    [job_card.full_time]
    UK / Canada / Europe / Singapore / Dubai / Australia-based • •Investment Banking or Private Equity Experts • • for a research project with a leading foundational model AI lab. You are a good fit if you : - Have •...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND • Boston, MA, United States
    [job_card.full_time] +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND’s Center on AI, Security, and Technology (CAST) to develop new approaches to evaluate and understand risk at the AI x Bio intersecti...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Research Lead - AI Cyber Testing & Evaluation

    Research Lead - AI Cyber Testing & Evaluation

    RAND • Boston, MA, United States
    [job_card.temporary]
    RAND's Meselson Center, part of the Global and Emerging Risks (GER) division, is seeking an accomplished technical leader to drive our ambitious AI cyber evaluation agenda.As Research Lead - AI Cyb...[show_more]
    [last_updated.last_updated_30] • [promoted]
    AI Trainer -Remote Writing Evaluator

    AI Trainer -Remote Writing Evaluator

    Outlier • Remote, MA, United States
    [filters.remote]
    [job_card.full_time]
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior AI Integration Engineer

    Senior AI Integration Engineer

    C-4 Analytics • Wakefield, Massachusetts, United States
    [job_card.full_time]
    Senior AI Integration Engineer : Wakefield, MA – C-4 Analytics.PLEASE NOTE : For your candidature to be considered, we kindly request that all sections of the application be completed in full.Please...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Research Lead - AI Cyber Testing & Evaluation

    Research Lead - AI Cyber Testing & Evaluation

    RAND Corporation • Boston, Massachusetts, United States
    [job_card.temporary]
    RAND's Meselson Center, part of the Global and Emerging Risks (GER) division, is seeking an accomplished technical leader to drive our ambitious AI cyber evaluation agenda.As Research Lead - AI Cyb...[show_more]
    [last_updated.last_updated_30] • [promoted]
    AI Research Operations Assistant

    AI Research Operations Assistant

    Scale AI • Boston, Massachusetts, United States
    [job_card.full_time]
    Join a global community of talented professionals to shape the future of AI.Earn up to $15 USD / hr and additional rewards based on quality of submission. Outlier is committed to improving the intelli...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Analyst, Applied Research and Evaluation

    Analyst, Applied Research and Evaluation

    National Institute for Children s Health Quality • Boston, MA, US
    [job_card.full_time]
    The National Institute for Children’s Health Quality (NICHQ) is an independent nonprofit organization working for more than a decade to improve children’s health.We help organizations a...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Analyst - Applied Research & Evaluation (Remote)

    Senior Analyst - Applied Research & Evaluation (Remote)

    National Institute For Children's Health Quality, Inc. • Boston, MA, United States
    [filters.remote]
    [job_card.full_time]
    A nonprofit organization focused on children's health is seeking a Senior Analyst, Applied Research and Evaluation.This role involves leading applied research and evaluation projects with a focus o...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND Corporation • Boston, MA, United States
    [job_card.full_time] +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND's Center on AI, Security, and Technology (CAST) to develop new approaches to evaluate and understand risk at the AI x Bio intersecti...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Brockton, MA, United States
    [filters.remote]
    [job_card.full_time]
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Travel MRI Tech - $3240 / Week

    Travel MRI Tech - $3240 / Week

    Lancesoft • Brockton, MA, US
    [job_card.full_time]
    Lancesoft is seeking an experienced MRI Tech for an exciting Travel Allied job in Brockton, MA.Shift : 4x10 hr days Start Date : 01 / 11 / 2026 Duration : 13 weeks Pay : $3240 / Week.Saturday : 7 : 00 AM → 11...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Engineer & Researcher, Inference - Boston, USA

    AI Engineer & Researcher, Inference - Boston, USA

    Speechify • Boston, Massachusetts, United States
    [job_card.full_time]
    PLEASE APPLY THROUGH THIS LINK : .The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify’s text-to-speech products to turn whatever ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Remote Financial Expert - AI Trainer ($150 per hour)

    Remote Financial Expert - AI Trainer ($150 per hour)

    Mercor • Taunton, Massachusetts, US
    [filters.remote]
    [job_card.full_time]
    UK / Canada / Europe / Singapore / Dubai / Australia-based • •Investment Banking or Private Equity Experts • • for a research project with a leading foundational model AI lab. You are a good fit if you : - Have •...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Medical Assistant

    Medical Assistant

    RHC Group Management LLC • Taunton, MA, US
    [job_card.full_time]
    Revere Medical gives new life to clinics in need of tools, resources, and support so they can start delivering the personalized care their communities deserve. We’re committed to supporting ou...[show_more]
    [last_updated.last_updated_30] • [promoted]