Talent.com
Language Model Evaluator (Remote)
Language Model Evaluator (Remote)Scale AI • Baltimore, Maryland, United States
Language Model Evaluator (Remote)

Language Model Evaluator (Remote)

Scale AI • Baltimore, Maryland, United States
[job_card.1_day_ago]
[job_preview.job_type]
  • [job_card.full_time]
  • [filters.remote]
[job_card.job_description]

Join a global community of talented professionals to shape the future of AI. Earn up to $15 USD / hr and additional rewards based on quality of submission.

Outlier is committed to improving the intelligence & safety of AI models. Owned and operated by Scale AI , we’ve recently been featured in Forbes for partnering experts with top AI labs to provide the high quality data for LLMs. We believe AI can only perform as well as the data it’s trained on. That’s why we work with contributors from all over the world , who help improve AI models by providing expert human feedback . This data has led to AI advancements for the world's leading AI labs and large language model builders.

We’ve built a best-in-class remote work platform for our freelance contributors to provide valuable, specialized skills, and we in turn strive to provide them with a positive experience based on our core pillars of reliability, transparency, and flexibility.

What you will be doing

We are looking for someone who speaks fluent English to contribute their expertise toward training and refining cutting-edge AI systems.

  • Adopt a “user mindset” to produce natural data to meet the realistic needs you have or would use AI for.
  • Use the tool of rubrics to address user needs in a structured way.
  • Evaluate AI outputs by reviewing and ranking reasoning and problem-solving responses from large language models.
  • Contribute across projects depending on your specific skillset and experience.

What we’re looking for

  • Education : Bachelor’s degree or higher (or currently enrolled).
  • Analytical and Problem-Solving Skills : Ability to develop complex, professional-level prompts and evaluate nuanced AI reasoning.
  • Strong Writing : Clear, concise, and engaging writing to explain decisions or critique responses.
  • Attention to Detail : Commitment to accuracy and ability to assess technical aspects of model outputs.
  • Nice to Haves :

  • Experience in fields like literature, creative writing, history, philosophy, theology, etc.
  • Prior writing or editorial experience (content strategist, technical writer, editor, etc.).
  • Interest or background in AI, machine learning, or creative tech tools.
  • Compensation and benefits

    Earn up to $15 USD / hr, paid out weekly

    Rates vary based on quality, accuracy, and time spent. Paid via PayPal & AirTM

    Free access to

    Model Playground

    Interact, experiment and engage with leading large language models free of cost

    Flexible schedule and

    time commitment

    No contracts, no 9-to-5. You control your schedule. (Most experts spend 5-10 hours / week, up to 40 hours working from home

    Join a global community of

    Coding experts

    Join a global network of experts contributing to advanced AI tools

    Disclaimer : For non-core work, such as during initial project onboarding or project overtime phases, lower rates may apply. Certain projects offer incentive payments. Please review the payment terms for each project.

    Equal Opportunity Employer : Outlier is committed to fostering a diverse and inclusive work environment. We welcome applicants from all backgrounds and celebrate diversity in our workforce.

    [job_alerts.create_a_job]

    Evaluator • Baltimore, Maryland, United States

    [internal_linking.similar_jobs]
    Senior AI / ML Engineer (SWE-3)

    Senior AI / ML Engineer (SWE-3)

    Leidos • Severn, MD, US
    [job_card.full_time]
    National Security Sector's (NSS) Cyber & Analytics Business Area (CABA).Our talented team is at the forefront in Security Engineering, Computer Network Operations (CNO), Mission Software, A...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Business Rules Support Analyst

    Business Rules Support Analyst

    Inovalon, Inc. • Bowie, MD, United States
    [job_card.full_time]
    Inovalon was founded in 1998 on the belief that technology, and data specifically, would empower the transformation of the entire healthcare ecosystem for the better, improving both outcomes and ec...[show_more]
    [last_updated.last_updated_1_hour] • [promoted] • [new]
    Lab Services Procedure & Training Document Developer

    Lab Services Procedure & Training Document Developer

    American Red Cross • Baltimore, MD, United States
    [job_card.full_time]
    Please use Google Chrome or Mozilla Firefox when accessing Candidate Home.By joining the American Red Cross you will touch millions of lives every year and experience the greatness of the human spi...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Remote Product Tester – $45 / hr + Free Products – Start Now!

    Remote Product Tester – $45 / hr + Free Products – Start Now!

    OCPA • North Codorus, Pennsylvania, us
    [filters.remote]
    [job_card.part_time] +1
    Product Testers are wanted to work from home nationwide in the US to fulfill upcoming contracts with national and international companies. We guarantee 15-25 hours per week with an hourly pay of bet...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Principal / Sr Medical Editor - Regulatory Documents - Copy Editing + QC - NA / Canada Remote Based

    Principal / Sr Medical Editor - Regulatory Documents - Copy Editing + QC - NA / Canada Remote Based

    Syneos Health / inVentiv Health Commercial LLC • Baltimore, MD, United States
    [filters.remote]
    [job_card.full_time]
    Principal / Sr Medical Editor - Regulatory Documents - Copy Editing + QC - NA / Canada Remote Based.Syneos Health is a leading fully integrated biopharmaceutical solutions organization built to acceler...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Model Evaluator (Remote)

    AI Model Evaluator (Remote)

    Scale AI • Baltimore, Maryland, United States
    [filters.remote]
    [job_card.full_time]
    Join a global community of talented professionals to shape the future of AI.Earn up to $15 USD / hr and additional rewards based on quality of submission. Outlier is committed to improving the intelli...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Staff Applied Scientist

    Staff Applied Scientist

    Relativity • Baltimore, MD, United States
    [job_card.full_time]
    At Relativity, we're building a world-class Applied Science team to push the boundaries of intelligent systems in the legal domain. We're looking for a Staff Applied Scientist to join our team.Agent...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Research Data Analyst (Contingent II)

    Research Data Analyst (Contingent II)

    InsideHigherEd • Bowie, Maryland, United States
    [job_card.temporary]
    JR101492 Research Data Analyst (Contingent II) (Open).BSU Research and Innovation, JM.Non-Regular Fixed Term (Fixed Term). Performs a variety of professional and administrative duties in support of ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Data Annotation • Baltimore, Maryland
    [filters.remote]
    [job_card.full_time] +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Data Exploitation Analyst II

    Data Exploitation Analyst II

    Oceaneering International, Inc. • Hanover, MD, United States
    [job_card.full_time]
    Oceaneering Technologies (OTECH) develops, manufactures, and operates customized marine systems, shipboard equipment, subsea vehicles, and engineered solutions for commercial and U.Oceaneering Aero...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    QA Analyst (hybrid)

    QA Analyst (hybrid)

    ALTA IT Services • Baltimore, MD, US
    [job_card.full_time]
    QA Analyst (hybrid - 3 days per week in office) Baltimore, MD GC Holders and US Citizens per client contract Duties and Responsibilities : As a QA Analyst, you will lead process, procedures and audi...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    LPN / LVN

    LPN / LVN

    Encompass Health Rehabilitation Hospital of Mechanicsburg • New Freedom, PA, US
    [job_card.full_time] +1
    Embark on Your Compassionate LPN / LVN Journey at Encompass Health.Are you in search of a fulfilling healthcare career close to your heart and home? Encompass Health welcomes you warmly, offering a s...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Remote Audio Generalist Evaluator Expert - AI Trainer ($35-$40 per hour)

    Remote Audio Generalist Evaluator Expert - AI Trainer ($35-$40 per hour)

    Mercor • Baltimore, Maryland, US
    [filters.remote]
    [job_card.full_time]
    Mercor is seeking detail-oriented writing experts to contribute to a high-impact audio AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evalu...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Remote Entry-Level Currency Trader

    Remote Entry-Level Currency Trader

    Maverick Currencies • Baltimore, MD
    [filters.remote]
    [job_card.full_time] +1
    Top-ranked proprietary trading firm, Maverick Currencies, is searching for entrepreneurially-minded, profit-driven people to be trained in the art and science of proprietary trading in its online c...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Remote English Language Audio Model Trainer - AI Trainer ($20-$20 per hour)

    Remote English Language Audio Model Trainer - AI Trainer ($20-$20 per hour)

    Mercor • Baltimore, Maryland, US
    [filters.remote]
    [job_card.full_time]
    About the Role : • •We are seeking detail-oriented and enthusiastic individuals to join a cutting-edge AI research initiative. In this role, you will be responsible for recording and evaluation short a...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Remote Text Quality Evaluator

    Remote Text Quality Evaluator

    Outlier • Baltimore, MD, United States
    [filters.remote]
    [job_card.full_time]
    Earn up to $16 USD / hourly and work remotely and flexibly.Outlier, a platform owned and operated by Scale AI, is looking for. If you're passionate about improving models and excited by the future of ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Trainer -Remote Text Quality Evaluator

    AI Trainer -Remote Text Quality Evaluator

    Outlier • Baltimore, MD, United States
    [filters.remote]
    [job_card.full_time]
    Earn up to $16 USD / hourly and work remotely and flexibly.Outlier, a platform owned and operated by Scale AI, is looking for. If you're passionate about improving models and excited by the future of ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI / ML Compliance Engineer

    AI / ML Compliance Engineer

    C2 Labs • Baltimore, Maryland, United States
    [job_card.full_time]
    IT transformation journey via data-driven IT strategic planning, application rationalization and redevelopment, and innovative research and development of new industry standards and technologies.C2...[show_more]
    [last_updated.last_updated_30] • [promoted]