Language Model Evaluator (Remote)Scale AI • Baltimore, Maryland, United States

Language Model Evaluator (Remote)

Scale AI • Baltimore, Maryland, United States

[job_card.1_day_ago]

[job_preview.job_type]

[job_card.full_time]

[filters.remote]

[job_card.job_description]

Join a global community of talented professionals to shape the future of AI. Earn up to $15 USD / hr and additional rewards based on quality of submission.

Outlier is committed to improving the intelligence & safety of AI models. Owned and operated by Scale AI , we’ve recently been featured in Forbes for partnering experts with top AI labs to provide the high quality data for LLMs. We believe AI can only perform as well as the data it’s trained on. That’s why we work with contributors from all over the world , who help improve AI models by providing expert human feedback . This data has led to AI advancements for the world's leading AI labs and large language model builders.

We’ve built a best-in-class remote work platform for our freelance contributors to provide valuable, specialized skills, and we in turn strive to provide them with a positive experience based on our core pillars of reliability, transparency, and flexibility.

What you will be doing

We are looking for someone who speaks fluent English to contribute their expertise toward training and refining cutting-edge AI systems.

Adopt a “user mindset” to produce natural data to meet the realistic needs you have or would use AI for.
Use the tool of rubrics to address user needs in a structured way.
Evaluate AI outputs by reviewing and ranking reasoning and problem-solving responses from large language models.
Contribute across projects depending on your specific skillset and experience.

What we’re looking for

Education : Bachelor’s degree or higher (or currently enrolled).

Analytical and Problem-Solving Skills : Ability to develop complex, professional-level prompts and evaluate nuanced AI reasoning.

Strong Writing : Clear, concise, and engaging writing to explain decisions or critique responses.

Attention to Detail : Commitment to accuracy and ability to assess technical aspects of model outputs.

Nice to Haves :

Experience in fields like literature, creative writing, history, philosophy, theology, etc.

Prior writing or editorial experience (content strategist, technical writer, editor, etc.).

Interest or background in AI, machine learning, or creative tech tools.

Compensation and benefits

Earn up to $15 USD / hr, paid out weekly

Rates vary based on quality, accuracy, and time spent. Paid via PayPal & AirTM

Free access to

Model Playground

Interact, experiment and engage with leading large language models free of cost

Flexible schedule and

time commitment

No contracts, no 9-to-5. You control your schedule. (Most experts spend 5-10 hours / week, up to 40 hours working from home

Join a global community of

Coding experts

Join a global network of experts contributing to advanced AI tools

Disclaimer : For non-core work, such as during initial project onboarding or project overtime phases, lower rates may apply. Certain projects offer incentive payments. Please review the payment terms for each project.

Equal Opportunity Employer : Outlier is committed to fostering a diverse and inclusive work environment. We welcome applicants from all backgrounds and celebrate diversity in our workforce.

[job_alerts.create_a_job]

Evaluator • Baltimore, Maryland, United States

[internal_linking.similar_jobs]

Senior AI / ML Engineer (SWE-3)

Leidos • Severn, MD, US

[job_card.full_time]

National Security Sector's (NSS) Cyber & Analytics Business Area (CABA).Our talented team is at the forefront in Security Engineering, Computer Network Operations (CNO), Mission Software, A...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Business Rules Support Analyst

Inovalon, Inc. • Bowie, MD, United States

[job_card.full_time]

Inovalon was founded in 1998 on the belief that technology, and data specifically, would empower the transformation of the entire healthcare ecosystem for the better, improving both outcomes and ec...[show_more]

[last_updated.last_updated_1_hour] • [promoted] • [new]

Lab Services Procedure & Training Document Developer

American Red Cross • Baltimore, MD, United States

[job_card.full_time]

Please use Google Chrome or Mozilla Firefox when accessing Candidate Home.By joining the American Red Cross you will touch millions of lives every year and experience the greatness of the human spi...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Remote Product Tester – $45 / hr + Free Products – Start Now!

OCPA • North Codorus, Pennsylvania, us

[filters.remote]

[job_card.part_time] +1

Product Testers are wanted to work from home nationwide in the US to fulfill upcoming contracts with national and international companies. We guarantee 15-25 hours per week with an hourly pay of bet...[show_more]

[last_updated.last_updated_30] • [promoted]

Principal / Sr Medical Editor - Regulatory Documents - Copy Editing + QC - NA / Canada Remote Based

Syneos Health / inVentiv Health Commercial LLC • Baltimore, MD, United States

[filters.remote]

[job_card.full_time]

Principal / Sr Medical Editor - Regulatory Documents - Copy Editing + QC - NA / Canada Remote Based.Syneos Health is a leading fully integrated biopharmaceutical solutions organization built to acceler...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

AI Model Evaluator (Remote)

Scale AI • Baltimore, Maryland, United States

[filters.remote]

[job_card.full_time]

Join a global community of talented professionals to shape the future of AI.Earn up to $15 USD / hr and additional rewards based on quality of submission. Outlier is committed to improving the intelli...[show_more]

[last_updated.last_updated_1_day] • [promoted]

Staff Applied Scientist

Relativity • Baltimore, MD, United States

[job_card.full_time]

At Relativity, we're building a world-class Applied Science team to push the boundaries of intelligent systems in the legal domain. We're looking for a Staff Applied Scientist to join our team.Agent...[show_more]

[last_updated.last_updated_30] • [promoted]

Research Data Analyst (Contingent II)

InsideHigherEd • Bowie, Maryland, United States

[job_card.temporary]

JR101492 Research Data Analyst (Contingent II) (Open).BSU Research and Innovation, JM.Non-Regular Fixed Term (Fixed Term). Performs a variety of professional and administrative duties in support of ...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

Data Annotation • Baltimore, Maryland

[filters.remote]

[job_card.full_time] +1

We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...[show_more]

[last_updated.last_updated_30] • [promoted]

Data Exploitation Analyst II

Oceaneering International, Inc. • Hanover, MD, United States

[job_card.full_time]

Oceaneering Technologies (OTECH) develops, manufactures, and operates customized marine systems, shipboard equipment, subsea vehicles, and engineered solutions for commercial and U.Oceaneering Aero...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

QA Analyst (hybrid)

ALTA IT Services • Baltimore, MD, US

[job_card.full_time]

QA Analyst (hybrid - 3 days per week in office) Baltimore, MD GC Holders and US Citizens per client contract Duties and Responsibilities : As a QA Analyst, you will lead process, procedures and audi...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

LPN / LVN

Encompass Health Rehabilitation Hospital of Mechanicsburg • New Freedom, PA, US

[job_card.full_time] +1

Embark on Your Compassionate LPN / LVN Journey at Encompass Health.Are you in search of a fulfilling healthcare career close to your heart and home? Encompass Health welcomes you warmly, offering a s...[show_more]

[last_updated.last_updated_30] • [promoted]

Remote Audio Generalist Evaluator Expert - AI Trainer ($35-$40 per hour)

Mercor • Baltimore, Maryland, US

[filters.remote]

[job_card.full_time]

Mercor is seeking detail-oriented writing experts to contribute to a high-impact audio AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evalu...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Remote Entry-Level Currency Trader

Maverick Currencies • Baltimore, MD

[filters.remote]

[job_card.full_time] +1

Top-ranked proprietary trading firm, Maverick Currencies, is searching for entrepreneurially-minded, profit-driven people to be trained in the art and science of proprietary trading in its online c...[show_more]

[last_updated.last_updated_30] • [promoted]

Remote English Language Audio Model Trainer - AI Trainer ($20-$20 per hour)

Mercor • Baltimore, Maryland, US

[filters.remote]

[job_card.full_time]

About the Role : • •We are seeking detail-oriented and enthusiastic individuals to join a cutting-edge AI research initiative. In this role, you will be responsible for recording and evaluation short a...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Remote Text Quality Evaluator

Outlier • Baltimore, MD, United States

[filters.remote]

[job_card.full_time]

Earn up to $16 USD / hourly and work remotely and flexibly.Outlier, a platform owned and operated by Scale AI, is looking for. If you're passionate about improving models and excited by the future of ...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

AI Trainer -Remote Text Quality Evaluator

Outlier • Baltimore, MD, United States

[filters.remote]

[job_card.full_time]

[last_updated.last_updated_variable_days] • [promoted]

AI / ML Compliance Engineer

C2 Labs • Baltimore, Maryland, United States

[job_card.full_time]

IT transformation journey via data-driven IT strategic planning, application rationalization and redevelopment, and innovative research and development of new industry standards and technologies.C2...[show_more]

[last_updated.last_updated_30] • [promoted]