Language Model Evaluator (Remote)Scale AI • Baltimore, Maryland, United States

[error_messages.no_longer_accepting]

Language Model Evaluator (Remote)

Scale AI • Baltimore, Maryland, United States

[job_card.variable_days_ago]

[job_preview.job_type]

[job_card.full_time]

[filters.remote]

[job_card.job_description]

Join a global community of talented professionals to shape the future of AI. Earn up to $15 USD / hr and additional rewards based on quality of submission.

Outlier is committed to improving the intelligence & safety of AI models. Owned and operated by Scale AI , we’ve recently been featured in Forbes for partnering experts with top AI labs to provide the high quality data for LLMs. We believe AI can only perform as well as the data it’s trained on. That’s why we work with contributors from all over the world , who help improve AI models by providing expert human feedback . This data has led to AI advancements for the world's leading AI labs and large language model builders.

We’ve built a best-in-class remote work platform for our freelance contributors to provide valuable, specialized skills, and we in turn strive to provide them with a positive experience based on our core pillars of reliability, transparency, and flexibility.

What you will be doing

We are looking for someone who speaks fluent English to contribute their expertise toward training and refining cutting-edge AI systems.

Adopt a “user mindset” to produce natural data to meet the realistic needs you have or would use AI for.
Use the tool of rubrics to address user needs in a structured way.
Evaluate AI outputs by reviewing and ranking reasoning and problem-solving responses from large language models.
Contribute across projects depending on your specific skillset and experience.

What we’re looking for

Education : Bachelor’s degree or higher (or currently enrolled).

Analytical and Problem-Solving Skills : Ability to develop complex, professional-level prompts and evaluate nuanced AI reasoning.

Strong Writing : Clear, concise, and engaging writing to explain decisions or critique responses.

Attention to Detail : Commitment to accuracy and ability to assess technical aspects of model outputs.

Nice to Haves :

Experience in fields like literature, creative writing, history, philosophy, theology, etc.

Prior writing or editorial experience (content strategist, technical writer, editor, etc.).

Interest or background in AI, machine learning, or creative tech tools.

Compensation and benefits

Earn up to $15 USD / hr, paid out weekly

Rates vary based on quality, accuracy, and time spent. Paid via PayPal & AirTM

Free access to

Model Playground

Interact, experiment and engage with leading large language models free of cost

Flexible schedule and

time commitment

No contracts, no 9-to-5. You control your schedule. (Most experts spend 5-10 hours / week, up to 40 hours working from home

Join a global community of

Coding experts

Join a global network of experts contributing to advanced AI tools

Disclaimer : For non-core work, such as during initial project onboarding or project overtime phases, lower rates may apply. Certain projects offer incentive payments. Please review the payment terms for each project.

Equal Opportunity Employer : Outlier is committed to fostering a diverse and inclusive work environment. We welcome applicants from all backgrounds and celebrate diversity in our workforce.

[job_alerts.create_a_job]

Evaluator • Baltimore, Maryland, United States

[internal_linking.similar_jobs]

Kanuri Operational Language Analyst (OLA)

CTC Group • Fort Meade, MD, US

[job_card.full_time]

Kanuri Operational Language Analysts (OLA).Locations : Annapolis Junction / Ft.The Operational Language Analyst performs tasks required to process voice and / or graphic language materials in support ...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Exploitation Analyst Level 4

ELEVI Associates • Annapolis Junction, MD, US

[job_card.full_time]

Because You Deserve More Than Just a Job.As an Exploitation Analyst, you will : .You'll Bring These Qualifications.Degree in Network Engineering, Systems Engineering, Information Technology, or r...[show_more]

[last_updated.last_updated_30] • [promoted]

Data Science Level 3 (ML Ops Framework / Cyber Hunting), BS+10 yrs or MS+8

Link, LLC • Fort Meade, MD, US

[job_card.full_time]

Play an integral role in the development of an ML ops framework for cyber incident detection.This is a project with high visibility that seeks to revolutionize cyber hunting.The role is multifacete...[show_more]

[last_updated.last_updated_30] • [promoted]

Cybersecurity Ethics and LLM Evaluation Specialist

Tetrad Digital Integrity LLC • MD, US

[job_card.permanent]

[filters_job_card.quick_apply]

Tetrad Digital Integrity (TDI) is a leading-edge cybersecurity firm with a mission to safeguard and protect our customers from increasing threats and vulnerabilities in this digital age.We are...[show_more]

[last_updated.last_updated_variable_days]

Remote Side Hustle Evaluator - Flexible Online Gig Work

Finance Buzz • Glen Rock, Pennsylvania, US

[filters.remote]

[job_card.temporary]

Are you looking to earn extra income from the comfort of your home? We're seeking motivated individuals to explore and test a variety of remote side hustle opportunities featured on FinanceBuzz.Thi...[show_more]

[last_updated.last_updated_30] • [promoted]

Operational Language Analyst Level 1

IntelliGenesis • Annapolis Junction, MD, US

[job_card.full_time]

Perform tasks required to process voice and / or graphic language materials in support of SIGINT operations.Recover essential elements of information. Render translations and / or transcripts based on s...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Operational Language Analyst (OLA), Level 1-3 (MM)

The Kenjya-Trusant Group, LLC • Annapolis Junction, MD, US

[job_card.full_time]

The Kenjya-Trusant Group is looking for an Operational Language Analyst (OLA), Level 1-3 to support our customer in Annapolis Junction, MD. Currently accepting Levels 1-3.Provide scanning, translati...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Experis QA Lead (Remote)

Saransh Inc • Baltimore, Maryland, USA

[filters.remote]

[job_card.full_time]

QA roles with direct experience in data platform and analytics project environments showcasing building and leading a dedicated QA team to support the Enterprise Data & AI Platform defining clear r...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Exploitation Analyst Level 3

ELEVI Associates • Annapolis Junction, MD, US

[job_card.full_time]

[last_updated.last_updated_30] • [promoted]

Applied Researcher I

Capital One • Baltimore, Maryland, United States

[job_card.full_time] +1

Overview : At Capital One, we are creating trustworthy and reliable AI systems, changing banking for good.For years, Capital One has been leading the industry in using machine learning to create rea...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Freelance Luxury Brand Evaluator Automotive Project - DC Metro

CXG • MARTINS ADD, MD, US

[job_card.full_time]

[filters_job_card.quick_apply]

Are you a luxury automobile enthusiast who appreciates the finer details of high-end vehicles? If the answer is yes, we are looking for you!. As a Luxury Brand Evaluator, you will step into the worl...[show_more]

[last_updated.last_updated_30]

Data Science Level 3 (Advanced AI, NLP ), BS+10 yrs or MS+8 yrs

Link, LLC • Fort Meade, MD, US

[job_card.full_time]

DS position in X32 as a support for a Natural Language Processing (NLP) project to accurately and automatically tokenize language data with spoken or written origins. This position involves developi...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior Data Modeler (Baltimore)

Motion Recruitment • Baltimore, MD, US

[job_card.full_time] +3

Possible Extension or Contract-to-Hire.Client located in Baltimore, MD).Must be able to work EST hours.Seeking a seasoned Data Modeler to architect and operationalize scalable, governed data struct...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Remote Text Quality Evaluator

Outlier • Baltimore, MD, United States

[filters.remote]

[job_card.full_time]

Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

AI Trainer -Remote Text Quality Evaluator

Outlier • Baltimore, MD, United States

[filters.remote]

[job_card.full_time]

Earn up to $16 USD / hourly and work remotely and flexibly.Outlier, a platform owned and operated by Scale AI, is looking for. If you're passionate about improving models and excited by the future of ...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Operational Language Analyst Level 2

IntelliGenesis • Annapolis Junction, MD, US

[job_card.full_time]

[last_updated.last_updated_30] • [promoted]

Operational Language Analyst

Prime Time Consulting • Annapolis Junction, MD, US

[job_card.full_time]

Prime Time Consulting, a GRVTY Company, provides clients with expert intelligence analysis services.Our clients include defense contractors, industrial and service corporations, and department...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Operational Language Analyst

The Swift Group • Annapolis Junction, MD, US

[job_card.full_time]

OPS Consulting has over two decades of experience specializing in the most mission-critical operations.We are thought leaders and innovators. The ingenuity of our developers, engineers, cyber expert...[show_more]

[last_updated.last_updated_variable_days] • [promoted]