Talent.com
Freelance Agent Evaluation Engineer
Freelance Agent Evaluation EngineerMindrift • Dallas, TX, US
[error_messages.no_longer_accepting]
Freelance Agent Evaluation Engineer

Freelance Agent Evaluation Engineer

Mindrift • Dallas, TX, US
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.part_time]
  • [job_card.permanent]
  • [filters.remote]
  • [filters_job_card.quick_apply]
[job_card.job_description]

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation isproject-based, not permanent employment.

What this opportunity involves

You’ll create challenging coding test cases that push AI coding systems to their limits:

  • Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements and information sources
  • Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks
  • Craft “fair but hard” challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required)
  • Analyze AI failures to understand what the model struggles with vs. what it masters
  • Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria

What we look for

This opportunity is a good fit for experienced developers, software engineers, and/or test automation specialists open to part-time, non-permanent projects. Ideally, contributors will have:

  • Degree in Computer Science, Software Engineering or related fields
  • 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
  • Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems
  • Experience writing tests (functional, integration – not just running them)
  • Docker containers (running evaluations locally in containers)
  • CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
  • English proficiency - B2

How it works

Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid

Effort estimate

Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.

Payment

  • Paid contributions, with rates up to $80/hour*
  • Fixed project rate or individual rates, depending on the project
  • Some projects include incentive payments

*Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.

[job_alerts.create_a_job]

Freelance Agent Evaluation Engineer • Dallas, TX, US

[internal_linking.similar_jobs]
Associate Engineer

Associate Engineer

Texas Capital Bank • Richardson, TX, United States
[job_card.full_time]
Texas Capital is built to help businesses and their leaders.Our depth of knowledge and expertise allows us to bring the best of the big firms at a scale that works for our clients, with highly expe...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Traveling Project Engineer - Self Perform Operations

Traveling Project Engineer - Self Perform Operations

Turner Construction • Dallas, TX, United States
[job_card.permanent]
Manage and supervise at a project level all engineering and administrative policies, procedures and functions.Coordinate with project field operations to ensure transfer of information is delivered...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
ELK Stack Engineer - Lead level

ELK Stack Engineer - Lead level

USAA • Plano, TX, United States
[job_card.full_time]
At USAA, our mission is to empower our members to achieve financial security through highly competitive products, exceptional service and trusted advice.We seek to be the #1 choice for the military...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
AI Engineer - GenAI / Agentic Systems

AI Engineer - GenAI / Agentic Systems

TheStaffed • Dallas, TX, United States
[job_card.full_time]
AI Engineer - GenAI / Agentic Systems.AI applications leveraging modern LLM architectures.This role focuses on developing agentic AI systems, RAG pipelines, and scalable APIs that integrate large l...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Implementation Engineer

Implementation Engineer

Purple Drive • Plano, TX, United States
[job_card.full_time]
Collaborate with clients to understand technical requirements and business objectives, ensuring compatibility with existing IT infrastructure.Execute hands-on implementation of completed Arista des...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Fulfillment Agent / Tier #2

Fulfillment Agent / Tier #2

RealManage • Plano, TX, United States
[job_card.full_time]
Imagine working for a dynamic, technology-driven HOA management company that is rapidly expanding, offering abundant opportunities for career advancement, and fostering a company culture that genui...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Decision Intelligence Engineer

Decision Intelligence Engineer

Diverse Lynx • Dallas, TX, United States
[job_card.full_time]
Job Role- Decision Intelligence Engineer.Job Location - Louisville, KY /Dallas, TX (Onsite).Salary Range: $110000 to $130000/Annum.Must Have Technical/Functional Skills.Hands-on experience with imp...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
MLOps Engineer - AWS & Databricks - SGWS

MLOps Engineer - AWS & Databricks - SGWS

ShiftCode Analytics • Dallas, TX, United States
[job_card.full_time]
Interview: Virtual (Need Strong candidate with LinkedIn).Hybrid: Dallax, TX and Miramar, FL (Local or nearby).We're seeking a highly skilled MLOps Engineer with deep expertise in AWS and Databricks...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Senior Implementation Engineer

Senior Implementation Engineer

BlackBerry • Dallas, TX, United States
[job_card.full_time]
This is a fully remote position requires strong internet connection and access to major airports.BlackBerry AtHoc is the global leader in networked crisis communication.As a Senior Implementation E...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Performance Engineer

Performance Engineer

Pyramid Consulting • Mesquite, TX, United States
[job_card.temporary]
Please review the job description below and contact me ASAP if you are interested.Employee benefits include, but are not limited to, health insurance (medical, dental, vision), 401(k) plan, and pai...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Engagement Agent

Engagement Agent

AceStack LLC • Plano, TX, United States
[job_card.full_time]
Interacts with clients, ensuring successful delivery of services and high customer satisfaction.This role acts as the primary contact for Mid-Market Platinum Customers.Manage Client Relationships a...[show_more]
[last_updated.last_updated_1_hour] • [promoted] • [new]
AI QA ENGINEER (AGENTIC & GENERATIVE)

AI QA ENGINEER (AGENTIC & GENERATIVE)

Tekfortune Inc • Dallas, TX, United States
[job_card.permanent]
Tekfortune is a fast-growing consulting firm specialized in permanent, contract & project-based staffing services for world's leading organizations in a broad range of industries.In this quickly ch...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Software Engineer - EV

Software Engineer - EV

Toyota Motor Sales, U.S.A., Inc. • Plano, TX, United States
[job_card.full_time]
These are just a few words that describe what life is like at Toyota.As one of the world's most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Agentic AI Developer - Plano, TX - Onsite

Agentic AI Developer - Plano, TX - Onsite

ConnectedX Inc • Plano, TX, United States
[job_card.temporary]
We are seeking a forward-thinking Agentic AI Developer to design, build, and optimize autonomous AI agents capable of reasoning, learning, and taking actions across complex systems.This role is ide...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Forward Deployed Engineer

Forward Deployed Engineer

CBRE • Dallas, TX, United States
[job_card.full_time]
As a CBRE Forward Deployed Engineer, you will work under broad direction and supervise, develop, maintain, and enhance client systems.This job is part of the Software Engineering job function.They ...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Applied AI Engineer

Applied AI Engineer

Catalyst Labs, LLC • Dallas, TX, United States
[job_card.full_time]
About the job Applied AI Engineer.Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science.We stand out as an agency that's deeply embe...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Freelance Luxury Brand Evaluator - Fort Worth, TX

Freelance Luxury Brand Evaluator - Fort Worth, TX

CXG • Dallas, TX, US
[job_card.full_time]
[filters_job_card.quick_apply]
Turn your passion for luxury into a career opportunity.Explore the world of premium brands and make a lasting impact in fashion, beauty, jewelry, or automobiles.Join CXG, the global leader in custo...[show_more]
[last_updated.last_updated_30]
Probabilistic Risk Assessment Engineer 1

Probabilistic Risk Assessment Engineer 1

Westinghouse Electric Company, LLC • Dallas, TX, United States
[job_card.full_time]
Are you interested in being part of an innovative team that supports Westinghouse's mission to provide clean energy solutions? At Westinghouse, we recognize that our employees are our most valuable...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]