Talent.com

Program evaluation [h1.location_city]

[job_alerts.create_a_job]

Program evaluation • oakland ca

[last_updated.last_updated_1_day]
  • [promoted]
Senior Research Scientist, Model Evaluation

Senior Research Scientist, Model Evaluation

CohereSan Francisco, CA, United States
[job_card.full_time]
Senior Research Scientist, Model Evaluation.Senior Research Scientist, Model Evaluation.Evaluation is critical to making progress in scaling intelligence. As models continue to become superhuman in ...[show_more][last_updated.last_updated_variable_days]
  • [promoted]
Impact & Evaluation Coordinator

Impact & Evaluation Coordinator

Delivering Innovation in Supportive HousingSan Francisco, CA, United States
[job_card.full_time] +1
Delivering Innovation in Supportive Housing (DISH).Impact & Evaluation Coordinator.Senior Director of Community Development. Compensation : $70,000 – $85,000.The Impact & Evaluation Coordinator suppo...[show_more][last_updated.last_updated_variable_days]
Program Leader (Elementary Summer Program)

Program Leader (Elementary Summer Program)

CYCSFSan Francisco, CA, US
[job_card.full_time] +1
[filters_job_card.quick_apply]
JOB ANNOUNCEMENT The Community Youth Center of San Francisco (CYC) provides the youth of our city a sense of belonging and vital tools and experiences to succeed in life.Our services in...[show_more][last_updated.last_updated_1_day]
  • [promoted]
Analyzing Evaluation Specialist

Analyzing Evaluation Specialist

sepalSan Francisco, CA, United States
[job_card.full_time] +1
The Analyzing Evaluation Specialist plays a critical role in evaluating task submissions across various specialized fields, including healthcare, law, education, engineering, finance, and software ...[show_more][last_updated.last_updated_variable_days]
AI Evaluation Engineer

AI Evaluation Engineer

Apex SystemsSan Francisco, CA
[job_card.temporary]
[filters_job_card.quick_apply]
Contract : 6 months + extension opportunity.We are looking for engineers to join us on a 6-month contract (with the possibility of extension) our Engineering Team. The primary work is split between e...[show_more][last_updated.last_updated_30]
  • [promoted]
Senior Software Engineer, Data & Evaluation

Senior Software Engineer, Data & Evaluation

WaymoSan Francisco, CA, United States
[job_card.full_time]
Senior Software Engineer, Data & Evaluation.Senior Software Engineer, Data & Evaluation.Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Sinc...[show_more][last_updated.last_updated_1_day]
  • [promoted]
Architect, Scalable Model Evaluation Platforms

Architect, Scalable Model Evaluation Platforms

AnthropicSan Francisco, CA, United States
[job_card.full_time]
A leading AI safety company in New York is looking for a Research Engineer to lead the design of their evaluation platform. You will develop methodologies to assess AI model capabilities and collabo...[show_more][last_updated.last_updated_1_day]
  • [promoted]
Research Scientist : AI Evaluation & Alignment

Research Scientist : AI Evaluation & Alignment

Patronus AISan Francisco, CA, United States
[job_card.full_time]
A leading AI company is seeking a Research Scientist to develop cutting-edge AI evaluation systems and conduct transformative research in language models. Candidates should have a background in empi...[show_more][last_updated.last_updated_variable_days]
Student Evaluation Proctor | Temporary • In-Person

Student Evaluation Proctor | Temporary • In-Person

Scientific Adventures for GirlsRichmond, CA, US
[job_card.temporary]
[filters_job_card.quick_apply]
Scientific Adventures for Girls (SAfG) is seeking Student Evaluation Proctors.Do you love working with kids and believe every girl deserves a chance to thrive in STEM?. We're looking for enthusiasti...[show_more][last_updated.last_updated_variable_days]
  • [promoted]
Technical Program Manager, Quality and Reliability

Technical Program Manager, Quality and Reliability

HarveySan Francisco, CA, United States
[job_card.full_time]
At Harvey, we're transforming how legal and professional services operate not incrementally, but end-to-end.By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise...[show_more][last_updated.last_updated_variable_days]
  • [promoted]
Test & Evaluation Specialist

Test & Evaluation Specialist

Agile DefenseSan Francisco, CA, United States
[job_card.full_time]
At Agile Defense we know that action defines the outcome and new challenges require new solutions.That's why we always look to the future and embrace change with an unmovable spirit and the courage...[show_more][last_updated.last_updated_1_day]
Senior Research Scientist, Model Evaluation

Senior Research Scientist, Model Evaluation

CohereSan Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Senior Research Scientist, Model Evaluation

Join to apply for the Senior Research Scientist, Model Evaluation role at Cohere .

Why this role?

Evaluation is critical to making progress in scaling intelligence. As models continue to become superhuman in many real-world use cases, we must continue to develop new evaluation techniques that accurately reflect what models are already capable of, as well as set the agenda for what future models should be capable of. In this role, you are responsible for creating next?generation evaluation methods and infrastructure to measure LLM progress.

As a Senior Research Scientist, Model Evaluation, You Will

  • Create ambitious new evaluation benchmarks that push the limits of what our models can accomplish.
  • Work on highly cross?functional teams to translate model feedback into trustworthy, repeatable evaluations.
  • Conduct research to advance the state?of?the?art in LLM evaluation methods, including training LLM judges; refining LLM?based data synthesis pipelines; and improving evaluation efficiency.
  • Build scalable and reusable tools for digging into model performance.

You May Be a Good Fit If

  • You enjoy rapidly building prototypes that demonstrate the boundaries of what LLMs are capable of, and you have developed resources to measure those capabilities.
  • You have spent dozens of hours reviewing complex data and LLM outputs to ensure high data quality.
  • You are obsessive about rigorously measuring AI capabilities, and also about making sure your measurements actually align with the capabilities you care about.
  • You have strong software engineering skills.
  • We value and celebrate diversity

    We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

    Full?Time Employees At Cohere Enjoy These Perks

  • ???? An open and inclusive culture and work environment
  • ???????? Work closely with a team on the cutting edge of AI research
  • ???? Weekly lunch stipend, in?office lunches & snacks
  • ???? Full health and dental benefits, including a separate budget to take care of your mental health
  • ???? 100% Parental Leave top?up for up to 6 months
  • ???? Personal enrichment benefits towards arts and culture, fitness and well?being, quality time, and workspace improvement
  • ???? Remote?flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co?working stipend
  • ?? 6 weeks of vacation (30 working days!)
  • Seniority level

    Mid?Senior level

    Employment type

    Full?time

    Job function

    Other

    Industries

    Software Development

    #J-18808-Ljbffr