Program evaluation [h1.location_city]
[job_alerts.create_a_job]
Program evaluation • oakland ca
- [promoted]
Senior Research Scientist, Model Evaluation
CohereSan Francisco, CA, United States- [promoted]
Impact & Evaluation Coordinator
Delivering Innovation in Supportive HousingSan Francisco, CA, United StatesProgram Leader (Elementary Summer Program)
CYCSFSan Francisco, CA, US- [promoted]
Analyzing Evaluation Specialist
sepalSan Francisco, CA, United StatesAI Evaluation Engineer
Apex SystemsSan Francisco, CA- [promoted]
Senior Software Engineer, Data & Evaluation
WaymoSan Francisco, CA, United States- [promoted]
Architect, Scalable Model Evaluation Platforms
AnthropicSan Francisco, CA, United States- [promoted]
Research Scientist : AI Evaluation & Alignment
Patronus AISan Francisco, CA, United StatesStudent Evaluation Proctor | Temporary • In-Person
Scientific Adventures for GirlsRichmond, CA, US- [promoted]
Technical Program Manager, Quality and Reliability
HarveySan Francisco, CA, United States- [promoted]
Test & Evaluation Specialist
Agile DefenseSan Francisco, CA, United StatesSenior Research Scientist, Model Evaluation
CohereSan Francisco, CA, United States- [job_card.full_time]
Senior Research Scientist, Model Evaluation
Join to apply for the Senior Research Scientist, Model Evaluation role at Cohere .
Why this role?
Evaluation is critical to making progress in scaling intelligence. As models continue to become superhuman in many real-world use cases, we must continue to develop new evaluation techniques that accurately reflect what models are already capable of, as well as set the agenda for what future models should be capable of. In this role, you are responsible for creating next?generation evaluation methods and infrastructure to measure LLM progress.
As a Senior Research Scientist, Model Evaluation, You Will
- Create ambitious new evaluation benchmarks that push the limits of what our models can accomplish.
- Work on highly cross?functional teams to translate model feedback into trustworthy, repeatable evaluations.
- Conduct research to advance the state?of?the?art in LLM evaluation methods, including training LLM judges; refining LLM?based data synthesis pipelines; and improving evaluation efficiency.
- Build scalable and reusable tools for digging into model performance.
You May Be a Good Fit If
We value and celebrate diversity
We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
Full?Time Employees At Cohere Enjoy These Perks
Seniority level
Mid?Senior level
Employment type
Full?time
Job function
Other
Industries
Software Development
#J-18808-Ljbffr