Talent.com

Program evaluation • Sunnyvale, CA

Member of Technical Staff, Evaluation

Boson AI • Santa Clara, CA, US
Full-time
Boson AI is an early-stage startup building large language tools for everyone to use. Our founders (Alex Smola, Mu Li), and a team of Deep Learning, Optimization, NLP, AutoML and Statistics scientist...
LLM Evaluation Engineer

The Fountain Group • Mountain View, CA
Full-time
This isn’t your typical QA role – it’s a unique blend of technical engineering, machine learning evaluation, and data analysis. You’ll work closely with cutting-edge conversational AI technology, de...
Program Coordinator (Adult Day Program)

Friends of Children with Special Needs • San Jose, CA, US
Temporary
Salary: $28 - $36 / hourly (depending on experience). Friends of Children with Special Needs (FCSN) is a Bay Area non-profit organization founded in 1996 and focused on helping individuals with speci...
Program Manager

Kanak Elite Services Inc • San Jose, California, USA
Full-time +1
Job Title: Program Manager Retail / Ecommerce Logistics. SFO CA / Sacramento CA / San Jose CA (Onsite with Location Flexibility). Logistics / Supply Chain Experience. Recent retail experience (manda...
Program Coordinator

Hogarth Worldwide • Sunnyvale, CA, US
Full-time
Hogarth is the Global Content Production Company. Part of WPP, Hogarth partners with one in every two of the world's top 100 brands including Coca-Cola, Ford, Rolex, Nestlé, Mondelez and ...
Evaluation & Insights Engineer

Apple • Cupertino, CA, United States
Full-time
Weekly Hours: 40. Role Number: 200632687-0836. Summary: Imagine what you could do here. At Apple, great new ideas have a way of becoming extraordinary products, services, and customer ex...
Senior Software Engineer, ML Systems Evaluation

A • Sunnyvale, California, United States
Full-time
Our Wayfinder team is building scalable, certifiable autonomy systems to power the next generation of commercial aircraft. Our team of experts is driving the maturation of machine learning and other...
Member of Technical Staff, Model Evaluation

xAI • Palo Alto, CA, US
Full-time
xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering exc...
Program Manager

CBRE Group • Menlo Park, CA, US
Full-time
As a CBRE Program Manager, you will manage a team responsible for facilitating small to medium cross-functional projects and programs. This job is part of the Program Management function. They are re...
Evaluation Consultant

Paradise Architectural PANELS & STEEL • San Jose, California, USA
Full-time
Paradise Architectural PANELS & STEEL is a leading manufacturer of high-quality architectural panels and steel products. We are committed to providing our clients with innovative and sustainable sol...
Applied Science Manager, GenAI Evaluation Media (GEM)

Amazon • Sunnyvale, California, USA
Full-time
Passionate about creating visual customer experiences that push the boundaries at the forefront of GenAI. The North America Stores GenAI Evaluation Media (GEM) team is seeking an experienced Applied...
Program Associate

Hope Services • San Jose, CA, US
Full-time
Are you a person who enjoys helping others? Are you currently seeking fulfillment in your professional life? Hope Services is Silicon Valley's leading provider of services to people with development...
Responsible AI ML Engineer – Safety & Evaluation

Apple Inc. • Cupertino, CA, United States
Full-time
A leading technology company in Cupertino seeks a Machine Learning Engineer focused on Responsible AI. You'll work on developing evaluations for safety and fairness in generative AI applications, co...
Senior Project Manager, Post-Market Safety Evaluation

Abbott • Santa Clara, CA, US
Full-time
Senior Project Manager, Post-Market Safety Evaluation. Abbott is a global healthcare leader that helps people live more fully at all stages of life. Our portfolio of life-changing technologies spans ...
Director, Simulation Evaluation

Waymo • Mountain View, CA, United States
Full-time
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...
TLM, Autonomy Evaluation

Nuro • Mountain View, California
Full-time
Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world’s most scalable driver, combining cutting-edge AI with automoti...
Wireless Technologies Evaluation Engineer

Tata Consultancy Services • Cupertino, CA
Full-time
In this role, you will be part of the Product RF Definition team and support the evaluation and characterization of various technologies from an RF perspective. You will work independently under Product RF...
Program Manager | (Program management) | Hybrid |

Samprasoft • Sunnyvale, CA, US
Full-time
Coordinates projects and ensures company resources are utilized appropriately.
Member of Technical Staff, Evaluation

Boson AI • Santa Clara, CA, US
Job type: Full-time

Job Description

Boson AI is an early-stage startup building large language tools for everyone to use. Our founders (Alex Smola, Mu Li) and a team of Deep Learning, Optimization, NLP, AutoML and Statistics scientists and engineers are working on high-quality generative AI models for language and beyond.

We are seeking research scientists and engineers to join our team full-time in our Santa Clara office. In this role, you will work on implementing and training deep neural networks, understanding and interpreting model behavior, and aligning models to human values. The ideal candidate will have a strong background in machine learning and be motivated to develop state-of-the-art models towards AGI.

We encourage you to apply even if you do not believe you meet every single qualification. As long as you are motivated to learn and join the development of foundation models, we’d love to chat.

Responsibilities:

  • Design and run evaluations to measure models' capabilities.
  • Write efficient and clean code to build evaluation pipelines.
  • Share your findings to inform model development and data annotation guidelines.

You may be a good fit if you have:

  • Experience in prompt engineering or other ways to interact with large language models.
  • Experience in data analysis and familiarity with data processing and visualization tools.

Strong candidates may also have:

  • Proficiency in at least one deep learning framework, such as PyTorch.
  • The ability to think outside the box and find solutions to ambiguously scoped problems.
  • The ability to summarize results and clearly communicate the observations in your work.
  • Participation in research projects on model evaluation or related topics.
  • Experience in training / finetuning large language or multimodal models.

Total compensation includes base pay, equity, and benefits.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.