Talent.com
Tech Lead/Manager, Machine Learning Research Scientist- LLM Evals
Tech Lead/Manager, Machine Learning Research Scientist- LLM EvalsScale AI • San Francisco, CA, United States
Tech Lead / Manager, Machine Learning Research Scientist- LLM Evals

Tech Lead / Manager, Machine Learning Research Scientist- LLM Evals

Scale AI • San Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Overview

Tech Lead Manager, Machine Learning Research Scientist- LLM Evals at Scale AI leads a team of research scientists and research engineers to develop and implement novel evaluation methodologies, metrics, and benchmarks for large language models. The role designs and executes a roadmap that defines best practices in data-driven AI development and accelerates the next generation of generative AI models in partnership with top foundational model labs.

Responsibilities

  • Lead a team of research scientists and research engineers on LLM evals.
  • Conduct research on the effectiveness and limitations of existing LLM evaluation techniques.
  • Design and develop novel evaluation benchmarks for large language models, covering instruction following, factuality, robustness, and fairness.
  • Communicate and collaborate with clients and peer teams to facilitate cross-functional projects.
  • Collaborate with internal teams and external partners to refine metrics and create standardized evaluation protocols.
  • Implement scalable and reproducible evaluation pipelines using modern ML frameworks.
  • Publish research findings in top-tier AI conferences and contribute to open-source benchmarking initiatives.
  • Stay up-to-date on research developments, participate in design decisions, and contribute to the research community.
  • Thrive in a high-energy, fast-paced startup environment and dedicate time and effort to drive impactful results.

Qualifications

  • 5+ years of hands-on experience in large language models, NLP, and Transformer modeling, in both research and engineering settings.
  • Track record of landing major research impacts in fast-paced environments.
  • Experience leading a team of research scientists and research engineers.
  • Excellent written and verbal communication skills.
  • Published research in machine learning at major conferences (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR) and / or journals.
  • Experience in customer-facing roles.
  • Compensation and Benefits

    Compensation packages include base salary, equity, and benefits. The stated salary range for this full-time role in San Francisco, New York, and Seattle is $260,000 – $350,000 USD. Scale employees may also receive equity-based compensation. Benefits include comprehensive health, dental, and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Some roles may include additional benefits such as a commuter stipend.

    About Scale and EEO

    Scale AI is an equal opportunity employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity, or veteran status.

    #J-18808-Ljbffr

    [job_alerts.create_a_job]

    Machine Learning Scientist • San Francisco, CA, United States

    [internal_linking.similar_jobs]
    Machine Learning Research Scientist

    Machine Learning Research Scientist

    Autoscience • Menlo Park, CA, US
    [job_card.full_time]
    Develop autonomous research systems that ideate, experiment, and publish scientific papers.You'll be working on the first and best AI system to automate scientific discovery in AI research.Desi...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Research Scientist, Tech Lead Manager, Model Threat Mitigation

    Research Scientist, Tech Lead Manager, Model Threat Mitigation

    DeepMind • San Francisco, California, USA
    [job_card.full_time]
    Artificial Intelligence could be one of humanitys most useful inventions.At Google DeepMind were a team of scientists engineers machine learning experts and more working together to advance the sta...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Lead Machine Learning Engineer

    Lead Machine Learning Engineer

    Mind Company • San Francisco, California, United States
    [job_card.full_time]
    Mind Company's mission is to build non-invasive neural interfaces - that is, enabling a communication layer between humans and other humans or computers, directly using thoughts.In pursuit of this ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Machine Learning Manager

    Machine Learning Manager

    Hive • San Francisco, California, United States
    [job_card.full_time]
    Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Machine Learning Scientist, BRAID (Clinical Sciences ML)

    Senior Machine Learning Scientist, BRAID (Clinical Sciences ML)

    Genentech • San Francisco, CA, United States
    [job_card.full_time]
    It’s what drives us to innovate.To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    ML Research Scientist, Sleep Health Tech

    ML Research Scientist, Sleep Health Tech

    Eight Sleep • San Francisco, CA, United States
    [job_card.full_time]
    A forward-thinking sleep technology company in San Francisco is seeking a research scientist to leverage AI and Machine Learning to transform health and fitness experiences through innovative techn...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Manager, Advanced Insights and Modeling

    Machine Learning Manager, Advanced Insights and Modeling

    Block • San Francisco, CA, United States
    [job_card.full_time]
    Machine Learning Manager, Advanced Insights and Modeling.Join to apply for the Machine Learning Manager, Advanced Insights and Modeling role at Block. Block is one company built from many blocks, al...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Lead

    Machine Learning Lead

    Augment Solutions, Inc. • San Francisco, CA, United States
    [job_card.full_time]
    Augment CXM is defining a new category of software - customer experience management - and powering it with a patented semantic neural network. This technology ensures that our clients deliver a fant...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Machine Learning Scientist, LLM Training & Inference Research

    Machine Learning Scientist, LLM Training & Inference Research

    Lila Sciences, Inc. • San Francisco, CA, United States
    [job_card.full_time]
    Machine Learning Scientist I / II, LLM Training & Inference Research.Cambridge, MA USA; San Francisco, CA USA.As a Machine Learning Scientist in LLM Training & Inference Research, you will lead resea...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Operations Lead

    Machine Learning Operations Lead

    Together AI • San Francisco, CA, United States
    [job_card.full_time]
    Together AI is building the AI Inference & Model Shaping Platform that brings the most advanced generative AI models to the world. Our platform powers multi-tenant serverless workloads and dedicated...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Manager II, Machine Learning Engineering, Ads Identity Modeling

    Manager II, Machine Learning Engineering, Ads Identity Modeling

    Pinterest • San Francisco, California, USA
    [job_card.full_time]
    Ads Conversion Modeling group in the Ads Performance org and focuses on identity and conversion modeling that improves conversion ads performance and unlocks new optimization objectives in close pa...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Engineering Lead

    Machine Learning Engineering Lead

    Ohalo • San Francisco, CA, US
    [job_card.full_time]
    Machine Learning Engineering Lead.Ohalo is looking for a hands-on.Machine Learning Engineering Lead.You will steer a small squad of ML / Data / Software Engineers, partnering with quantitative genetici...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Tech Lead / Manager, Machine Learning Research Scientist- LLM Evals

    Tech Lead / Manager, Machine Learning Research Scientist- LLM Evals

    Scale AI, Inc. • San Francisco, California, United States
    [job_card.full_time]
    As the leading data and evaluation partner for frontier AI companies, Scale is dedicated to advancing the evaluation and benchmarking of large language models (LLMs). We are building industry-leadin...[show_more]
    [last_updated.last_updated_30] • [promoted]
    ML Research Scientist

    ML Research Scientist

    Hyperdrive Recruiting • San Francisco, CA, United States
    [job_card.full_time]
    We are an enterprise AI startup building a novel memory infrastructure that reasons across time and context to help organizations operate more intelligently. Build LLM-powered information extraction...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Applied ML Scientist, ML DSP

    Senior Applied ML Scientist, ML DSP

    Gridware • San Francisco, CA, US
    [job_card.full_time]
    Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid.We pioneered a groundbreaking new class of grid management called active grid response...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Machine Learning Scientist

    Senior Machine Learning Scientist

    Tahoe Therapeutics • South San Francisco, CA, US
    [job_card.full_time]
    Tahoe Therapeutics is a biotechnology company pioneering a fundamentally new approach to drug discovery, one that begins with the biology of real patients. Our Mosaic platform is the first to make i...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Research Scientist, Tech Lead Manager, Model Threat Mitigation

    Research Scientist, Tech Lead Manager, Model Threat Mitigation

    Google DeepMind • San Francisco, CA, United States
    [job_card.full_time]
    Research Scientist, Tech Lead Manager, Model Threat Mitigation.Google DeepMind – Join to apply for the.Research Scientist, Tech Lead Manager, Model Threat Mitigation. Artificial Intelligence could b...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Research Scientist - Machine Learning

    Research Scientist - Machine Learning

    Extropic • San Francisco, CA, United States
    [job_card.full_time]
    Extropic’s hardware massively accelerates certain kinds of probabilistic inference.Our ML team works on the science of training models in the thermodynamic paradigm, and we are looking for senior r...[show_more]
    [last_updated.last_updated_30] • [promoted]