Talent.com
Research Scientist - Data (Hayward)
Research Scientist - Data (Hayward)Storm3 • Hayward, CA, US
Research Scientist - Data (Hayward)

Research Scientist - Data (Hayward)

Storm3 • Hayward, CA, US
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.part_time]
[job_card.job_description]

Research Scientist - Data focus

💊 Foundation Models, AI Research Institute

🌎 San Francisco Bay Area, USA

💸 $200,000 - $350,000 salary + bonus

Come join a revolutionary AI research lab in SF Bay Area that is poised to develop & publish high-impact breakthroughs in GenAI - across LLMs and Multimodal AI.

As part of the team, youll work at the intersection of data, large-scale training, and foundation model innovation. You will collaborate with world-class researchers, data scientists, and engineers to solve critical challenges in creating robust, scalable, and reasoning-capable LLMs. Your research will shape the way data is curated, processed, and leveraged to train the next generation of intelligent systems.

Responsibilities :

  • Lead research on data-centric approaches for LLMs , including pretraining corpus design, data valuation, and speculative decoding strategies.
  • Develop pipelines to process challenging data sources into structured and reproducible training datasets.
  • Build and optimize agentic data pipelines , integrating retrieval, self-curation, and multi-agent feedback for high-quality training and evaluation data.
  • Collaborate with researchers on alignment and reasoning-focused training that leverage data-driven approaches for improving LLM capabilities.
  • Prototype and deploy evaluation frameworks to measure data quality, coverage, and downstream impact on LLM reasoning.
  • Publish findings at top-tier venues (e.g., NeurIPS, ICLR, ACL, EMNLP) and represent the institute at international conferences.
  • Contribute to open-source tools, datasets, and benchmarks that advance the global foundation model research community.

Requirements :

  • Masters degree in Computer Science, Data Science, or a related technical field (PhD strongly preferred)
  • Experience collecting and curating high-quality text data including multi-lingual data.
  • Hands-on experience with large-scale dataset curation and preprocessing for ML / LLM training.
  • Prior works synthesizing complex datasets. Code, math, and agentic data are higher priority
  • Experience with ML infrastructure for scalable training, evaluation, and debugging .
  • Experience at the intersection of data and post-training (RL / SFT)
  • Proven ability to independently drive research questions related to data quality, scaling, or reasoning .
  • Preferred Experience :

  • Experience with retrieval-augmented generation (RAG) , agentic data pipelines, or reasoning benchmarks.
  • Contributions to speculative decoding, self-curation, or reinforcement learning from synthetic data .
  • Background in knowledge graphs, semantic search, or indexing systems .
  • Strong publication record in leading AI conferences.
  • Prior contributions to open-source ML data tools or benchmarks .
  • Prior work on speculative decoding / contributions to LLM serving engines
  • Prior work on training LLM-as-a-judge
  • Deep expertise with tokenization / training tokenizers
  • Why apply :

  • Opportunity to build out a new division at the forefront of AI innovation
  • FAANG competitive salary & package
  • Work alongside superstars from FAANG labs & leading AI companies
  • Medical, Dental and Vision Insurance
  • Relocation package available
  • 🌎 San Francisco Bay Area, USA

    📧 Interested in applying? Please click on the Easy Apply button or alternatively email me your resume at stefani.lukic@storm3.com

    [job_alerts.create_a_job]

    Research Scientist • Hayward, CA, US

    [internal_linking.related_jobs]
    Data Scientist (Hayward)

    Data Scientist (Hayward)

    Hanalytica GmbH • Hayward, CA, US
    [job_card.part_time]
    Data Scientist (San Francisco, CA).Our client is seeking a highly motivated and skilled Data Scientist to join a fast-paced, agile team focused on applying the latest advancements in artificial int...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Data Scientist (Hayward)

    Staff Data Scientist (Hayward)

    Quantix Search • Hayward, CA, US
    [job_card.part_time]
    Staff Data Scientist | San Francisco | $250K$300K + Equity.Were partnering with one of the fastest-growing AI companies in the world to hire a Staff Data Scientist. Backed by over $230M from top-tie...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Data Scientist

    Senior Data Scientist

    Waymo • Mountain View, CA, United States
    [job_card.full_time]
    Waymo is an autonomous driving technology company with the mission to be the most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Wa...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Data Science AI Modeler (Life Sciences Biotech) (Hayward)

    Data Science AI Modeler (Life Sciences Biotech) (Hayward)

    Vida Group International • Hayward, CA, United States
    [job_card.full_time]
    The heart of our Biotech client is at the forefront of biotechnology, leveraging cutting-edge artificial intelligence and machine learning technologies to revolutionize drug discovery and developme...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Scientist, Innovation

    Data Scientist, Innovation

    Picarro • Santa Clara, CA, United States
    [job_card.full_time]
    Lead Data Scientist, Innovation.Job Location : Santa Clara, CA (preferred) or Remote- US-based.Picarro is transforming gas utility operations with innovative solutions for methane emissions manageme...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Clinical Lab Scientist - Concord Microbiology Department - Part Time - 8 Hour - Variable Shift

    Clinical Lab Scientist - Concord Microbiology Department - Part Time - 8 Hour - Variable Shift

    John Muir Health • Concord, CA, United States
    [job_card.part_time]
    Performs and demonstrates competency, proficiency and understanding of the test procedures in the Clinical Laboratory up to and including high complexity testing. Medical Laboratory Technology / Clini...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Founding ML Scientist (Hayward)

    Founding ML Scientist (Hayward)

    Greylock • Hayward, CA, US
    [job_card.full_time] +1
    Were looking for an Applied RS who can run the gamut of ML from infra to modeling and own the entire ML pipeline taking advanced models into production. Our ideal candidate will be looking to build...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Research Data Scientist

    Research Data Scientist

    InsideHigherEd • Stanford, California, United States
    [job_card.full_time] +1
    Dean of Research, Stanford, California, United States.Research📅Dec 13, 2024 Post Date📅105424 Requisition #This is a 3-year fixed term appointment. This position is part of a new initiative i...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Research Scientist I (Intern) United States

    AI Research Scientist I (Intern) United States

    Cisco Systems, Inc. • San Jose, CA, United States
    [job_card.full_time]
    Please note this posting is to advertise potential job opportunities.This exact role may not be open today but could open in the near future. When you apply, a Cisco representative may contact you d...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Microbiology Technician / Associate Scientist

    Microbiology Technician / Associate Scientist

    Joulé • Pleasanton, CA, US
    [job_card.full_time]
    Job Title : Microbiology Technician / Associate Scientist Location : Pleasanton, California Type : Contract Contractor Work Model : Onsite Hours : 8 : 00 to 4 : 30 or 8 : 30 to 5 : 00 Monday to Friday Responsibil...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Data Scientist - Open on W2 only

    Data Scientist - Open on W2 only

    Dataflix • San Jose, CA, United States
    [filters.remote]
    [job_card.full_time]
    We are seeking a highly motivated and detail-oriented Data Scientist to join our team.In this role, you will be responsible for analyzing large datasets to uncover insights, develop predictive mode...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Head of Product Governance and Data Analytics

    Head of Product Governance and Data Analytics

    Allspring Global Investments • Walnut Creek, CA, United States
    [job_card.full_time]
    Work where your ideas have impact.Allspring Global Investments is a leading independent asset management firm that offers a broad range of investment products and solutions designed to help meet cl...[show_more]
    [last_updated.last_updated_1_day] • [promoted]
    Senior Research Engineer (Hayward)

    Senior Research Engineer (Hayward)

    Harrison Clarke • Hayward, CA, United States
    [job_card.full_time]
    A fast-growing, deeply technical AI company is looking for a.This is an opportunity to work at the frontier of AI, helping design and evaluate models that can understand, write, and reason about co...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Manager, REMS Data Programmer

    Senior Manager, REMS Data Programmer

    Jazz Pharmaceuticals • Fremont, California, USA
    [job_card.full_time]
    If you are a current Jazz employee please apply via the Internal Career site.Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Applied Data Scientist (PhD)

    Staff Applied Data Scientist (PhD)

    Jerry • Palo Alto, California, United States
    [job_card.full_time]
    Jerry is disrupting the car industry.With our super app, we are ushering in a new era for 300M drivers in the US and radically improving the consumer automotive experience.Architect complex deep le...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Sr Research Data Scientist

    Sr Research Data Scientist

    Roku • San Jose, California, United States
    [job_card.full_time]
    Teamwork makes the stream work.Roku is changing how the world watches TV.Roku is the #1 TV streaming platform in the U.Canada, and Mexico, and we've set our sights on powering every television in t...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Infectious Disease Physician - San Francisco Bay Area

    Infectious Disease Physician - San Francisco Bay Area

    Privia Health, LLC • Walnut Creek, US
    [job_card.full_time]
    We are currently looking for a BC / BE .The incoming provider will work alongside 9 physicians, 1 nurse practitioner, 2 registered nurses, and a tenured support staff, ensuring .Our physici...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Infectious Disease Physician - San Francisco Bay Area

    Infectious Disease Physician - San Francisco Bay Area

    HealthEcareers - Client • Walnut Creek, CA, USA
    [job_card.full_time]
    We are currently looking for a BC / BE .The incoming provider will work alongside 9 physicians, 1 nurse practitioner, 2 registered nurses, and a tenured support staff, ensuring .In the hospital, our ...[show_more]
    [last_updated.last_updated_30] • [promoted]