Talent.com
Research Scientist - Data (Hayward)
Research Scientist - Data (Hayward)Storm3 • Hayward, CA, US
Research Scientist - Data (Hayward)

Research Scientist - Data (Hayward)

Storm3 • Hayward, CA, US
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.part_time]
[job_card.job_description]

Research Scientist - Data focus

💊 Foundation Models, AI Research Institute

🌎 San Francisco Bay Area, USA

💸 $200,000 - $350,000 salary + bonus

Come join a revolutionary AI research lab in SF Bay Area that is poised to develop & publish high-impact breakthroughs in GenAI - across LLMs and Multimodal AI.

As part of the team, youll work at the intersection of data, large-scale training, and foundation model innovation. You will collaborate with world-class researchers, data scientists, and engineers to solve critical challenges in creating robust, scalable, and reasoning-capable LLMs. Your research will shape the way data is curated, processed, and leveraged to train the next generation of intelligent systems.

Responsibilities :

  • Lead research on data-centric approaches for LLMs , including pretraining corpus design, data valuation, and speculative decoding strategies.
  • Develop pipelines to process challenging data sources into structured and reproducible training datasets.
  • Build and optimize agentic data pipelines , integrating retrieval, self-curation, and multi-agent feedback for high-quality training and evaluation data.
  • Collaborate with researchers on alignment and reasoning-focused training that leverage data-driven approaches for improving LLM capabilities.
  • Prototype and deploy evaluation frameworks to measure data quality, coverage, and downstream impact on LLM reasoning.
  • Publish findings at top-tier venues (e.g., NeurIPS, ICLR, ACL, EMNLP) and represent the institute at international conferences.
  • Contribute to open-source tools, datasets, and benchmarks that advance the global foundation model research community.

Requirements :

  • Masters degree in Computer Science, Data Science, or a related technical field (PhD strongly preferred)
  • Experience collecting and curating high-quality text data including multi-lingual data.
  • Hands-on experience with large-scale dataset curation and preprocessing for ML / LLM training.
  • Prior works synthesizing complex datasets. Code, math, and agentic data are higher priority
  • Experience with ML infrastructure for scalable training, evaluation, and debugging .
  • Experience at the intersection of data and post-training (RL / SFT)
  • Proven ability to independently drive research questions related to data quality, scaling, or reasoning .
  • Preferred Experience :

  • Experience with retrieval-augmented generation (RAG) , agentic data pipelines, or reasoning benchmarks.
  • Contributions to speculative decoding, self-curation, or reinforcement learning from synthetic data .
  • Background in knowledge graphs, semantic search, or indexing systems .
  • Strong publication record in leading AI conferences.
  • Prior contributions to open-source ML data tools or benchmarks .
  • Prior work on speculative decoding / contributions to LLM serving engines
  • Prior work on training LLM-as-a-judge
  • Deep expertise with tokenization / training tokenizers
  • Why apply :

  • Opportunity to build out a new division at the forefront of AI innovation
  • FAANG competitive salary & package
  • Work alongside superstars from FAANG labs & leading AI companies
  • Medical, Dental and Vision Insurance
  • Relocation package available
  • 🌎 San Francisco Bay Area, USA

    📧 Interested in applying? Please click on the Easy Apply button or alternatively email me your resume at stefani.lukic@storm3.com

    [job_alerts.create_a_job]

    Research Scientist • Hayward, CA, US

    [internal_linking.related_jobs]
    Research Scientist (Hayward)

    Research Scientist (Hayward)

    Goliath Partners • Hayward, CA, United States
    [job_card.full_time]
    We're working with a San Francisco client that's got a research team of 50~ professionals and looking to further expand it. They are specifically looking to flesh out their Research Group by hiring ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Scientist (Hayward)

    Data Scientist (Hayward)

    Hanalytica GmbH • Hayward, CA, US
    [job_card.part_time]
    Data Scientist (San Francisco, CA).Our client is seeking a highly motivated and skilled Data Scientist to join a fast-paced, agile team focused on applying the latest advancements in artificial int...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Data Scientist (Hayward)

    Staff Data Scientist (Hayward)

    Quantix Search • Hayward, CA, US
    [job_card.part_time]
    Staff Data Scientist | San Francisco | $250K$300K + Equity.Were partnering with one of the fastest-growing AI companies in the world to hire a Staff Data Scientist. Backed by over $230M from top-tie...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Data Scientist

    Senior Data Scientist

    Waymo • Mountain View, CA, United States
    [job_card.full_time]
    Waymo is an autonomous driving technology company with the mission to be the most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Wa...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Data Science AI Modeler (Life Sciences Biotech) (Hayward)

    Data Science AI Modeler (Life Sciences Biotech) (Hayward)

    Vida Group International • Hayward, CA, United States
    [job_card.full_time]
    The heart of our Biotech client is at the forefront of biotechnology, leveraging cutting-edge artificial intelligence and machine learning technologies to revolutionize drug discovery and developme...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    research scientist - RL (Hayward)

    research scientist - RL (Hayward)

    Cerebro • Hayward, CA, US
    [job_card.part_time]
    Join a Leading Applied Research Lab Pushing the Boundaries of Reinforcement Learning.Are you passionate about advancing the frontiers of. Develop novel optimization-based methods.Drive your own rese...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Founding Data Scientist (Hayward)

    Founding Data Scientist (Hayward)

    Intelletec • Hayward, CA, United States
    [job_card.full_time]
    Fast-growing AI healthcare startup.Hands-on analytics and modeling : .Strong statistical modeling and ML knowledge (scikit-learn, StatsModels, PyTorch). Comfortable with ambiguity, able to own end-to-...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Founding ML Scientist (Hayward)

    Founding ML Scientist (Hayward)

    Greylock • Hayward, CA, US
    [job_card.full_time] +1
    Were looking for an Applied RS who can run the gamut of ML from infra to modeling and own the entire ML pipeline taking advanced models into production. Our ideal candidate will be looking to build...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Data Scientist, Research - TikTok

    Senior Data Scientist, Research - TikTok

    TikTok • San Jose, CA, United States
    [job_card.full_time]
    Senior Data Scientist, Research – TikTok.Be among the first 25 applicants.Design TikTok’s online decision‑making mechanism and optimize experimental frameworks. Develop quantitative evaluation metho...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Research Data Scientist

    Research Data Scientist

    InsideHigherEd • Stanford, California, United States
    [job_card.full_time] +1
    Dean of Research, Stanford, California, United States.Research📅Dec 13, 2024 Post Date📅105424 Requisition #This is a 3-year fixed term appointment. This position is part of a new initiative i...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Research Data Scientist, Central Operations Analytics

    Senior Research Data Scientist, Central Operations Analytics

    Google Inc. • Mountain View, CA, United States
    [job_card.full_time]
    Senior Research Data Scientist, Central Operations Analytics.Google place Mountain View, CA, USA.Master's degree in statistics, data science, mathematics, physics, economics, operations research, e...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Research Scientist (Hayward)

    Research Scientist (Hayward)

    kadence • Hayward, CA, United States
    [job_card.full_time]
    We are a seed-stage AI company building the industry standard for evaluating and benchmarking large language models on real enterprise tasks. As a Research Scientist, you will develop new benchmarks...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Research Engineer (Hayward)

    Senior Research Engineer (Hayward)

    Harrison Clarke • Hayward, CA, United States
    [job_card.full_time]
    A fast-growing, deeply technical AI company is looking for a.This is an opportunity to work at the frontier of AI, helping design and evaluate models that can understand, write, and reason about co...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Data Scientist - Post Sales (Hayward)

    Staff Data Scientist - Post Sales (Hayward)

    Harnham • Hayward, CA, United States
    [job_card.full_time]
    Staff Data Scientist Post Sales.This fast-growing Series E AI SaaS company is redefining how modern engineering teams build and deploy applications. Were expanding our data science organization to ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Manager, REMS Data Programmer

    Senior Manager, REMS Data Programmer

    Jazz Pharmaceuticals • Fremont, California, USA
    [job_card.full_time]
    If you are a current Jazz employee please apply via the Internal Career site.Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior AI Research Engineer (Hayward)

    Senior AI Research Engineer (Hayward)

    Deep Abacus • Hayward, CA, US
    [job_card.part_time]
    VC-backed Series B startup on a high-growth trajectory.Revolutionizing commerce through direct engagement with powerful conversation AI. Join a world-class team with opportunity for rapid career adv...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Research Scientist - Data

    Research Scientist - Data

    Storm3 • Fremont, CA, United States
    [job_card.full_time]
    Research Scientist - Data focus.Foundation Models, AI Research Institute.Come join a revolutionary AI research lab in SF Bay Area that is poised to develop & publish high-impact breakthroughs in Ge...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Product Data Scientist, Marketplace

    Senior Product Data Scientist, Marketplace

    Futureshaper.com • Mountain View, CA, United States
    [job_card.full_time]
    Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]