Talent.com
Research Scientist - Data (San Francisco)
Research Scientist - Data (San Francisco)Storm3 • San Francisco, CA, US
Research Scientist - Data (San Francisco)

Research Scientist - Data (San Francisco)

Storm3 • San Francisco, CA, US
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.part_time]
[job_card.job_description]

Research Scientist - Data focus

💊 Foundation Models, AI Research Institute

🌎 San Francisco Bay Area, USA

💸 $200,000 - $350,000 salary + bonus

Come join a revolutionary AI research lab in SF Bay Area that is poised to develop & publish high-impact breakthroughs in GenAI - across LLMs and Multimodal AI.

As part of the team, youll work at the intersection of data, large-scale training, and foundation model innovation. You will collaborate with world-class researchers, data scientists, and engineers to solve critical challenges in creating robust, scalable, and reasoning-capable LLMs. Your research will shape the way data is curated, processed, and leveraged to train the next generation of intelligent systems.

Responsibilities :

  • Lead research on data-centric approaches for LLMs , including pretraining corpus design, data valuation, and speculative decoding strategies.
  • Develop pipelines to process challenging data sources into structured and reproducible training datasets.
  • Build and optimize agentic data pipelines , integrating retrieval, self-curation, and multi-agent feedback for high-quality training and evaluation data.
  • Collaborate with researchers on alignment and reasoning-focused training that leverage data-driven approaches for improving LLM capabilities.
  • Prototype and deploy evaluation frameworks to measure data quality, coverage, and downstream impact on LLM reasoning.
  • Publish findings at top-tier venues (e.g., NeurIPS, ICLR, ACL, EMNLP) and represent the institute at international conferences.
  • Contribute to open-source tools, datasets, and benchmarks that advance the global foundation model research community.

Requirements :

  • Masters degree in Computer Science, Data Science, or a related technical field (PhD strongly preferred)
  • Experience collecting and curating high-quality text data including multi-lingual data.
  • Hands-on experience with large-scale dataset curation and preprocessing for ML / LLM training.
  • Prior works synthesizing complex datasets. Code, math, and agentic data are higher priority
  • Experience with ML infrastructure for scalable training, evaluation, and debugging .
  • Experience at the intersection of data and post-training (RL / SFT)
  • Proven ability to independently drive research questions related to data quality, scaling, or reasoning .
  • Preferred Experience :

  • Experience with retrieval-augmented generation (RAG) , agentic data pipelines, or reasoning benchmarks.
  • Contributions to speculative decoding, self-curation, or reinforcement learning from synthetic data .
  • Background in knowledge graphs, semantic search, or indexing systems .
  • Strong publication record in leading AI conferences.
  • Prior contributions to open-source ML data tools or benchmarks .
  • Prior work on speculative decoding / contributions to LLM serving engines
  • Prior work on training LLM-as-a-judge
  • Deep expertise with tokenization / training tokenizers
  • Why apply :

  • Opportunity to build out a new division at the forefront of AI innovation
  • FAANG competitive salary & package
  • Work alongside superstars from FAANG labs & leading AI companies
  • Medical, Dental and Vision Insurance
  • Relocation package available
  • 🌎 San Francisco Bay Area, USA

    📧 Interested in applying? Please click on the Easy Apply button or alternatively email me your resume at stefani.lukic@storm3.com

    [job_alerts.create_a_job]

    Research Scientist • San Francisco, CA, US

    [internal_linking.related_jobs]
    research scientist - RL (San Francisco)

    research scientist - RL (San Francisco)

    Cerebro • San Francisco, CA, US
    [job_card.part_time]
    Join a Leading Applied Research Lab Pushing the Boundaries of Reinforcement Learning.Are you passionate about advancing the frontiers of. Develop novel optimization-based methods.Drive your own rese...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Scientist (San Francisco)

    Data Scientist (San Francisco)

    Hanalytica GmbH • San Francisco, CA, US
    [job_card.part_time]
    Data Scientist (San Francisco, CA).Our client is seeking a highly motivated and skilled Data Scientist to join a fast-paced, agile team focused on applying the latest advancements in artificial int...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Research Scientist

    Research Scientist

    Yugal Tech Academy • San Francisco, CA, United States
    [job_card.full_time]
    The Scaling Laws group within Architecture works to advance our foundational understanding of our deep learning stack.We study the algorithmic scaling behavior across architecture, optimization, an...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Research Scientist - Data (San Francisco)

    Research Scientist - Data (San Francisco)

    Storm3 • San Francisco, CA, United States
    [job_card.full_time]
    Research Scientist - Data focus.Foundation Models, AI Research Institute.Come join a revolutionary AI research lab in SF Bay Area that is poised to develop & publish high-impact breakthroughs in Ge...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior RL Research Scientist (San Francisco)

    Senior RL Research Scientist (San Francisco)

    DeepRec.ai • San Francisco, CA, US
    [job_card.part_time]
    Senior RL Research Scientist / Reinforcement Learning Scientist.Join a frontier AI team building systems that can act in the physical world, experimenting, optimizing, and controlling real processe...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Scientist (San Francisco)

    Data Scientist (San Francisco)

    Skale • San Francisco, CA, United States
    [job_card.full_time]
    We're working with a Series A health tech start-up pioneering a revolutionary approach to healthcare AI, developing neurosymbolic systems that combine statistical learning with structured medical k...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Founding AI Research Scientist

    Founding AI Research Scientist

    DimRed • San Francisco, CA, United States
    [job_card.full_time]
    DimRed is building a platform to automate the most challenging parts of scaling AI Agents and LLM-based systems.We believe the path forward for AI systems is grounded in reliability and interpretab...[show_more]
    [last_updated.last_updated_30] • [promoted]
    AI-Driven Life Sciences Research Scientist

    AI-Driven Life Sciences Research Scientist

    anthropic • San Francisco, CA, United States
    [job_card.full_time]
    A leading AI research company in San Francisco is seeking a Research Scientist for its Life Sciences team.This position blends AI with biological research to drive scientific discovery.Ideal candid...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Research Scientist (San Francisco)

    Research Scientist (San Francisco)

    kadence • San Francisco, CA, US
    [job_card.part_time]
    We are a seed-stage AI company building the industry standard for evaluating and benchmarking large language models on real enterprise tasks. As a Research Scientist, you will develop new benchmarks...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Founding Data Scientist (San Francisco)

    Founding Data Scientist (San Francisco)

    Intelletec • San Francisco, CA, United States
    [job_card.full_time]
    Fast-growing AI healthcare startup.Hands-on analytics and modeling : .Strong statistical modeling and ML knowledge (scikit-learn, StatsModels, PyTorch). Comfortable with ambiguity, able to own end-to-...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Remote Research Scientist (San Francisco)

    Remote Research Scientist (San Francisco)

    Grant Thornton • San Francisco, CA, United States
    [filters.remote]
    [job_card.full_time] +1
    We are seeking a highly motivated and innovative.This role involves designing, conducting, and evaluating research projects that support the development of new scientific and technical solutions.Th...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Scientist, Care Intelligence (San Francisco, hybrid)

    Data Scientist, Care Intelligence (San Francisco, hybrid)

    Pomelo Care, Inc. • San Francisco, CA, United States
    [job_card.full_time]
    Data Scientist, Care Intelligence (San Francisco, hybrid).Pomelo Care is a multi-disciplinary team of clinicians, engineers and problem solvers who are passionate about improving care for moms and ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Scientist, Growth Hybrid - San Francisco

    Data Scientist, Growth Hybrid - San Francisco

    Grammarly, Inc. • San Francisco, CA, US
    [job_card.full_time]
    Superhuman offers a dynamic hybrid working model for this role.This flexible approach gives team members the best of both worlds : plenty of focus time along with in-person collaboration that helps ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior AI Research Scientist New San Francisco, California

    Senior AI Research Scientist New San Francisco, California

    Workato Inc • San Francisco, CA, United States
    [job_card.full_time]
    Workato transforms technology complexity into business opportunity.As the leader in enterprise orchestration, Workato helps businesses globally streamline operations by connecting data, processes, ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Research Data Scientist

    Research Data Scientist

    Tatari • San Francisco, CA, United States
    [job_card.full_time]
    Tatari is on a mission to revolutionize TV advertising.We work with disruptor brands—like Calm, Vuori, Rocket Money, and hundreds more—to grow their business using linear and streaming TV ads.Our p...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Research Scientist : AI Systems for Open‑Ended Discovery

    Research Scientist : AI Systems for Open‑Ended Discovery

    Intology • San Francisco, CA, United States
    [job_card.full_time]
    A leading research and development firm in San Francisco seeks a skilled individual to join their core R&D team focused on building end-to-end automated research systems. Ideal candidates will posse...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Research Scientist (San Francisco)

    Research Scientist (San Francisco)

    Goliath Partners • San Francisco, CA, United States
    [job_card.full_time]
    We're working with a San Francisco client that's got a research team of 50~ professionals and looking to further expand it. They are specifically looking to flesh out their Research Group by hiring ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Research Scientist I (Intern) United States

    AI Research Scientist I (Intern) United States

    Cisco • San Francisco, CA, United States
    [job_card.full_time]
    Please note this posting is to advertise potential job opportunities.This exact role may not be open today but could open in the near future. When you apply, a Cisco representative may contact you d...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]