Talent.com
Data Scientist, Knowledge Graphs
Data Scientist, Knowledge GraphsMithrl Inc. • San Francisco, CA, United States
Data Scientist, Knowledge Graphs

Data Scientist, Knowledge Graphs

Mithrl Inc. • San Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

ABOUT MITHRL

We imagine a world where new medicines reach patients in months, not years, and where scientific breakthroughs happen at the speed of thought.

Mithrl is building the world’s first commercially available AI Co-Scientist. It is a discovery engine that transforms messy biological data into insights in minutes. Scientists ask questions in natural language, and Mithrl responds with real analysis, novel targets, hypotheses, and patent‑ready reports.

OUR TRACTION

12X year-over-year revenue growth

Trusted by leading biotechs and big pharma across three continents

Driving real breakthroughs from target discovery to patient outcomes.

ABOUT THE ROLE

We are hiring a Data Scientist, Knowledge Graphs to build and scale the biological knowledge layer that powers the Mithrl AI Co-Scientist. This role focuses on ingesting and harmonizing the world’s most important biological data sources and curating the relationships that allow our system to reason across pathways, targets, diseases, compounds, and multimodal datasets.

You will ingest data from public consortia and well maintained peer reviewed sources and unify them into a coherent, versioned knowledge graph. You will identify new node types, define relationship schemas, harmonize variable IDs, and ensure metadata remains consistent across all integrated sources. You will also build automated curation pipelines that expand and refine the knowledge graph using both data driven methods and domain logic.

Beyond ingestion and curation, you will create the tools and frameworks that allow users to interact with the knowledge graph and even build their own custom graphs based on the results they generate inside Mithrl. Your work will form the foundation for pathway reasoning, target scoring, evidence aggregation, and multimodal interpretation inside the AI Co-Scientist.

WHAT YOU WILL DO

Ingest, harmonize, and version high value public biological datasets such as CellxGene, Gemma, ARCHS4, ENCODE, GTEx, TCGA, etc.

Ingest well maintained peer reviewed knowledgebases including OpenTargets, HPA, and similar resources

Build automated pipelines to curate and expand relationships inside the knowledge graph

Define and evolve schemas for node types, relationships, metadata rules, and ontology alignment

Harmonize variable IDs and metadata fields across all imported sources to create a unified knowledge layer

Build and maintain versioning, change tracking, and provenance systems for all data and relationships

Develop the framework that allows users to build custom knowledge graphs from the analyses they run inside Mithrl

Build features that allow users to explore, query, and interact with their graphs

Work closely with ML engineers, bioinformatics teams, and discovery application teams to ensure the knowledge graph supports downstream reasoning and analysis

Validate the correctness, completeness, and integrity of the knowledge graph across releases

WHAT YOU BRING

Required Qualifications

Strong experience in data science, bioinformatics, computational biology, or a related field

Experience working with biological knowledgebases, public datasets, or ontology driven systems

Familiarity with graph data structures, relationship modeling, and knowledge graph concepts

Experience harmonizing heterogeneous biological datasets and mapping variable IDs across sources

Proficiency in Python and scientific computing libraries

Ability to build ingestion pipelines for structured or semi structured biological data

Strong understanding of metadata standards, biological ontologies, and domain logic

Ability to translate complex biological information into structured, machine readable representations

Excellent communication skills and comfort collaborating across engineering and scientific teams

Nice to Have

Experience with graph databases or graph query languages

Experience with KG curation, link prediction, relationship extraction, or graph based ML

Familiarity with multi modal data integration

Previous work on biological or chemical knowledge graphs

Experience with public consortia such as ENCODE, GTEx, TCGA, or ChEMBL, etc.

Prior experience in a tech bio startup or scientific software environment

WHAT YOU WILL LOVE AT MITHRL

You will build the core knowledge layer that the AI Co-Scientist uses to reason about biology

Team : Join a tight‑knit, talent-dense team of engineers, scientists, and builders

Culture : We value consistency, clarity, and hard work. We solve hard problems through focused daily execution

Speed : We ship fast (2x / week) and improve continuously based on real user feedback

Location : Beautiful SF office with a high‑energy, in‑person culture

Benefits : Comprehensive PPO health coverage through Anthem (medical, dental, and vision) + 401(k) with top‑tier plans

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

#J-18808-Ljbffr

[job_alerts.create_a_job]

Data Scientist • San Francisco, CA, United States

[internal_linking.similar_jobs]
Staff Data Scientist, Monetization & Growth

Staff Data Scientist, Monetization & Growth

Character.AI • Redwood City, CA, United States
[job_card.full_time]
A groundbreaking tech company is looking for a Staff Data Scientist to join their team in Redwood City.The ideal candidate will have over 7 years of experience in consumer-facing products, focusing...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Data Scientist, Compliance Technology

Senior Data Scientist, Compliance Technology

OKX • San Francisco, CA, United States
[job_card.full_time]
Senior Data Scientist, Compliance Technology.At OKX, we believe that the future will be reshaped by crypto, and ultimately contribute to every individual's freedom. OKX is a leading crypto exchange,...[show_more]
[last_updated.last_updated_30] • [promoted]
Data Scientist, Knowledge Graphs

Data Scientist, Knowledge Graphs

Mithrl • San Francisco, CA, US
[job_card.full_time]
We imagine a world where new medicines reach patients in months, not years, and where scientific breakthroughs happen at the speed of thought. Mithrl is building the world’s first commercially...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Zappos Data Scientist III, Zappos / Shopbop Catalog Engineering

Zappos Data Scientist III, Zappos / Shopbop Catalog Engineering

Amazon • San Francisco, CA, United States
[job_card.full_time]
As a Data Scientist on the Shopbop / Zappos Catalog Tech team, you will design and implement scientific approaches to revolutionize how we manage and enhance our product catalog data for our world‑cl...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Lead Growth Data Scientist for AI Search & Insights

Lead Growth Data Scientist for AI Search & Insights

Adobe Inc. • San Francisco, CA, United States
[job_card.full_time]
A leading technology company is seeking a Lead Growth Marketing Data Scientist to define success metrics in AI-powered search environments. This strategic role involves developing analytical framewo...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Scientist II

Data Scientist II

Earnest • San Francisco, California, United States
[job_card.full_time]
Our mission is to make higher education accessible and affordable for everyone.We empower students with financial support and supercharge their ability to pay down their debt, so they can get on th...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Scientist

Data Scientist

Menlo Ventures • San Francisco, CA, United States
[job_card.full_time]
Engineering, product, & design.Graphite builds consumer-quality tools for modern software engineering teams, so they can ship faster and create amazing products. Anyone can start using Graphite indi...[show_more]
[last_updated.last_updated_30] • [promoted]
Data Scientist 1

Data Scientist 1

Poshmark, Inc. • Redwood City, CA, United States
[job_card.full_time]
Poshmark is a leading fashion resale marketplace powered by a vibrant, highly engaged community of buyers and sellers and real-time social experiences. Designed to make online selling fun, more soci...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Data Scientist, Analytics - GTM Ads

Senior Data Scientist, Analytics - GTM Ads

King River Capital Group • San Francisco, CA, United States
[job_card.full_time]
Discord is used by over 200 million people every month for many different reasons, but there’s one thing that nearly everyone does on our platform : . Over 90% of our users play games, spending a comb...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Staff Data Scientist, Growth

Staff Data Scientist, Growth

Airwallex Pty Ltd. • San Francisco, CA, United States
[job_card.full_time]
Airwallex is the only unified payments and financial platform for global businesses.Powered by our unique combination of proprietary infrastructure and software, we empower over 200,000 businesses ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Scientist - Growth

Data Scientist - Growth

Pantera Capital • San Francisco, CA, United States
[job_card.full_time]
Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world’s leading AI platforms. Perplexity has raised over $1B in venture investment from some of t...[show_more]
[last_updated.last_updated_30] • [promoted]
Data Scientist, Creator Success

Data Scientist, Creator Success

Roblox • San Mateo, CA, United States
[job_card.full_time]
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers...[show_more]
[last_updated.last_updated_30] • [promoted]
Staff Data Scientist, Growth

Staff Data Scientist, Growth

Patreon • San Francisco, CA, United States
[job_card.full_time]
Patreon is the best place for creators to build memberships by providing exclusive access to their work and a deeper connection with their communities. We’re building a content and community platfor...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Data Scientist - Gusto 401(k)

Senior Data Scientist - Gusto 401(k)

Monograph • San Francisco, California, United States
[job_card.full_time]
About Gusto At Gusto, we're on a mission to grow the small business economy.We handle the hard stuff—like payroll, health insurance, 401(k)s, and HR—so owners can focus on their craft and customers...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Data Scientist

Data Scientist

Waymo • San Francisco, CA, United States
[job_card.full_time]
Waymo is an autonomous driving technology company with the mission to be the most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Wa...[show_more]
[last_updated.last_updated_30] • [promoted]
Data Scientist with Public Trust

Data Scientist with Public Trust

VirtualVocations • Oakland, California, United States
[job_card.full_time]
A company is looking for a Data Scientist / Analyst to support modernization efforts in their contact centers.Key Responsibilities Update and optimize business queries and data pulls to incorporate...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
Data Scientist II, Growth Marketing

Data Scientist II, Growth Marketing

Pinterest • San Francisco, California, United States
[job_card.full_time]
Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Scientist - Digital Measures

Data Scientist - Digital Measures

Verily • San Bruno, CA, United States
[job_card.full_time]
Who We Are • •Verily is a subsidiary of Alphabet that is using a data-driven approach to change the way people manage their health and the way healthcare is delivered. Launched from Google X in 2015, ...[show_more]
[last_updated.last_updated_30] • [promoted]