Talent.com
Machine Learning Data Engineer - Systems & Retrieval
Zyphra • Palo Alto, California, United States
Posted 30 days ago
Job type: Full-time

Zyphra is an artificial intelligence company based in Palo Alto, California.

The Role:

As a Machine Learning Data Engineer - Systems & Retrieval, you will build and optimize the data infrastructure that fuels our machine learning systems. This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets, from raw web-scale data to enterprise document corpora. You’ll play a central role in architecting retrieval systems for LLMs and enabling scalable training and inference with clean, accessible, and secure data. You’ll have an impact across both research and product teams by shaping the foundation on which intelligent systems are trained and from which they retrieve and reason.

You’ll work across:

Design and implementation of distributed data ingestion and transformation pipelines

Building retrieval and indexing systems that support RAG and other LLM-based methods

Mining and organizing large unstructured datasets, both in research and production environments

Collaborating with ML engineers, systems engineers, and DevOps to scale pipelines and observability

Ensuring compliance and access control in data handling, with security and auditability in mind

Requirements:

Strong software engineering background with fluency in Python

Experience designing, building, and maintaining data pipelines in production environments

Deep understanding of data structures, storage formats, and distributed data systems

Familiarity with indexing and retrieval techniques for large-scale document corpora

Understanding of database systems (SQL and NoSQL), their internals, and performance characteristics

Strong attention to security, access controls, and compliance best practices (e.g., GDPR, SOC 2)

Excellent debugging, observability, and logging practices to support reliability at scale

Strong communication skills and experience collaborating across ML, infra, and product teams

Bonus Skill Set:

Experience building or maintaining LLM-integrated retrieval systems (e.g., RAG pipelines)

Academic or industry background in data mining, search, recommendation systems, or IR literature

Experience with large-scale ETL systems and tools like Apache Beam, Spark, or similar

Familiarity with vector databases (e.g., FAISS, Weaviate, Pinecone) and embedding-based retrieval

Understanding of data validation and quality assurance in machine learning workflows

Experience working on cross-functional infra and MLOps teams

Knowledge of how data infrastructure supports training pipelines, inference serving, and feedback loops

Comfort working across raw, unstructured data, structured databases, and model-ready formats

Why Work at Zyphra:

Our research methodology is to take grounded, methodical steps toward ambitious goals. Deep research and engineering excellence are equally valued

We strongly value new and crazy ideas and are very willing to bet big on them

We move as quickly as we can; we aim to keep the bar to impact as low as possible

We all enjoy what we do and love discussing AI

Benefits and Perks:

Comprehensive medical, dental, vision, and FSA plans

Competitive compensation and 401(k)

Relocation and immigration support on a case-by-case basis

On-site meals prepared by a dedicated culinary team; Thursday Happy Hours

In-person team in Palo Alto, CA, with a collaborative, high-energy environment

If you’re excited by the challenge of high-scale, high-performance data engineering in the context of cutting-edge AI, you’ll thrive in this role. Apply today!
