Talent.com
Research Engineer, Synthetic Data
Research Engineer, Synthetic DataCartesia • San Francisco, CA, United States
Research Engineer, Synthetic Data

Research Engineer, Synthetic Data

Cartesia • San Francisco, CA, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

About Cartesia

Our mission is to build the next generation of AI : ubiquitous, interactive intelligence that runs wherever you are. Today, not even the best models can continuously process and reason over a year-long stream of audio, video and text—1B text tokens, 10B audio tokens and 1T video tokens—let alone do this on-device.

We're pioneering the model architectures that will make this possible. Our founding team met as PhDs at the Stanford AI Lab, where we invented State Space Models or SSMs, a new primitive for training efficient, large-scale foundation models. Our team combines deep expertise in model innovation and systems engineering paired with a design-minded product engineering team to build and ship cutting edge models and experiences.

We're funded by leading investors at Index Ventures and Lightspeed Venture Partners, along with Factory, Conviction, A Star, General Catalyst, SV Angel, Databricks and others. We're fortunate to have the support of many amazing advisors, and 90+ angels across many industries, including the world's foremost experts in AI.

The Role

The future of AI training will be built on a foundation of high-quality synthetic data. We are looking for a creative and resourceful Synthetic Data Specialist to design and build the systems that generate training data at an unprecedented scale. This is a unique, high-impact role, where you will solve critical data bottlenecks and directly accelerate our research progress.

What you’ll do

Evaluate fidelity, diversity, and usefulness of synthetic data across LLMs, audio generation, and audio understanding.

Implement techniques for steering data generation to improve model intelligence through data and mitigate bias.

Build automated quality control systems to validate and filter generated data

Design synthetic datasets at large scale to develop model capabilities.

Stay on the cutting edge of research in synthetic data generation, data augmentation, and generative models.

What we’re looking for

Experience with generative models (speech, text, or multimodal).

Strong applied ML background with a focus on data-centric approaches.

Understanding of evaluation methods for synthetic data quality.

Excitement for building scalable systems that bridge research and production.

Familiarity with building large-scale distributed systems for synthetic data generation

Our culture

🏢 We’re an in-person team based out of San Francisco. We love being in the office, hanging out together and learning from each other everyday.

🚢 We ship fast. All of our work is novel and cutting edge, and execution speed is paramount. We have a high bar, and we don’t sacrifice quality and design along the way.

🤝 We support each other. We have an open and inclusive culture that’s focused on giving everyone the resources they need to succeed.

Our perks

🍽 Lunch, dinner and snacks at the office.

🏥 Fully covered medical, dental, and vision insurance for employees.

🏦 401(k).

✈️ Relocation and immigration support.

🦖 Your own personal Yoshi.

#J-18808-Ljbffr

[job_alerts.create_a_job]

Research Engineer • San Francisco, CA, United States

[internal_linking.similar_jobs]
Research Engineer, AI Discovery & Infra

Research Engineer, AI Discovery & Infra

Anthropic • San Francisco, CA, United States
[job_card.full_time]
A prominent AI research organization seeks a Research Engineer to work on large-scale infrastructure for AI systems.This role involves designing systems, optimizing pipelines, and enhancing perform...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Research Engineer

Research Engineer

Jobright.ai • San Francisco, CA, United States
[job_card.full_time]
Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features.Jobright is an AI-powered career platform that helps job seekers discover the top opportunities in the...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Research Engineer

Senior Research Engineer

Far.Ai • Berkeley, California, USA
[job_card.full_time]
AI research institute dedicated to ensuring advanced AI is safe and beneficial for everyone.Our mission is to facilitate breakthrough AI safety research advance global understanding of AI risks and...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Research Engineer

Research Engineer

Appliedcompute • San Francisco, CA, United States
[job_card.full_time]
Applied Compute builds Specific Intelligence for enterprises, unlocking the knowledge inside a company to train custom models and deploy an in-house agent workforce. Today’s state-of-the-art AI isn’...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Research Engineer, Data Ingestion

Research Engineer, Data Ingestion

The Rundown AI, Inc. • San Francisco, CA, United States
[job_card.full_time]
We are looking for an experienced Research Engineer to join the Data Ingestion team, which owns the problem of acquiring all of the available data on the internet through a large scale web crawler....[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Applied Research Engineer

Applied Research Engineer

Labelbox • San Francisco, California, USA
[job_card.full_time]
At Labelbox were building the critical infrastructure that powers breakthrough AI models at leading research labs and enterprises. Since 2018 weve been pioneering data-centric approaches that are fu...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
AI Research Engineer, Enterprise Evaluations

AI Research Engineer, Enterprise Evaluations

Scale AI, Inc. • San Francisco, CA, United States
[job_card.full_time]
Scale AI is seeking a technically rigorous and driven.This high-impact role is critical to our mission of delivering the industry's leading. You will be a hands-on contributor to the core systems th...[show_more]
[last_updated.last_updated_30] • [promoted]
Technical Solutions Engineer

Technical Solutions Engineer

Lancesoft INC • San Bruno, CA, US
[job_card.full_time]
Job Title : Technical Solutions Engineer.Location : Hybrid 3x / week onsite in our San Bruno, CA 94066 OR Dallas (Coppell), TX 75019. Duration : 06 Months (Possible extension or...[show_more]
[last_updated.last_updated_30] • [promoted]
Research Engineer

Research Engineer

Generalagents • San Francisco, CA, United States
[job_card.full_time]
General Agents is an applied research lab exploring the frontiersof autonomous intelligence.Our mission is to liberate humanity from digital labor. We are a team of researchers, engineers, and opera...[show_more]
[last_updated.last_updated_30] • [promoted]
Research Engineer

Research Engineer

Decagon • San Francisco, California, United States
[job_card.full_time]
Decagon is building the most advanced conversational AI agents for the enterprise.Since starting the company, we've been on a tear, winning over customers like. Duolingo, Notion, Rippling, Eventbrit...[show_more]
[last_updated.last_updated_30] • [promoted]
Research Engineer

Research Engineer

gamma.app • San Francisco, CA, United States
[job_card.full_time]
We're building the creative layer for modern communication.Every month, over a billion people make presentations — but the tools they use to make them haven't evolved in decades.We're changing that...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Research Engineer

Research Engineer

Magic • San Francisco, CA, United States
[job_card.full_time]
Get AI-powered advice on this job and more exclusive features.Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems.We believe the most pr...[show_more]
[last_updated.last_updated_30] • [promoted]
Research Engineer - ML / Systems

Research Engineer - ML / Systems

Epsilon • San Francisco, CA, United States
[job_card.full_time]
We're tackling one of healthcare's most critical challenges in medical imaging and diagnostics.Our company operates at the intersection of cutting‑edge AI and clinical practice, building technology...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Research Engineer

Research Engineer

Anyscale • San Francisco, California, United States
[job_card.full_time]
Ray in their tech stacks to accelerate the progress of AI applications out into the real world.With Anyscale, we’re building the best place to run Ray, so that any developer or data scientist can s...[show_more]
[last_updated.last_updated_30] • [promoted]
Research Engineer – Synthetic Data for Vision

Research Engineer – Synthetic Data for Vision

Sesame • San Francisco, CA, United States
[job_card.full_time]
Sesame believes in a future where computers are lifelike - with the ability to see, hear, and collaborate with us in ways that feel natural and human. With this vision, we're designing a new kind of...[show_more]
[last_updated.last_updated_30] • [promoted]
Data Adaptation Research Engineer

Data Adaptation Research Engineer

Adaption • San Francisco, CA, United States
[job_card.full_time]
Data Adaptation Research Engineer.About Us : We believe the future is adaptable, and not one-size-fits-all.We will lead in real-time efficient adaptation that combines algorithm with innovative inte...[show_more]
[last_updated.last_updated_30] • [promoted]
Research Engineer / Scientist, Trustworthy AI

Research Engineer / Scientist, Trustworthy AI

OpenAI • San Francisco, CA, United States
[job_card.full_time]
The Safety Systems team is responsible for various safety work to ensure our best models can be safely deployed to the real world to benefit society and is at the forefront of OpenAI's mission to b...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Founding Research Engineer

Founding Research Engineer

The LLM Data Company • San Francisco, CA, United States
[job_card.full_time]
The LLM Data Company (YC X25) provides post-training data and RL environments to foundation model labs and frontier applied AI companies. Tier 1 VCs and are growing 200%+ month-over-month.Design and...[show_more]
[last_updated.last_updated_variable_days] • [promoted]