Talent.com
Principal Data Engineer
Principal Data EngineerSanas • Palo Alto, CA, United States
Principal Data Engineer

Principal Data Engineer

Sanas • Palo Alto, CA, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Sanas.ai is pioneering the future of human communication. Founded by a team of Stanford researchers and entrepreneurs with deep industry experience, Sanas has developed the world’s first real-time speech transformation platform capable of accent translation, noise elimination, speech enhancement, and cross-language communication.

Sanas makes conversations clearer, more inclusive, and more effective, removing barriers that prevent people from being understood, regardless of accent, background noise, or native language.

Since going to market in 2023, Sanas has scaled at an extraordinary pace, growing from $0 to $32M ARR in under two years, with a projected >

$50M ARR by the end of 2025. The company recently recorded its first $10M quarter and is on track to achieve $120M in ARR next year. With a SaaS-based model, Sanas serves some of the world’s largest enterprises, including Comcast, UPS, UHG. Today, Sanas technology is deployed across >

17 of the Fortune 500 and continuing to accelerate growth.

The company’s valuation has a clear trajectory toward multi-billion-dollar market capitalization as it continues to expand into new verticals and product categories. With a TAM that spans all human in the loop communications and beyond, Sanas has the potential to impact every industry and every global interaction.

Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity. Pioneered by seasoned startup founders with a proven track record of creating and steering multiple unicorn companies, our groundbreaking GDP-shifting technology sets a gold standard.

Sanas is a 200-strong team, established in 2020. In this short span, we’ve successfully secured over $100 million in funding. Our innovation has been supported by the industry’s leading investors, including Insight Partners, Google Ventures, Quadrille Capital, General Catalyst, Quiet Capital, and other influential investors. Our reputation is further solidified by collaborations with numerous Fortune 100 companies. With Sanas, you’re not just adopting a product; you’re investing in the future of communication.

Weʼre looking for an experienced and forward-thinking Principal Data Engineer to lead the design and implementation of our end-to-end data infrastructure for industry leading Voice AI products. This is a high impact role where you will shape the technical vision, own strategic architecture decisions, and mentor a growing team of Data engineers focused on delivering reliable and scalable data systems for Machine Learning at scale.

Youʼll work cross-functionally with AI research scientists, Infrastructure and product teams to ensure that data - from raw audio to training-ready features - is consistently accessible, compliant and optimized for speed and scale. Youʼll help push the boundaries of real-time Voice AI!

Key Responsibilities :

  • Architect and lead the development of large scale data pipelines and data lakes to ingest, transform and serve high quality data for AI model training, product telemetry and analytics.
  • Drive long‑term data infrastructure strategy across streaming and batch, feature store extensions, Iceberg / Delta lake choices, metadata management, and lakehouse evolution.
  • Drive platform and infrastructure decisions, optimizing compute fleets (e.g.Ray, Spark clusters), orchestration tooling Airflow, Dagster), and streaming stacks Kafka, Flink)
  • Collaborate with AI research scientists, engineering leads, product, finance, marketing, and legal to align data architecture with business and regulatory requirements.
  • Advocate best practices in data governance, lineage, observability, testing, tooling, and disaster recovery across pipelines and data stores.
  • Act as a mentor and technical leader - review design and code, share patterns, elevate team capability, and support recruitment and hiring
  • Drive build vs buy decisions for tools to implement data quality and observability solutions to achieve high data quality.

Qualifications :

  • 10+ years of experience in Data Engineering, Infrastructure, or ML Systems, with at least 2+ years in a technical leadership capacity.
  • Expertise in building distributed batch and real-time data systems
  • Expertise in Databases (like Postgres) and Data Lakes (like Snowflake, Databricks and ClickHouse
  • Experience using Data Processing frameworks like Spark, Flink and Ray
  • Deep Experience with cloud platforms AWS / GCP, object storage (e.g., S3), and orchestrators like Airflow and Dagster
  • Strong knowledge of data lifecycle management, including privacy, security, compliance and reproducibility
  • Comfortable working in a fast-paced startup environment
  • Strategic mindset and proven ability to collaborate across engineering, ML and product teams to deliver infrastructure that scales with the business.
  • Nice to Have :

  • Familiarity with audio data and its unique challenges, like large file sizes, time- series features, metadata handling, is a strong plus
  • Experience with Voice AI models like ASR, TTS and speaker verification.
  • Familiarity with real-time data processing frameworks like Kafka, Flink, Druid and Pinot
  • Familiarity with ML workflows including : MLOps, feature engineering, model training and inference.
  • Experience with labeling tools, audio annotation platforms, or human-in-the- loop annotation pipelines.
  • The pay range for this role is :

    250,000 - 350,000 USD per year (Palo Alto Office)

    #J-18808-Ljbffr

    [job_alerts.create_a_job]

    Principal Data Engineer • Palo Alto, CA, United States

    [internal_linking.related_jobs]
    5027 Senior Principal Engineer R&D

    5027 Senior Principal Engineer R&D

    Pratum Companies • San Jose, CA, United States
    [job_card.full_time]
    Working out of TAEC’s San Jose Storage Design Center, the selected applicant will work as a member of an R&D team on Hard Disk Drive related projects. The work environment involves developing leadin...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff / Principal Data Platform & Analytics Engineer

    Staff / Principal Data Platform & Analytics Engineer

    PROCEPT BioRobotics • San Jose, California, United States, 95101
    [job_card.full_time]
    Staff / Principal Data Platform & Analytics Engineer.Embark on an enriching journey with PROCEPT BioRobotics, where our vision, mission, and values guide everything we do as a company.At PROCEPT, we ...[show_more]
    [last_updated.last_updated_variable_days]
    Data Engineer | 2025PX05009 | 477|DB-26453

    Data Engineer | 2025PX05009 | 477|DB-26453

    Mindverse Consulting Services • Mountain View, California, United States
    [filters.remote]
    [job_card.full_time]
    We are looking for experienced contract data / software engineer contractors to support the Multi-Cloud Efficiency (MCE) team in scaling cost attribution infrastructure and improving financial visibi...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Data Engineer

    Data Engineer

    Midjourney • San Jose, CA, United States
    [job_card.full_time]
    Midjourney is a research lab exploring new mediums to expand the imaginative powers of the human species.We are a small, self-funded team focused on design, human infrastructure, and AI.We have no ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Principal Data Engineer

    Principal Data Engineer

    Uber • Sunnyvale, California, United States
    [job_card.full_time]
    About the Role This is a Technical Data Leader position.The Data Engineering team focuses on building core Business Intelligence and Data Solutions for multiple business verticals at Uber, like Ube...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Principal Engineer - AI Infrastructure Abstractions

    Principal Engineer - AI Infrastructure Abstractions

    Diversity Talent Scouts • San Jose, CA, US
    [job_card.full_time]
    Principal AI Infrastructure Abstraction Engineer.AI compute environments scalable, secure, and developer-friendly.Your work will focus on creating abstractions that hide hardware complexity while p...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Principal Software Engineer

    Principal Software Engineer

    Supermicro • San Jose, CA, United States
    [job_card.full_time]
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Cisco Systems, Inc. • San Jose, CA, United States
    [job_card.full_time]
    We are an agile team with a startup feel and a strong bias for action.We move fast, embrace failure as part of the process, and stay focused on solving real-world problems for defenders on the fron...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Engineer - Open on W2 only

    Data Engineer - Open on W2 only

    Dataflix • San Jose, CA, United States
    [filters.remote]
    [job_card.full_time]
    We are looking for a Data Engineer to build out and scale our Analytics platform.As a member of the team, you will be responsible for building and scaling a robust platform that will act as the dri...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Principal Software Engineer II - Elasticsearch - Query Engine, Database Internals

    Principal Software Engineer II - Elasticsearch - Query Engine, Database Internals

    Elastic • Mountain View, CA, United States
    [job_card.full_time]
    Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale - unleashing the potential of businesses and people.The Elastic Search AI...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Cisco Systems • San Jose, CA, United States
    [job_card.full_time]
    We are an agile team with a startup feel and a strong bias for action.We move fast, embrace failure as part of the process, and stay focused on solving real‑world problems for defenders on the fron...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Principal AI Engineer

    Principal AI Engineer

    TENEX.AI • San Jose, California, United States
    [job_card.full_time]
    TENEX is an AI-native, automation-first, built-for-scale Managed Detection and Response (MDR) provider.We are a force multiplier for defenders, helping organizations enhance their cybersecurity pos...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Lead Data Engineer

    Lead Data Engineer

    Midi Health • Palo Alto, California, United States
    [job_card.full_time]
    We're looking for a Lead Data Engineer to spearhead design, implementation, and iteration of a world-class, modern data infrastructure that will power all of analytics, data science, and ML / AI syst...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Data Engineer

    Data Engineer

    Balbix • San Jose, California, United States
    [job_card.full_time]
    The Balbix Security Cloud uses AI and automation to reinvent how the World's leading organizations reduce their cyber risk. With Balbix, security teams can accurately inventory their cloud and on-pr...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Data Engineer

    Data Engineer

    Institute Of Foundation Models • Sunnyvale, California, United States
    [job_card.full_time]
    About the Institute of Foundation Models.We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Principal Data Engineer

    Principal Data Engineer

    Sanas • Palo Alto, California, United States, 94301
    [job_card.full_time]
    Founded by a team of Stanford researchers and entrepreneurs with deep industry experience, Sanas has developed the world’s first real-time speech transformation platform capable of accent translati...[show_more]
    [last_updated.last_updated_30]
    Big Data Principal

    Big Data Principal

    VirtualVocations • Fremont, California, United States
    [job_card.full_time]
    A company is looking for a Big Data Principal.Key Responsibilities Design and develop data models using Data Vault methodologies Utilize Data Vault techniques to build enterprise data lakes and ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Lead Data Engineer (Hayward)

    Lead Data Engineer (Hayward)

    Mentor Talent Acquisition • Hayward, CA, US
    [job_card.part_time]
    Were looking for a Lead Data Engineer to spearhead the design, implementation, and iteration of a world-class, modern data infrastructure that powers analytics, data science, and ML / AI systems.You ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]