Talent.com
Tech Lead, Data & Inference Engineer
Catalyst Labs • Greenwich, Connecticut, USA

Our Client

A fast-moving, venture-backed advertising technology startup based in San Francisco. They have raised $12 million in funding and are transforming how business-to-business marketers reach their ideal customers. Their identity resolution technology blends business and consumer signals to convert static audience lists into high-match, cross-channel segments without the use of cookies. By transforming first-party and third-party data into precisely targetable audiences across platforms such as Meta, Google, and YouTube, they enable marketing teams to achieve higher match rates, reduce wasted advertising spend, and accelerate pipeline growth. With a strong understanding of how business buyers behave in channels that have traditionally focused on business-to-consumer activity, they are redefining how business brands scale demand generation and account-based efforts.

About Us

Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science. We stand out as an agency that's deeply embedded in our clients' recruitment operations.

We collaborate directly with Founders, CTOs, and Heads of AI in these areas who are driving the next wave of applied intelligence, from model optimization to productized AI workflows. We take pride in facilitating conversations that align with your technical expertise, creative problem-solving mindset, and long-term growth trajectory in the evolving world of intelligent systems.

Location : San Francisco

Work type : Full Time

Compensation : Above-market base, bonus, and equity

Roles & Responsibilities

Lead the design, development, and scaling of an end-to-end data platform, from ingestion to insights, ensuring that data is fast, reliable, and ready for business use.

Build and maintain scalable batch and streaming pipelines, transforming diverse data sources and third-party application programming interfaces (APIs) into trusted, low-latency systems.

Take full ownership of reliability, cost, and service level objectives. This includes achieving 99.9% uptime, maintaining minutes-level latency, and optimizing cost per terabyte. Conduct root cause analysis and provide long-lasting solutions.

Operate inference pipelines that enhance and enrich data. This includes enrichment, scoring, and quality assurance using large language models and retrieval-augmented generation. Manage version control, caching, and evaluation loops.

Work across teams to deliver data as a product through the creation of clear data contracts, ownership models, lifecycle processes, and usage-based decision making.

Guide architectural decisions across the data lake and the entire pipeline stack. Document lineage, trade-offs, and reversibility while making practical decisions on whether to build internally or buy externally.

Scale integration with APIs and internal services while ensuring data consistency, high data quality, and support for both real-time and batch-oriented use cases.

Mentor engineers, review code, and raise the overall technical standard across teams. Promote data-driven best practices throughout the organization.

Qualifications

Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or Mathematics.

Excellent written and verbal communication; proactive and collaborative mindset.

Comfortable in hybrid or distributed environments with strong ownership and accountability.

A founder-level bias for action: able to identify bottlenecks, automate workflows, and iterate rapidly based on measurable outcomes.

Demonstrated ability to teach, mentor, and document technical decisions and schemas clearly.

Core Experience

6 to 12 years of experience building and scaling production-grade data systems, with deep expertise in data architecture, modeling, and pipeline design.

Expert SQL (query optimization on large datasets) and Python skills.

Hands-on experience with distributed data technologies (Spark, Flink, Kafka) and modern orchestration tools (Airflow, Dagster, Prefect).

Familiarity with dbt, DuckDB, and the modern data stack; experience with IaC, CI/CD, and observability.

Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or Azure).

Bonus : Strong skills for faster onboarding and system integration.

Previous experience at a high-growth startup (10 to 200 people) or early-stage environment with a strong product mindset.

Key Skills

Apache Hive, S3, Hadoop, Redshift, Spark, AWS, Apache Pig, NoSQL, Big Data, Data Warehouse, Kafka, Scala

Employment Type : Full Time

Experience : years

Vacancy : 1
