Talent.com
Staff Data Engineer
Staff Data EngineerBlackbird.AI • US
Staff Data Engineer

Staff Data Engineer

Blackbird.AI • US
[job_card.1_day_ago]
[job_preview.job_type]
  • [job_card.full_time]
  • [filters.remote]
  • [filters_job_card.quick_apply]
[job_card.job_description]

Blackbird.AI helps organizations discover emergent threats and stay one step ahead of real-world harm through our AI-powered Narrative and Risk Intelligence Platform. Our commitment is to prioritize safety and security, providing the tools to identify potential risks and ensure a safer environment proactively. No matter the job or where it's located, we're all connected by a shared vision : To lead and enhance the landscape of risk intelligence.

As a Staff Data Engineer, you will play a critical role in architecting and scaling our data platform and AI / ML processing infrastructure. You'll be a technical leader responsible for our entire data ecosystem—from ingestion pipelines that process diverse data sources to the lakehouse architecture that powers our narrative analysis capabilities. You'll architect systems that seamlessly support batch and streaming data patterns while building real time alerting on generated insights.

You'll work at the intersection of data engineering, AI-powered data transformation, and platform engineering, making architectural decisions that will shape our ability to detect misinformation, disinformation, and narrative attacks at scale while managing costs effectively. A key aspect of this role involves building intelligent pipelines that use traditional AI and generative AI to cluster, enrich, classify, and extract insights from data as it flows through our system.

As a Staff Data Engineer you will :

  • Design and implement scalable data platform architecture on Databricks, supporting both batch and streaming ingestion
  • Build robust, fault-tolerant data ingestion pipelines that integrate with multiple third-party APIs and data providers
  • Design and implement AI-powered enrichment stages within pipelines—applying ML clustering, generative AI summarization, classification, and entity extraction to transform raw data into actionable intelligence
  • Build analytical systems with full-text search capabilities using Elasticsearch for rapid querying and analysis of enriched data
  • Work with AI / ML researchers to implement, integrate and scaling AI processing
  • Expose data platform capabilities as APIs and other interfaces for downstream consumption by applications and services
  • Optimize data lake and lakehouse architecture for performance, cost-efficiency, and scalability
  • Design and implement data quality frameworks, monitoring, and alerting systems
  • Design efficient architectures for calling external AI APIs and managing rate limits, costs, and reliability
  • Architect solutions with cost-efficiency as a first-class concern, implementing monitoring and optimization strategies for compute and storage
  • Make critical build-vs-buy decisions and establish architectural standards for the data organization
  • Mentor engineers and elevate the team's technical capabilities through code reviews, design discussions, and knowledge sharing

Requirements

  • 8+ years of software engineering experience with 5+ years focused on data platforms or data engineering
  • Deep expertise with Databricks, Apache Spark, and data lakehouse architectures
  • Strong experience building and operating data pipelines at scale (handling TBs+ of data)
  • Experience integrating AI / ML capabilities into data pipelines (clustering, LLM APIs, classification, summarization)
  • Proficiency in Python, DBT, and SQL for data processing and pipeline development
  • Experience with both batch and streaming large scale data processing patterns
  • Strong understanding of cloud platforms (AWS, Azure)
  • Excellent communication skills and ability to mentor engineers
  • Preferred Qualifications :

  • Experience designing both batch and streaming / near real-time data architectures
  • Proficiency with Elasticsearch for building analytical systems with full-text search capabilities
  • Hands-on experience with LLM APIs and understanding of rate limiting and cost optimization
  • Experience with Agentic AI, context engineering, and evaluation
  • Background in trust & safety, security, or content moderation domains
  • Experience with data observability tools and building comprehensive monitoring systems
  • Prior experience at a startup or fast-paced environment
  • Apply agentic coding tools for day to day development
  • Familiarity with Databricks' Lakeflow, Agent Bricks, and vector databases
  • What We Value :

  • Technical Excellence : You write clean, maintainable code and make thoughtful architectural decisions
  • Pragmatism : You balance perfection with shipping and know when to optimize vs. when "good enough" is sufficient
  • Ownership : You take end-to-end responsibility for your systems and their reliability
  • Collaboration : You elevate those around you and thrive in a team environment
  • Impact Orientation : You focus on outcomes and business value, not just technical elegance
  • Learning Mindset : You stay current with evolving technologies and continuously improve your craft
  • We've outlined specific skills, experience, and requirements for this position, but don't stress if you don't meet every single one. Our Talent Team is dedicated to discovering exceptional individuals, and they might identify a relevant aspect of your background that suits this role or another opportunity within Blackbird.AI.

    If you have passion for the role, please still apply.

    Benefits

  • Competitive compensation package, 401(k), and equity - everyone has a stake in our growth!
  • Comprehensive health benefits for you and your loved ones, including wellness days and monthly wellness reimbursements - an apple a day doesn't always keep the doctor away!
  • Generous vacation policy, encouraging you to take the time you need - we trust you to strike the right work / life balance!
  • A flexible work environment with opportunities to collaborate with your team in person - you can have it all!
  • Inclusion and Impact - soar to new heights!
  • Professional development stipend - never stop learning!
  • [job_alerts.create_a_job]

    Staff Data Engineer • US

    [internal_linking.similar_jobs]
    Staff Full Stack / AI Engineer

    Staff Full Stack / AI Engineer

    Pocket Prep • Remote, Remote, United States
    [job_card.full_time]
    We believe that education should be within everyone’s reach.Professional certification exams are often a stressful and expensive barrier to career advancement - Pocket Prep strives to prepare our m...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff AI Software Engineer

    Staff AI Software Engineer

    Agility Robotics • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Our robot, Digit, is the first to be sold into workplaces across the globe.Our team is differentiated by its expertise in imagining, engineering, and delivering robots with advanced mobility, dexte...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    AI Integrations Staff Engineer

    AI Integrations Staff Engineer

    Vetcove • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Join Vetcove and help modernize the future of veterinary software and the pet parent healthcare experience.Our suite of platforms features a market-leading procurement marketplace, an ultra-modern ...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Staff Software Engineer - AI SDK

    Staff Software Engineer - AI SDK

    Temporal Technologies • United States, United States, United States
    [job_card.full_time] +1
    Temporal is an open source programming model that can simplify code, make applications more reliable, and help developers focus on the important things like delivering features faster.Our amazing u...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff AI Engineer

    Staff AI Engineer

    You.com • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    AI-powered search and productivity platform designed to empower users with personalized, efficient, and trustworthy search experiences. As a cutting-edge technology company, we combine advanced AI m...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Machine Learning Engineer, Gen AI

    Staff Machine Learning Engineer, Gen AI

    Weave • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Weave is looking for engineers hungry for fun challenges who can join our self-empowered teams and contribute in both technical and non-technical ways. You will be joining a team of talented develop...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff Machine Learning Platform Engineer

    Staff Machine Learning Platform Engineer

    Forterra • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    At Forterra, machine learning is core to our business.Whether it's perception, behavior generation, navigation or automating business workflows, machine learning helps us drive towards our mission ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff ML Engineer, Dynamic World Perception

    Staff ML Engineer, Dynamic World Perception

    Stack Av • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Stack is developing revolutionary AI and advanced autonomous systems designed to enhance safety, reliability, and efficiency of modern operations. Stack's autonomous technology incorporates cutting-...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Staff Machine Learning Engineer, Search Platform

    Senior Staff Machine Learning Engineer, Search Platform

    Reddit • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Reddit is a community of communities.It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Staff Software Engineer - Observo AI

    Senior Staff Software Engineer - Observo AI

    Sentinelone • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Senior Staff Software Engineer.AI-driven data pipeline optimization platform.This role will be responsible for leading the architectural design and technical strategy for high-performance systems t...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Backend engineer

    Staff Backend engineer

    Jobot • US
    [job_card.full_time]
    An innovative AI-driven technology company is seeking a Staff Backend Engineer to help build the core infrastructure behind its real-time platform. This Jobot Job is hosted by : Tarek Hamzeh.Are you ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff AI Software Engineer and Technical Educator

    Staff AI Software Engineer and Technical Educator

    Udacity • United States, United States, United States
    [job_card.full_time]
    Udacity is now an Accenture company, and exciting things are happening! 🚀 We are on a mission of forging futures in tech through. We offer a unique and immersive online learning platform, powering ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Software Engineer II - Data

    Software Engineer II - Data

    Wati.io • Hong Kong, Hong Kong SAR, United States
    [job_card.full_time]
    WhatsApp-first conversational growth platform, empowering businesses to build deeper customer relationships and accelerate revenue growth. Trusted and loved by over 14,000 customers across 100+ coun...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Lead Data Engineer

    Lead Data Engineer

    Fusemachines • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Fusemachines is a leading AI strategy, talent, and education services provider.Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI.With a presenc...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff AI Product Engineer, Code

    Staff AI Product Engineer, Code

    Semgrep • Remote, Remote, United States
    [job_card.full_time]
    Semgrep is on a mission to make it expensive to exploit software.As the team behind the most popular SAST, we built the Semgrep AppSec Platform to deliver industry-leading code, dependency, and sec...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Full-Stack AI Engineer

    Full-Stack AI Engineer

    Coursera • United States, United States, United States
    [job_card.full_time]
    Coursera was launched in 2012 by Andrew Ng and Daphne Koller with a mission to provide universal access to world-class learning. It is now one of the largest online learning platforms in the world, ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    Niche • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Niche is the leader in school search.Our mission is to make researching and enrolling in schools easy, transparent, and free. With in-depth profiles on every school and college in America, 140 milli...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Staff Machine Learning Engineer, Relevance and Personalization

    Senior Staff Machine Learning Engineer, Relevance and Personalization

    Airbnb • United States, United States, United States
    [job_card.full_time]
    Join Airbnb’s Relevance and Personalization team, where you’ll have a unique opportunity to shape the discovery experience for over 150M global users! You’ll take the lead on projects that power se...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]